Hi,
I realized that the inference speed got around 40 seconds not, which is more than 3 times faster than before:
We ran the FHE inference over 10 examples and achieved 100% similar predictions between the simulation and FHE. The overall accuracy for the entire data-set is expected to match the simulation. The original model (no rounding) with a maximum of 13 bits of precision runs in around 9 hours on the specified hardware. Using the rounding approach, the final model ran in 40 seconds . This significant performance improvement demonstrates the benefits of the rounding operator in the FHE setting.
I wonder how you achieved this great speed up. I think something changed in the circuit. How did the circuit achieve this much speedup?