
What is the primary function of performing model quantization on a neural network before deploying it to an edge device with limited power?



The primary function of model quantization is to reduce the memory footprint and computational requirements of a neural network by lowering the precision of its numerical values. In a standard neural network, the weights (the learned parameters) and the activations (the intermediate outputs of each layer) are typically stored as 32-bit floating-point numbers, which provide high precision....
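
To make the memory saving concrete, here is a minimal sketch of one common scheme, symmetric per-tensor quantization to 8-bit integers. It is an illustration under stated assumptions, not the method of any particular framework: the function names `quantize_int8` and `dequantize` and the sample weight values are hypothetical, and real toolchains typically add calibration data, per-channel scales, and zero points.

```python
from array import array


def quantize_int8(weights):
    """Map floats to int8 using one shared scale (symmetric, per-tensor)."""
    max_abs = max(abs(w) for w in weights)
    # Assumption: the largest magnitude maps to 127; an all-zero tensor uses scale 1.0.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from the int8 codes."""
    return [v * scale for v in q]


# Hypothetical weights, chosen only for illustration.
weights = [0.82, -1.27, 0.003, 0.5, -0.31]
fp32 = array("f", weights)          # 32-bit float storage
q, scale = quantize_int8(weights)   # 8-bit integer storage plus one scale
restored = dequantize(q, scale)

fp32_bytes = len(fp32) * fp32.itemsize
int8_bytes = len(q) * q.itemsize
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"float32: {fp32_bytes} bytes, int8: {int8_bytes} bytes")
print(f"largest round-trip error: {max_error:.4f} (step size {scale:.4f})")
```

Each value takes one byte instead of four, and the price is a rounding error of at most half a quantization step per weight. On hardware with integer arithmetic units, the smaller representation can also reduce the energy spent on memory traffic and computation, which is what matters on a power-limited edge device.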



