Abstract: Neural network quantization aims at reducing bit-widths of weights and activations for memory and computational efficiency. Since a linear quantizer (i.e., round(·) function) cannot well fit ...
Digital circuit is a promising approach to implement computing-in-memory (CIM) architecture for data-intensive applications, such as neural network inference. Previous digital CIM implementations have ...