Quantization reduces model size and speeds up inference by reducing the number of bits required to represent weights or activations. In NNI, both post-training quantization algorithms and ...
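To make the idea concrete, here is a minimal sketch of post-training uniform (affine) quantization of a weight tensor to 8 bits. This is a generic illustration of the technique, not NNI's actual API; the function names and the toy weight values are assumptions for the example.

```python
# Generic illustration of 8-bit uniform affine quantization
# (not NNI's API): map floats to integers via a scale and zero point,
# then dequantize to see the approximation error.

def quantize(weights, num_bits=8):
    """Map float weights onto integers in [0, 2**num_bits - 1]."""
    qmin, qmax = 0, 2 ** num_bits - 1
    w_min, w_max = min(weights), max(weights)
    # Scale maps the float range onto the integer range;
    # guard against a zero range (all weights equal).
    scale = (w_max - w_min) / (qmax - qmin) or 1.0
    zero_point = round(qmin - w_min / scale)
    q = [max(qmin, min(qmax, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate float weights from the quantized integers."""
    return [(qi - zero_point) * scale for qi in q]

weights = [-1.0, -0.5, 0.0, 0.5, 1.0]          # hypothetical weights
q, scale, zp = quantize(weights)
recovered = dequantize(q, scale, zp)
max_err = max(abs(w - r) for w, r in zip(weights, recovered))
```

Each weight is stored in 8 bits instead of 32, a 4x size reduction, while the round-trip error stays within half a quantization step (`scale / 2`), which is why well-scaled post-training quantization typically costs little accuracy.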
Abstract: With ongoing advancements in natural language processing (NLP) and deep learning methods, the demand for computational and memory resources has considerably increased, which signifies the ...
Abstract: From a perspective of spatial quantization, this letter systematically investigates the advantages of reconfigurable reflectarrays (RRAs) designed with closely-spaced elements. Focused on ...