MQBench is a benchmark and framework for evluating the quantization algorithms under real world hardware deployments. Integrated with the latest features of Pytorch, MQBench can automated trace a full precision model and convert it to quantized model. It provides numerous hardware & algorithms for researchers to benchmark the deployability and reproducibility for quantization. We open source the MQBench library to facilitate the community.
Reproducibility: MQBench unifies the training hyper-parameters and compare different algorithms fairly.
Deployability: MQBench sumarizes the quantization schemes of 5 deep learning acceleraters and align the quantization point by a flexible toolkit.
These two points are always nelegected by previous works. More detailed infomation see our benchmark paper, toolkit and documentation.
MQBench is flexible to add support for new hardware or quantization algorithms. WELCOME to contribute and submit new results following the README instruction.