run()

Function

Runs the quantization algorithm. After initializing an OnnxCalibrator instance, call run() to perform quantization, then export the quantized model with export_quant_onnx().

Prototype

from msmodelslim.onnx.squant_ptq import OnnxCalibrator, QuantConfig

# Build the quantization configuration.
quant_config = QuantConfig(disable_names=[],
                           quant_mode=0,
                           amp_num=0)
input_model_path = "/home/xxx/Resnet50/resnet50_pytorch.onnx"   # Set to the actual path of the model to quantize.
output_model_path = "/home/xxx/Resnet50/resnet50_quant.onnx"    # Set to the path where the quantized model should be saved.

# Initialize the calibrator, run quantization, then export the quantized model.
calibrator = OnnxCalibrator(input_model_path, quant_config)
calibrator.run()
calibrator.export_quant_onnx(output_model_path)
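
The same three steps can be wrapped in a small helper that checks the paths before any quantization work starts. This is only a sketch: quantize_onnx is a hypothetical name, not part of msmodelslim, and the path checks are an assumption about what is convenient to validate up front. It uses only the calls shown in the example above (QuantConfig, OnnxCalibrator, run(), export_quant_onnx()) with the same configuration values.

```python
from pathlib import Path


def quantize_onnx(input_model_path, output_model_path, disable_names=None):
    """Hypothetical helper (not part of msmodelslim): validate paths, then quantize."""
    input_path = Path(input_model_path)
    if not input_path.is_file():
        raise FileNotFoundError(f"ONNX model not found: {input_path}")

    output_path = Path(output_model_path)
    if not output_path.parent.is_dir():
        raise FileNotFoundError(f"Output directory not found: {output_path.parent}")

    # Imported lazily so the path checks run even where msmodelslim is not installed.
    from msmodelslim.onnx.squant_ptq import OnnxCalibrator, QuantConfig

    # Same configuration values as the example above.
    quant_config = QuantConfig(disable_names=disable_names or [],
                               quant_mode=0,
                               amp_num=0)
    calibrator = OnnxCalibrator(str(input_path), quant_config)
    calibrator.run()                                # run quantization first
    calibrator.export_quant_onnx(str(output_path))  # then export the result
    return output_path
```

With this helper, a missing model file or output directory raises FileNotFoundError before OnnxCalibrator is created. A call would look like quantize_onnx("/home/xxx/Resnet50/resnet50_pytorch.onnx", "/home/xxx/Resnet50/resnet50_quant.onnx"), with the paths adjusted to the actual model location.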