-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请教关于mnncompress的使用问题 #2847
Labels
question
Further information is requested
Comments
没有生效是指?生成 compress_params_index.bin 之后,需要加上这个参数再转换一下:--compressionParamsFile compress_params_index.bin |
感谢回复。没有生效指的是模型大小没有改变,在我所给出的例子当中。 |
自己尝试了一下发现模型需要在 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
问题描述
按照文档的描述,如果使用离线量化,移除代码中参数更新部分代码。所以我的理解来看,这就等同于传入校准(calibrate)就好,等同于执行多次推理。实验下来发现量化过程没有生效。
示例代码:
模型量化前后的对比:
请问是我的使用方式问题吗? 不过使用工具箱当中的
quantize.out
是有效的The text was updated successfully, but these errors were encountered: