Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add quantization scripts #1

Merged
merged 21 commits into from
Feb 21, 2025
Merged

Add quantization scripts #1

merged 21 commits into from
Feb 21, 2025

Conversation

adrianlyjak
Copy link
Owner

No description provided.

@adrianlyjak adrianlyjak merged commit 25989a9 into main Feb 21, 2025
adrianlyjak added a commit that referenced this pull request Feb 21, 2025
* start quant calibration, and remove loguru for rich, since that's better integrated with typer

* giving up on ORT quantization

* prep for rewrite float_to_float16

* add custom float16 conversion implementation

* trying to fix bugs

* gonna take my own stab at it

* fuckit

* wip fuckit

* fuckit complete

* hmm, this exports now. Sorta wrong, should have value infos.Also, size isn't reduced???

* messy messy

* now it "converts" to fp16, but no longer producing sound

* oh boy, its all useless? neural_encoder _does_ do mixed precision quantization

* trying to figure out intel neural compressor

* better gpu support for INC

* better gpu support for INC

* add onnxruntime-gpu

* start configuration for experimenting with op selection and quant level

* grr inc stupid

* hmm, is the reference quant dynamic?

* Wrap it up
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant