Add quantization scripts #1

adrianlyjak · 2025-02-21T17:52:02Z

No description provided.

…ter integrated with typer

…e isn't reduced???

…ntization

* start quant calibration, and remove loguru for rich, since that's better integrated with typer * giving up on ORT quantization * prep for rewrite float_to_float16 * add custom float16 conversion implementation * trying to fix bugs * gonna take my own stab at it * fuckit * wip fuckit * fuckit complete * hmm, this exports now. Sorta wrong, should have value infos.Also, size isn't reduced??? * messy messy * now it "converts" to fp16, but no longer producing sound * oh boy, its all useless? neural_encoder _does_ do mixed precision quantization * trying to figure out intel neural compressor * better gpu support for INC * better gpu support for INC * add onnxruntime-gpu * start configuration for experimenting with op selection and quant level * grr inc stupid * hmm, is the reference quant dynamic? * Wrap it up

adrianlyjak and others added 21 commits February 13, 2025 08:21

start quant calibration, and remove loguru for rich, since that's bet…

d0d1044

…ter integrated with typer

giving up on ORT quantization

2942e1c

prep for rewrite float_to_float16

46329d5

add custom float16 conversion implementation

9bc9e67

trying to fix bugs

a28ceac

gonna take my own stab at it

04e750b

fuckit

f1cc329

wip fuckit

6ec778a

fuckit complete

6e40812

hmm, this exports now. Sorta wrong, should have value infos.Also, siz…

ea61563

…e isn't reduced???

messy messy

a6413b5

now it "converts" to fp16, but no longer producing sound

5a529cd

oh boy, its all useless? neural_encoder _does_ do mixed precision qua…

627f8e9

…ntization

trying to figure out intel neural compressor

66ae91d

better gpu support for INC

6a5d913

better gpu support for INC

5bd6a04

add onnxruntime-gpu

ad838a1

start configuration for experimenting with op selection and quant level

d76ebba

grr inc stupid

b8958bd

hmm, is the reference quant dynamic?

33b799c

Wrap it up

36b3ae7

adrianlyjak force-pushed the quant branch from 3f566bc to 36b3ae7 Compare February 21, 2025 17:56

adrianlyjak merged commit 25989a9 into main Feb 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add quantization scripts #1

Add quantization scripts #1

adrianlyjak commented Feb 21, 2025

Add quantization scripts #1

Add quantization scripts #1

Conversation

adrianlyjak commented Feb 21, 2025