Add cuda support when loading local onnx model #249

jiafatom · 2024-12-13T16:59:51Z

Tested in a clean conda environment with the llm-oga-cuda dependency installed; the following commands work for evaluating a local ONNX model:
lemonade -i cuda-fpmixed_14 oga-load --dtype fp16 --device cuda oga-bench
lemonade -i cuda-fpmixed_14 oga-load --dtype fp16 --device cuda accuracy-mmlu --tests management
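Both commands share the same `oga-load --dtype fp16 --device cuda` prefix and differ only in the tool that follows it. A minimal sketch of scripting the two runs from Python, assuming `lemonade` is on PATH and `cuda-fpmixed_14` is the local model folder; the helper names `build_command` and `run_all` are hypothetical and not part of lemonade, and only flags that appear in the commands above are used:

```python
import shlex
import shutil
import subprocess

# Flags copied verbatim from the commands above; nothing else about the CLI is assumed.
OGA_LOAD_ARGS = ["oga-load", "--dtype", "fp16", "--device", "cuda"]


def build_command(model_dir, tool_args):
    """Return the argv list for one lemonade run against a local ONNX model folder."""
    return ["lemonade", "-i", model_dir, *OGA_LOAD_ARGS, *tool_args]


def run_all(model_dir="cuda-fpmixed_14", dry_run=True):
    """Print (dry_run=True) or execute the benchmark and MMLU commands in order."""
    commands = [
        build_command(model_dir, ["oga-bench"]),
        build_command(model_dir, ["accuracy-mmlu", "--tests", "management"]),
    ]
    if not dry_run and shutil.which("lemonade") is None:
        raise RuntimeError("lemonade executable not found on PATH")
    for cmd in commands:
        if dry_run:
            print(shlex.join(cmd))
        else:
            # check=True stops at the first failing command
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    run_all()  # dry run: prints the two command lines without executing them
```

Calling `run_all(dry_run=False)` would execute both commands in sequence; the default dry run only prints them, which is useful for checking the arguments before starting a long GPU job.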

Signed-off-by: David Fan <[email protected]>

ramkrishna2910 · 2024-12-13T19:11:22Z

Currently supports local model execution. Documentation will be added in a separate PR once the end-to-end OGA pipelines for int4 and fp16 have also been tested.

jiafatom force-pushed the add_cuda branch from 004cb07 to 6ccf9b8 · December 13, 2024 17:01

ramkrishna2910 requested review from ramkrishna2910, jeremyfowers and danielholanda December 13, 2024 17:13

ramkrishna2910 approved these changes Dec 13, 2024


jiafatom force-pushed the add_cuda branch 2 times, most recently from c8e1eaa to f368f01 · December 13, 2024 18:45

Add cuda support when loading local onnx model

df178c3

Signed-off-by: David Fan <[email protected]>

jiafatom force-pushed the add_cuda branch from f368f01 to df178c3 · December 13, 2024 18:56

ramkrishna2910 approved these changes Dec 13, 2024


ramkrishna2910 merged commit 8c46f6b into onnx:main Dec 13, 2024
8 checks passed

jiafatom deleted the add_cuda branch December 13, 2024 20:52
