-
Notifications
You must be signed in to change notification settings - Fork 1
llama.cpp 缺失算子补全 #1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
04-02: GET_ROWS && DUP && CPY |
04-03: SIN && COS && ARGMAX |
04-07: CONV_TRANSPOSE_1D && ELU |
04-10: ROPE optimization |
04-11: 优化 ROPE 算子精度问题 && 学习 Profiling 技巧 |
|
04-14 - 04-15:
|
已完成。PR
|
复杂算子暂时没有影响大模型推理,根据实际需求支持剩余算子 |
04-21:已经完成了MUL_MAT_ID的算子实现,当前未使用GroupedMatMul加速库算子。PR |
04-22:对MUL_MAT_ID算子,使用GroupedMatMul来完成实现,仍存在一些参数上的问题,待完成,暂时先搁置,投入torchair + mindie 实现 deepseek-v2 kv cache上 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The text was updated successfully, but these errors were encountered: