I am Mo Zhu, a student studying CS & AI at Zhejiang University.
-
Zhejiang University
- Zhejiang University
Pinned Loading
-
massive-activations-deep
massive-activations-deep PublicForked from locuslab/massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
Python
-
gpt_paper_assistant_ori
gpt_paper_assistant_ori PublicForked from tatsu-lab/gpt_paper_assistant
Deepseek based personalized ArXiv paper assistant bot
-
llm_distillation
llm_distillation PublicForked from Nicolas-BZRD/llm-distillation
i don't know why it doesn't do well
Python
-
2018cx/Multi-Level-OT
2018cx/Multi-Level-OT PublicPytorch Implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.