Skip to content

Commit

Permalink
[Minor] update qserve
Browse files Browse the repository at this point in the history
  • Loading branch information
ys-2020 committed Feb 12, 2025
1 parent c296e51 commit e6c7f8b
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions _data/publications.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,8 +2,8 @@ main:

- title: "QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving"
authors: Yujun Lin*, Haotian Tang*, <strong>Shang Yang*</strong>, Zhekai Zhang, Guangxuan Xiao, Chuang Gan, Song Han.
conference_short: arXiv
conference: <strong>arXiv</strong>, 2024.
conference_short: MLSys
conference: The Eighth Annual Conference on Machine Learning and Systems <strong>(MLSys)</strong>, 2025.
paper: https://arxiv.org/abs/2405.04532
code: https://github.com/mit-han-lab/qserve
image: ./assets/img/paper_teasers/QServe.png
Expand Down

0 comments on commit e6c7f8b

Please sign in to comment.