SuDIS@ZJU
SuDIS@ZJU
News
People
Publications
English
English
中文 (简体)
Draft& Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding
Jun Zhang
,
Jue Wang
,
Huan Li
,
Lidan Shou
,
Ke Chen
,
Gang Chen
,
Sharad Mehrotra
January 2024
Cite
DOI
URL
Type
Conference paper
Publication
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024
Cite
×