ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression.

Saved in:
Bibliographic Details
Title: ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression.
Authors: Liu, Guangda1, Li, Chengwei1, Zhao, Jieru1, zhao-jieru@sjtu.edu.cn, Zhang, Chenqi1, Guo, Minyi1
Source: DAC: Annual ACM/IEEE Design Automation Conference; 2025, Issue 62, p1-7, 7p
Database: Applied Science & Technology Source
Be the first to leave a comment!
You must be logged in first