ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression.

Saved in:
Bibliographic Details
Title: ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression.
Authors: Liu, Guangda1, Li, Chengwei1, Zhao, Jieru1, zhao-jieru@sjtu.edu.cn, Zhang, Chenqi1, Guo, Minyi1
Source: DAC: Annual ACM/IEEE Design Automation Conference; 2025, Issue 62, p1-7, 7p
Database: Applied Science & Technology Source
Description
ISSN:0738100X
DOI:10.1109/DAC63849.2025.11132479