Liu, G., Li, C., Zhao, J., Zhang, C., & Guo, M. (2025). ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression. DAC: Annual ACM/IEEE Design Automation Conference, 62, 1. https://doi.org/10.1109/DAC63849.2025.11132479
Chicago Style (17th ed.) Citation: Liu, Guangda, Chengwei Li, Jieru Zhao, Chenqi Zhang, and Minyi Guo. "ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression." DAC: Annual ACM/IEEE Design Automation Conference 62 (2025): 1. https://doi.org/10.1109/DAC63849.2025.11132479.
MLA (9th ed.) Citation: Liu, Guangda, et al. "ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression." DAC: Annual ACM/IEEE Design Automation Conference, 62, 2025, p. 1, https://doi.org/10.1109/DAC63849.2025.11132479.