AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory.

Bibliographic Details
Title: AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory.
Authors: Chen, Liyan¹ (liyan.chen@sjtu.edu.cn); Lyu, Dongxu¹; Li, Zhenyu¹; Jiang, Jianfei¹; Wang, Qin¹; Mao, Zhigang¹; Jing, Naifeng¹ (sjtuj@sjtu.edu.cn)
Source: DAC: Annual ACM/IEEE Design Automation Conference; 2025, Issue 62, p1-7, 7p
Database: Applied Science & Technology Source
Description
ISSN: 0738-100X
DOI: 10.1109/DAC63849.2025.11133230