AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory.
Saved in:
| Title: | AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory. |
|---|---|
| Authors: | Chen, Liyan1, liyan.chen@sjtu.edu.cn, Lyu, Dongxu1, Li, Zhenyu1, Jiang, Jianfei1, Wang, Qin1, Mao, Zhigang1, Jing, Naifeng1, sjtuj@sjtu.edu.cn |
| Source: | DAC: Annual ACM/IEEE Design Automation Conference; 2025, Issue 62, p1-7, 7p |
| Database: | Applied Science & Technology Source |
Be the first to leave a comment!