Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?
Saved in:
| Title: | Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining? |
|---|---|
| Authors: | Wang, Fei1, ft_feiw@mail.scut.edu.cn, Ding, Liang2, liangding.liam@gmail.com, Rao, Jun3, rao7jun@gmail.com, Liu, Ye4, yliu03@scut.edu.cn, Shen, Li5, mathshenli@gmail.com, Ding, Changxing6, chxding@scut.edu.cn |
| Source: | ACM Transactions on Multimedia Computing, Communications & Applications; Dec2024, Vol. 20 Issue 12, p1-22, 22p |
| Database: | Applied Science & Technology Source |
| ISSN: | 15516857 |
|---|---|
| DOI: | 10.1145/3690640 |