Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

Saved in:
Bibliographic Details
Title: Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?
Authors: Wang, Fei1, ft_feiw@mail.scut.edu.cn, Ding, Liang2, liangding.liam@gmail.com, Rao, Jun3, rao7jun@gmail.com, Liu, Ye4, yliu03@scut.edu.cn, Shen, Li5, mathshenli@gmail.com, Ding, Changxing6, chxding@scut.edu.cn
Source: ACM Transactions on Multimedia Computing, Communications & Applications; Dec2024, Vol. 20 Issue 12, p1-22, 22p
Database: Applied Science & Technology Source
Be the first to leave a comment!
You must be logged in first