Wang, F., Ding, L., Rao, J., Liu, Y., Shen, L., & Ding, C. (2024). Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining? ACM Transactions on Multimedia Computing, Communications & Applications, 20(12), 1. https://doi.org/10.1145/3690640
Chicago Style (17th ed.) CitationWang, Fei, Liang Ding, Jun Rao, Ye Liu, Li Shen, and Changxing Ding. "Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?" ACM Transactions on Multimedia Computing, Communications & Applications 20, no. 12 (2024): 1. https://doi.org/10.1145/3690640.
MLA (9th ed.) CitationWang, Fei, et al. "Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?" ACM Transactions on Multimedia Computing, Communications & Applications, vol. 20, no. 12, 2024, p. 1, https://doi.org/10.1145/3690640.