MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis.

Saved in:
Bibliographic Details
Title: MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis.
Authors: Zheng, Jianbin1, Liu, Daqing2, Wang, Chaoyue2, chaoyue.wang@outlook.com, Hu, Minghui3, Yang, Zuopeng4, Ding, Changxing1, chxding@scut.edu.cn, Tao, Dacheng2
Source: International Journal of Computer Vision; Sep2024, Vol. 132 Issue 9, p3537-3565, 29p
Database: Applied Science & Technology Source
Full text is not displayed to guests.
Be the first to leave a comment!
You must be logged in first