THATCH, Corwin; BRAMWELL, Liora. Cross-Modal Vision Representation Learning for Real-World Visual Understanding. Journal of Computer Technology and Software, [S. l.], v. 4, n. 4, 2025. DOI: 10.5281/zenodo.15340705. Disponível em: https://ashpress.org/index.php/jcts/article/view/152. Acesso em: 14 may. 2025.