Apr 5, 2024
Paper page — CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Posted by Cecile G. Tamura in category: robotics/AI
LVLM-Interpret.
An interpretability tool for large vision-language models.
In the rapidly evolving landscape of artificial intelligence, multi-modal large language models are emerging as a significant area of interest.