Toggle light / dark theme

Mini-Gemini.

Mining the potential of multi-modality vision language models.

In this work, we introduce Mini-Gemini, a simple and effective framework enhancing multi-modality Vision Language Models (VLMs).


Join the discussion on this paper page.

Now, Huffman has defended his pay packet in a Q&A video on the platform.

“Look, I’m glad this question was asked because there’s been a lot of commentary on this topic,” he began, adding that his compensation—which is made up of salary and stock—is set by Reddit’s board depending on his performance.

“If the company does well, I will do well,” the CEO added. “If the company does not do well, I don’t either.”

Non-personalized content and ads are influenced by things like the content you’re currently viewing and your location (ad serving is based on general location). Personalized content and ads can also include things like video recommendations, a customized YouTube homepage, and tailored ads based on past activity, like the videos you watch and the things you search for on YouTube. We also use cookies and data to tailor the experience to be age-appropriate, if relevant.

Select “More options” to see additional information, including details about managing your privacy settings. You can also visit g.co/privacytools at any time.