Menu

Blog

Feb 3, 2021

Experimental AI framework Vx2Text generates video captions using inferences from audio and text

Posted by in category: robotics/AI

Researchers at Facebook have developed a framework, Vx2Text, that can generate captions by inferring from videos, audio, and text.

Comments are closed.