GiT Towards Generalist Vision Transformer through Universal Language Interface.
Towards Generalist Vision Transformer through Universal Language Interface.
This paper proposes a simple, yet effective framework, called GiT, simultaneously applicable for various vision tasks only with a vanilla ViT.
Join the discussion on this paper page.
Comments are closed.