Toggle light / dark theme

Paper page — Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Posted in futurism

Join the discussion on this paper page.