Paper page — Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Posted by Cecile G. Tamura in futurism Mar 12024 Join the discussion on this paper page. Read more | >