Menu

Blog

Jun 26, 2024

AI Startup Etched Unveils Transformer ASIC Claiming 20x Speed-up Over NVIDIA H100

Posted by in category: robotics/AI

A new startup emerged out of stealth mode today to power the next generation of generative AI. Etched is a company that makes an application-specific integrated circuit (ASIC) to process “Transformers.” The transformer is an architecture for designing deep learning models developed by Google and is now the powerhouse behind models like OpenAI’s GPT-4o in ChatGPT, Antrophic Claude, Google Gemini, and Meta’s Llama family. Etched wanted to create an ASIC for processing only the transformer models, making a chip called Sohu. The claim is Sohu outperforms NVIDIA’s latest and greatest by an entire order of magnitude. Where a server configuration with eight NVIDIA H100 GPU clusters pushes Llama-3 70B models at 25,000 tokens per second, and the latest eight B200 “Blackwell” GPU cluster pushes 43,000 tokens/s, the eight Sohu clusters manage to output 500,000 tokens per second.

Leave a reply