April 2025

Meta News

Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second

Meta has unveiled its Llama API, boasting a remarkable performance boost—running 18 times faster than OpenAI. In collaboration with Cerebras, it now processes an impressive 2,600 tokens per second, setting a new benchmark in AI capabilities.