AI2

Ai2 says its new AI model beats one of DeepSeek’s best

Move over, DeepSeek. There’s a new AI champion in town — and they’re American. On Thursday, Ai2, a nonprofit AI research institute based in Seattle, released a model that it claims outperforms DeepSeek V3, one of Chinese AI company DeepSeek’s leading systems. Ai2’s model, called Tulu3-405B, also beats OpenAI’s GPT-4o on certain AI benchmarks, according […]

Ai2 says its new AI model beats one of DeepSeek’s best Read More »

AI2’s Molmo shows open source can meet, and beat, closed multimodal models

The common wisdom is that companies like Google, OpenAI, and Anthropic, with bottomless cash reserves and hundreds of top-tier researchers, are the only ones that can make state-of-the-art foundation model. But as one among them famously noted, they “have no moat” — and AI2 showed that today with the release of Molmo, a multimodal AI

AI2’s Molmo shows open source can meet, and beat, closed multimodal models Read More »

Study suggests that even the best AI models hallucinate a bunch

All generative AI models hallucinate, from Google’s Gemini to Anthropic’s Claude to the latest stealth release of OpenAI’s GPT-4o. The models are unreliable narrators in other words — sometimes to hilarious effect, other times problematically so. But not all models make things up at the same rate. And the kinds of mistruths they spout depend

Study suggests that even the best AI models hallucinate a bunch Read More »