Generative AI

Google Gemini: Everything you need to know about the generative AI models

Google’s trying to make waves with Gemini, its flagship suite of generative AI models, apps, and services. But what’s Gemini? How can you use it? And how does it stack up to other generative AI tools such as OpenAI’s ChatGPT, Meta’s Llama, and Microsoft’s Copilot? To make it easier to keep up with the latest Gemini

Google Gemini: Everything you need to know about the generative AI models Read More »

ChatGPT now understands real-time video, seven months after OpenAI first demoed it

OpenAI has finally released the real-time video capabilities for ChatGPT that it demoed nearly seven months ago. On Thursday during a livestream, the company said that Advanced Voice Mode, its human-like conversational feature for ChatGPT, is getting vision. Using the ChatGPT app, users subscribed to ChatGPT Plus, Team, and Pro can point their phones at

ChatGPT now understands real-time video, seven months after OpenAI first demoed it Read More »

Twelve Labs is building AI that can analyze and search through videos

AI models that understand videos as well as text can unlock powerful new applications. At least, that’s what Jae Lee, the co-founder of Twelve Labs, believes. Granted, Lee’s a little biased. Twelve Labs trains video-analyzing models for a range of use cases. But there may just be something to his assertion. Using Twelve Labs’ models,

Twelve Labs is building AI that can analyze and search through videos Read More »

Cartesia claims its AI is efficient enough to run pretty much anywhere

It’s becoming increasingly costly to develop and run AI. OpenAI’s AI operations costs could reach $7 billion this year, while Anthropic’s CEO recently suggested that models costing over $10 billion could arrive soon. So the hunt is on for ways to make AI cheaper. Some researchers are focusing on techniques to optimize existing model architectures — i.e.

Cartesia claims its AI is efficient enough to run pretty much anywhere Read More »

It sure looks like OpenAI trained Sora on game content — and legal experts say that could be a problem

OpenAI has never revealed exactly which data it used to train Sora, its video-generating AI. But from the looks of it, at least some of the data might’ve come from Twitch streams and walkthroughs of games. Sora launched on Monday, and I’ve been playing around with it for a bit (to the extent the capacity

It sure looks like OpenAI trained Sora on game content — and legal experts say that could be a problem Read More »

Google’s AI Overviews will soon be able to answer math and coding questions

AI Overviews, the AI-generated summaries Google supplies for certain Google Search queries, will soon be able to handle “more complex topics” and “multimodal” and “multi-step” searches, the company says, including advanced math questions and coding problems. The expanded capabilities are driven by the newly launched Gemini 2.0 model, which Google says should also deliver improvements

Google’s AI Overviews will soon be able to answer math and coding questions Read More »

Gemini 2.0, Google’s newest flagship AI, can generate text, images, and speech

Google’s next major AI model has arrived to combat a slew of new offerings from OpenAI. On Wednesday, Google announced Gemini 2.0 Flash, which the company says can natively generate images and audio in addition to text. 2.0 Flash can also use third-party apps and services, allowing it to tap into Google Search, execute code,

Gemini 2.0, Google’s newest flagship AI, can generate text, images, and speech Read More »