OpenAI

OpenAI partner says it had relatively little time to test the company’s o3 AI model

An organization OpenAI frequently partners with to probe the capabilities of its AI models and evaluate them for safety, Metr, suggests that it wasn’t given much time to test one of the company’s highly capable new releases, o3. In a blog post published Wednesday, Metr writes that one red teaming benchmark of o3 was “conducted […]

OpenAI partner says it had relatively little time to test the company’s o3 AI model Read More »

OpenAI launches a pair of AI reasoning models, o3 and o4-mini

OpenAI announced on Thursday the launch of o3 and o4-mini, new AI reasoning models designed to pause and work through questions before responding. The company calls o3 its most advanced reasoning model ever, outperforming the company’s previous models on tests measuring math, coding, reasoning, science, and visual understanding capabilities. Meanwhile, o4-mini offers what OpenAI says

OpenAI launches a pair of AI reasoning models, o3 and o4-mini Read More »

OpenAI debuts Codex CLI, an open source coding tool for terminals

In a bid to inject AI into more of the programming process, OpenAI is launching Codex CLI, a coding “agent” designed to run locally from terminal software. Announced on Wednesday alongside OpenAI’s newest AI models, o3 and o4-mini, Codex CLI links OpenAI’s models with local code and computing tasks, OpenAI says. Via Codex CLI, OpenAI’s

OpenAI debuts Codex CLI, an open source coding tool for terminals Read More »

A dev built a test to see how AI chatbots respond to controversial topics

A pseudonymous developer has created what they’re calling a “free speech eval,” SpeechMap, for the AI models powering chatbots like OpenAI’s ChatGPT and X’s Grok. The goal is to compare how different models treat sensitive and controversial subjects, the developer told TechCrunch, including political criticism and questions about civil rights and protest. AI companies have

A dev built a test to see how AI chatbots respond to controversial topics Read More »

OpenAI may ‘adjust’ its safeguards if rivals release ‘high-risk’ AI

In an update to its Preparedness Framework, the internal framework OpenAI uses to decide whether AI models are safe and what safeguards, if any, are needed during development and release, OpenAI said that it may “adjust” its requirements if a rival AI lab releases a “high-risk” system without comparable safeguards. The change reflects the increasing

OpenAI may ‘adjust’ its safeguards if rivals release ‘high-risk’ AI Read More »

OpenAI hires team behind GV-backed AI eval platform Context.ai

Context.ai, a startup building evaluations and analytics for AI models, announced Tuesday that its co-founders will join OpenAI.  Context.ai plans to wind down its products following the acqui-hire, per a message on the company’s website. When reached for comment, OpenAI declined to reveal the terms of the deal. “Evals are a requirement to building high-performing

OpenAI hires team behind GV-backed AI eval platform Context.ai Read More »

OpenAI is reportedly developing its own X-like social media platform

OpenAI is building its own X-like social media network, according to a new report from The Verge. The project is still in the early stages, but there’s an internal prototype focused on ChatGPT’s image generation that contains a social feed. The report states that it’s unknown if OpenAI plans to launch the social network as a

OpenAI is reportedly developing its own X-like social media platform Read More »

OpenAI’s new GPT-4.1 AI models focus on coding

OpenAI on Monday launched a new family of models called GPT-4.1. Yes, “4.1” — as if the company’s nomenclature wasn’t confusing enough already. There’s GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, all of which OpenAI says “excel” at coding and instruction following. Available through OpenAI’s API but not ChatGPT, the multimodal models have a 1-million-token context

OpenAI’s new GPT-4.1 AI models focus on coding Read More »

OpenAI plans to phase out GPT-4.5, its largest-ever AI model, from its API

OpenAI said on Monday that it would soon wind down the availability of GPT-4.5, its largest-ever AI model, via its API. GPT-4.5 was released only in late February. Developers will have access to GPT-4.5 via OpenAI’s API until July 14, after which they’ll have to transition to another model in OpenAI’s catalog, the company says.

OpenAI plans to phase out GPT-4.5, its largest-ever AI model, from its API Read More »