AI Safety

California lawmaker behind SB 1047 reignites push for mandated AI safety reports

California State Senator Scott Wiener on Wednesday introduced new amendments to his latest bill, SB 53, which would require the world’s largest AI companies to publish safety and security protocols and issue reports when safety incidents occur. If signed into law, California would be the first state to impose meaningful transparency requirements on leading AI […]

Anthropic says most AI models, not just Claude, will resort to blackmail

Several weeks after Anthropic released research claiming that its Claude Opus 4 AI model resorted to blackmailing engineers who tried to turn the model off in controlled test scenarios, the company is out with new research suggesting the problem is more widespread among leading AI models. On Friday, Anthropic published new safety research testing 16 […]

ChatGPT will avoid being shut down in some life-threatening scenarios, former OpenAI researcher claims

Former OpenAI research leader Steven Adler published a new independent study on Wednesday claiming that, in certain scenarios, his former employer’s AI models will go to great lengths to try to avoid being shut down. In a blog post, Adler describes a series of experiments he ran on OpenAI’s latest GPT-4o model, the default model […]

Yoshua Bengio launches LawZero, a nonprofit AI safety lab

Turing Award winner Yoshua Bengio is launching a nonprofit AI safety lab called LawZero to build safer AI systems, he told the Financial Times on Monday. LawZero raised $30 million in philanthropic contributions from Skype founding engineer Jaan Tallinn, former Google chief Eric Schmidt, Open Philanthropy, and the Future of Life Institute, among others. […]

Anthropic CEO wants to open the black box of AI models by 2027

Anthropic CEO Dario Amodei published an essay Thursday highlighting how little researchers understand about the inner workings of the world’s leading AI models. To address that, Amodei set an ambitious goal for Anthropic to reliably detect most AI model problems by 2027. Amodei acknowledges the challenge ahead. In “The Urgency of Interpretability,” the CEO says Anthropic has […]

OpenAI’s latest AI models have a new safeguard to prevent biorisks

OpenAI says that it deployed a new system to monitor its latest AI reasoning models, o3 and o4-mini, for prompts related to biological and chemical threats. The system aims to prevent the models from offering advice that could instruct someone on carrying out potentially harmful attacks, according to OpenAI’s safety report. O3 and o4-mini represent […]

Google is shipping Gemini models faster than its AI safety reports

More than two years after Google was caught flat-footed by the release of OpenAI’s ChatGPT, the company has dramatically picked up the pace. In late March, Google launched an AI reasoning model, Gemini 2.5 Pro, that leads the industry on several benchmarks measuring coding and math capabilities. That launch came just three months after the […]
