AI training

Micro1, a competitor to Scale AI, raises funds at $500M valuation

Micro1, a three-year-old startup that helps AI companies find and manage human contractors for data labeling and training, has raised a $35 million Series A funding round that values the company at $500 million. The round was led by O1 Advisors, a venture capital firm co-founded by Dick Costolo and Adam Bain, the former CEO […]

Micro1, a competitor to Scale AI, raises funds at $500M valuation Read More »

Meta is reportedly using actual tents to build data centers

Meta and Mark Zuckerberg are in a hurry to build their superintelligence tech. The company has been poaching AI researchers, while CEO Mark Zuckerberg announced on Monday that Meta is building a 5-gigawatt data center called Hyperion. The urgency is palpable. As SemiAnalysis reported last week and Business Insider noted, Meta is so eager to

Meta is reportedly using actual tents to build data centers Read More »

Exclusive: Google’s Gemini is forcing contractors to rate AI responses outside their expertise

Generative AI may look like magic, but behind the development of these systems are armies of employees at companies like Google, OpenAI, and others, known as “prompt engineers” and analysts, who rate the accuracy of chatbots’ outputs to improve their AI. But a new internal guideline passed down from Google to contractors working on Gemini,

Exclusive: Google’s Gemini is forcing contractors to rate AI responses outside their expertise Read More »

Harvard and Google to release 1 million public-domain books as AI training dataset

AI training data has a big price tag, one best-suited for deep-pocketed tech firms. This is why Harvard University plans to release a dataset that includes in the region of 1 million public-domain books, spanning genres, languages, and authors including Dickens, Dante, and Shakespeare, which are no longer copyright-protected due to their age. The new

Harvard and Google to release 1 million public-domain books as AI training dataset Read More »

Andreessen Horowitz helps founders meet compute needs with ‘Oxygen’ private GPU cluster

Andreessen Horowitz has a massive cluster of Nvidia H100 GPUs to help its portfolio of AI startups meet their compute needs, the venture capital firm confirmed for the first time on Wednesday. The program, called “Oxygen”, allows their portfolio companies to train or operate their AI models without negotiating market rates. A16Z’s Oxygen cluster gives

Andreessen Horowitz helps founders meet compute needs with ‘Oxygen’ private GPU cluster Read More »

Over 100k YouTube videos have been scraped to train AI

What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by companies like Anthropic, Nvidia, Apple and Salesforce. An investigation from Wired and Proof News found that this dataset, which is called YouTube Subtitles, contains transcripts from over

Over 100k YouTube videos have been scraped to train AI Read More »