Over 100k YouTube videos have been scraped to train AI
What do MrBeast, John Oliver and the Wall Street Journal have in common? Transcripts of their YouTube videos have been scraped to train AI models used by companies like Anthropic, Nvidia, Apple and Salesforce. An investigation from Wired and Proof News found that the dataset, called YouTube Subtitles, contains transcripts from over […]