OpenAI used over 1 million hours of YouTube data to train GPT-4 AI: Report
Hindustan TimesOpenAI used over a million hours of YouTube videos to train its large language model GPT-4, a report revealed as major tech companies are attempting to acquire more and more data to train their artificial intelligence models. As per this process, over one million hours of video content was transcribed which raised concerns about compliance with YouTube's policies as Google owned YouTube restricts use of its videos for independent applications. As per this process, over one million hours of video content was transcribed which raised concerns about compliance with YouTube's policies This comes days after YouTube CEO Neal Mohan was asked if OpenAI's Sora video generator uses data from YouTube in an interview with the Wall Stree Journal. He said that he was not aware if OpenAI used any YouTube data to train it new video tool but claimed that it would be a problem if OpenAI used YouTube videos.