1 year, 7 months ago

OpenAI defends alleged use of novels in training data sets for “innovation”

OpenAI has defended the use of copyrighted materials, such as novels, in data sets for training large language models, arguing that fair use protects such innovation. In a court filing dated August 28, OpenAI responded to the suit filed by authors Paul Tremblay and Mona Awad, who claimed that the AI startup used their copyrighted work to train ChatGPT. Authors have also claimed that OpenAI, Google, and Meta scraped copyrighted works available for free on book piracy websites. OpenAI said in its filing that many courts have applied the fair use doctrine to recognize that “innovators” may use copyrighted work in “transformative ways.”

The Hindu
