In what is set to be a defining case for the future of generative AI, the New York Times is suing OpenAI for training ChatGPT using its articles without permission. The defendant alleges that the media outlet deliberately manipulated its AI model into reproducing articles verbatim.
The landscape of generative AI may not look so lawless in 2024 if the New York Times can win its landmark case against OpenAI and its major backer, Microsoft. Big if.
In what is due to be a pivotal juncture for generative AI platforms and their innate processes, the media outlet is suing ChatGPT’s creator for training its language models using NYT content without permission.
While the very nature of a deep learning model is to absorb as much data as possible in order to generate valuable responses, the NYT alleges that ChatGPT has recited its content verbatim on several occasions.
A spokesperson said this “undermines and damages” the company’s reputation while simultaneously depriving it of “subscription, licensing, advertising, and affiliate revenue.” The Times updated its terms of service in August 2023 to prohibit the scraping of its articles and images for AI training.
In layman’s terms, the NYT now views ChatGPT as direct competition in the news business and isn’t keen on sharing its intellectual property without compensation.
In a juicy turn of events, however, OpenAI has said it believes that employees at the NYT deliberately tricked the generative AI tool into replicating excerpts from its articles. Dismissing the case as being “without merit,” OpenAI still hopes to partner with the media outlet, as it has with The Associated Press, among others.
As for the apparent examples of plagiarism, which the public are obviously not privy to, OpenAI claims that the NYT either explicitly instructed the model to regurgitate its articles or cherry-picked examples from many attempts.
The selected quotes “appear to be from years-old articles that have proliferated on multiple third-party websites,” a company spokesperson said. OpenAI previously disabled a ChatGPT feature called Browse upon discovering it unintentionally reproduced content, but senior figures at the company deny allegations that its generative AI has the same problem now.