The New York Times (NYT) lawsuit against OpenAI and Microsoft breaks new ground in the ongoing legal challenges posed by the use of copyrighted data to "train," or improve, generative AI.
A number of lawsuits have already been filed against AI companies, including one brought by Getty Images against StabilityAI, the developer of the online text-to-image generator Stable Diffusion. Authors George R.R. Martin and John Grisham are also suing ChatGPT's owner, OpenAI, over alleged copyright infringement. But the NYT case is far from “more of the same” as it throws some interesting new arguments into the mix.
This legal action focuses on new issues around the value of training data and reputational damage. It is a potent mix of trademark and copyright claims that may test the fair use defense defendants in these cases normally rely on.
No doubt it will be watched closely by media organizations looking to challenge the usual “ask for forgiveness, not permission” approach to training data. Training data is used to improve the performance of AI systems and typically consists of real-world information, often obtained from the internet.
The case also introduces a novel argument, not raised in other, similar cases, relating to so-called "hallucinations," where AI systems generate false or misleading information but present it as fact. This argument may in fact be one of the most potent in the case.
The NYT case in particular advances three interesting takes on the usual approach. First, that because of their reputation for trustworthy news and information, NYT articles have enhanced value and desirability as training data for AI.
Second, that because of the NYT's paywall, the reproduction of articles on request is commercially damaging. Third, that ChatGPT's hallucinations are causing reputational damage to the New York Times through false attribution.
This is more than just another generative AI copyright dispute. The first argument presented by the NYT is that ChatGPT's training phase infringes copyright, because the training data OpenAI used is protected by copyright. We have seen this type of argument play out before in other disputes.
Fair use?
The obstacle to this line of attack is the shield of fair use. In the United States, fair use is a legal doctrine that permits the use of copyrighted material in certain circumstances, such as news reporting, academic work, and commentary.
While OpenAI's response has been very cautious so far, a key tenet of the company's statement is that its use of online data does fall under the doctrine of “fair use.”
Anticipating some of the difficulties such a fair use defense could cause, the NYT takes a slightly different angle. In particular, it seeks to differentiate its data from standard data, leaning on the accuracy, trustworthiness, and prestige of its reporting. It claims these qualities create a particularly desirable dataset.
As a reputable and trusted source, it argues, its writing carries additional weight and reliability when training generative AI, and so forms part of a data subset that is given extra weight in that training.
The NYT argues that because ChatGPT duplicates large portions of its articles in response to prompts, it can deprive the paywalled NYT of visitors and the revenue it would otherwise receive. Introducing this element of commercial competition and commercial advantage appears intended to head off the usual fair use defense common to these claims.
It will be interesting to see whether the assertion of special weighting in the training data has an impact. If it does, it sets a path for other media organizations to challenge the use of their reporting in training data without permission.
The final element of the NYT's claim presents a novel angle on the issue. It suggests that the NYT brand is being damaged by the material ChatGPT produces. While presented almost as an afterthought in the complaint, it may yet be the claim that causes OpenAI the most difficulty.
This is the argument concerning AI "hallucinations." The NYT argues that ChatGPT compounds the problem by presenting the false information as if it came from the NYT.
The paper further suggests that consumers may act on summaries ChatGPT provides, believing the information comes from the NYT and can be trusted. The reputational damage arises because the newspaper has no control over what ChatGPT produces.
This is an interesting challenge. Hallucinations are a recognized problem with AI-generated responses, and the NYT argues that the reputational harm they cause may not be easy to remedy.
The NYT's claims open a series of novel lines of attack that move the focus from copyright itself to how ChatGPT presents copyrighted data to users, and to the value of that data to the newspaper. This is much harder for OpenAI to defend against.
The case will be closely watched by other media publishers, especially those behind paywalls, particularly with regard to how it interacts with the usual fair use defense.
If the NYT's dataset is recognized as having the "enhanced value" it claims, it may open a path to monetizing that data for AI training, rather than the "ask for forgiveness, not permission" approach that prevails today.
Peter Vaughan is a Senior Lecturer at Nottingham Law School, Nottingham Trent University.
This article is republished from The Conversation under a Creative Commons license. Read the original article.