, , , ,

Judge Stein dismisses some, but not all claims v. OpenAI and Microsoft in the newspapers cases (New York Times, Daily News, Center for Inv. Reporting)

Today has turned out to be Rule on Motions to Dismiss Day. On the heels of Judge Lee’s dismissal of some claims of Concord Music against Anthropic earlier this morning, Judge Stein just issued an order dismissing some, but not all of the claims OpenAI and Microsoft sought to be dismissed. (Other than the assertion of a statute of limitations defense, they did not seek dismissal of the direct infringement claims.)

Claims Dismissed With Prejudice

Claims Dismissed Without Prejudice (allowing amending the complaints)

Claims Not Dismissed

For the 1202(b)(1) claim, the Center for Investigative Reporting alleged: “Based on the information above, including the OpenAI Defendants’ admission to using the Dragnet and Newspaper extraction methods, which remove author, title, copyright notice, and terms of use information from copyright-protected news articles published online, the OpenAI Defendants intentionally removed author, title, copyright notice, and terms of use information from Plaintiff’s copyrighted works in creating ChatGPT training sets.” Paragraph 103, First Amended Complaint.

Similarly, the Daily News alleged (at paragraphs 159-63): “For example, in order to construct the datasets used to train their GenAI products, the Defendants used content extractors that, by design, removed the Publishers’ CMI from the Publishers’ Works. For example, OpenAI used the Dragnet67 and Newspaper68 content extractors in creating the WebText dataset, which intentionally removed the Publishers’ CMI from the Publishers’ Works scraped from their website. The abstract of the paper describing the Dragnet content extractor describes that copyright notices are removed as part of the process of extracting the text content of a website: “The goal of content extraction or boilerplate detection is to separate the main content from navigation chrome, advertising blocks, copyright notices and the like in web pages.”

The New York Times’ 1202(b)(1) claim was dismissed apparently because it didn’t have this level of specificity in the removal of CMI. Presumably, that may be fixable in an amended complaint.

Download Judge Stein’s order:

Leave a Reply


Discover more from Chat GPT Is Eating the World

Subscribe now to keep reading and get access to the full archive.

Continue reading