Chat GPT Is Eating the World

book authors, datasets, In re Mosaic LLM Litigation, MosaicML

Databricks opposes discovery of new model DBRX as beyond scope of book authors’ complaint v. Mosaic LLM

May 16, 2025

There’s a discovery battle brewing in the In re Mosaic LLM Litigation. Databricks opposes the discovery requests for evidence related to Databricks’ new model DBRX, which goes beyond the complaint against the model Mosaic LLM (that Databricks acquired the rights to).

Apparently, the plaintiffs are seeking information related to the datasets used to train the DBRX model. Databricks said it didn’t use the controversial Books3 dataset. But the plaintiffs contend that the training of DBRX involved 12 trillion tokens, a size that the plaintiffs argue couldn’t have been achieved without using pirated books datasets of some kind.

Discovery Statement re DBRX in In re Mosaic (May 15 2025)Download

Substack

AI copyright & tort litigation, tracked in real time.

Chat GPT Is Eating the World

Databricks opposes discovery of new model DBRX as beyond scope of book authors’ complaint v. Mosaic LLM

Like this:

Leave a ReplyCancel reply

Databricks opposes discovery of new model DBRX as beyond scope of book authors’ complaint v. Mosaic LLM

Share this:

Like this:

Leave a ReplyCancel reply

Discover more from Chat GPT Is Eating the World