Dictionary publishers Encyclopedia Britannica and Merriam-Webster have filed a lawsuit against OpenAI, according to TechCrunch, alleging the AI company scraped nearly 100,000 articles and dictionary entries without permission to train ChatGPT. The Britannica Merriam-Webster OpenAI lawsuit was filed in Manhattan federal court on March 13, 2026, marking another major copyright battle in the rapidly evolving AI industry.

What Are the Publishers Alleging?

The complaint, as reported by Bloomberg Law, accuses OpenAI of "massive copyright infringement" in multiple ways. First, the AI company allegedly used Britannica's copyrighted articles to train its large language models. Second, ChatGPT allegedly generates outputs that contain verbatim or near-verbatim reproductions of the publishers' content. Third, OpenAI allegedly uses Britannica articles in its retrieval augmented generation (RAG) workflow, which allows the AI to pull directly from copyrighted material when answering user queries.

Britannica's legal team, as studies show, is arguing that OpenAI's practices undermine their investment in creating and maintaining fact-checked, human-written content. The publishers claim they have spent decades and significant resources producing reliable reference materials, only to have OpenAI essentially free-ride on that work to build a competing product. The lawsuit also alleges trademark infringement under the Lanham Act, because ChatGPT sometimes produces inaccurate information and presents it alongside Britannica and Merriam-Webster's famous brand names, misleading users into thinking the publishers endorsed or created the AI-generated content.

Why This Britannica Merriam-Webster OpenAI lawsuit Matters for the AI Industry

This case is part of a broader wave of legal challenges that major publishers and creators are bringing against AI companies. Authors, news outlets, and now reference publishers are all fighting back against what they see as unauthorized use of their intellectual property. As noted by Reuters, the outcomes of these cases will fundamentally shape how AI companies can access and use data for training their models.

The Britannica Merriam-Webster OpenAI lawsuit is particularly significant because reference materials like dictionaries and encyclopedias represent a unique category of high-value, fact-checked content. Unlike news articles or opinion pieces, reference works are meant to be authoritative and unbiased. If AI companies can freely train on this type of material, critics argue it could eventually allow AI systems to replace the very sources people rely on for accurate information, all without compensating the humans who created that knowledge.

OpenAI has not publicly responded to the specific allegations, though the company has previously argued in similar cases that AI training constitutes fair use under copyright law. The AI industry has broadly contended that training large language models on publicly available web content is analogous to how humans learn from reading books and articles, and that the resulting AI systems create new expression rather than copying the underlying works.

Britannica's History of Protecting Its Content

This is not Britannica's first time going to battle over AI. The company filed a nearly identical lawsuit against Perplexity AI in September 2025, which remains ongoing. That case alleged the AI search engine scraped Britannica's content to build its responses, bypassing robots.txt protections and presenting verbatim reproductions as AI-generated summaries. The similarity in the two lawsuits suggests Britannica is taking an aggressive stance against what it views as systematic theft of its intellectual property.

For Gen Z users who rely on tools like ChatGPT for quick answers and research help, this lawsuit could have real implications. If the publishers win, AI companies might need to license reference content or fundamentally change how their products work. On the other hand, if OpenAI prevails, it could set a precedent allowing AI firms to train on virtually any publicly available content without permission or payment.

The Britannica Merriam-Webster OpenAI lawsuit is being watched closely by legal experts, technology companies, and media organizations alike. It represents one of the highest-profile challenges yet to the AI industry's data practices and could ultimately determine whether the information economy built over centuries of human knowledge creation gets to share in the wealth generated by AI systems trained on that knowledge.

As this case unfolds in federal court, it will test the boundaries of copyright law in the digital age and establish important precedents for how AI companies interact with the publishers, creators, and platforms whose content helps power their products. Whatever the outcome, the Britannica Merriam-Webster OpenAI lawsuit is shaping up to be a landmark case that could reshape the AI landscape for years to come.