Microsoft AI 'Mimics' Copyrighted Works, Authors Allege in Lawsuit | eWeek

Microsoft AI ‘Mimics’ Copyrighted Works, Authors Allege in Lawsuit

Microsoft office.

Image: trazika/pixabay

Written By
Megan Crouse
Megan Crouse
Jun 26, 2025
2 minute read
eWeek content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More

A group of authors has sued Microsoft for allegedly training an artificial intelligence model using their copyrighted works without permission. The case, filed in New York on June 24, alleges the tech giant unfairly used their books to train Megatron-LM, a transformer-based algorithm that contributed to generative AI development. 

Meta Platforms, Anthropic, and OpenAI all face similar lawsuits as courts begin to set precedents on whether AI model training qualifies as transformative use. 

Authors seek damages, court order against infringement  

According to the authors — Kai Bird, Jia Tolentino, Daniel Okrent, and others — nearly 200,000 pirated books were fed into Megatron. As a result, the model could “generate a wide range of expression that mimics the syntax, voice, and themes of the copyrighted works on which it was trained,” the authors alleged in the suit first reported by Reuters

The plaintiffs seek up to $150,000 in statutory damages for each book and a court order to prevent Microsoft from engaging in similar conduct in the future. 

Generative AI companies argue that it is not possible to build large language models without vast datasets composed of text created by human authors. 

Individual court cases help build a new norm in the era of mass AI scraping 

Earlier this week, Anthropic was found to have scraped authors’ books in a manner covered under US fair use law, a California federal judge ruled. In that case, U.S. District Judge William Alsup ruled that using books for AI training is a transformative use and that the model could not be proven to have replicated exact copies of the books. Instead, the judge said, AI training should be compared to writers influencing each other rather than directly copying. 

Still, creatives have argued for years that generative AI displaces human labor while erasing the very people whose work trained the models. 

Even though AI training was found to fall under fair use in the Anthropic ruling, the case left open the possibility of compensation for the writers due to the manner in which Anthropic obtained the data from piracy websites. An upcoming trial will address that aspect. 

Similarly, a group of authors sued Meta in March over its use of LibGen, a well-known piracy website. This week, Meta had a courtroom victory in that case.

The New York Times remains locked in an ongoing court battle with OpenAI and its primary backer, Microsoft, over whether the AI companies used the newspaper’s content to train their AI models. OpenAI’s latest move was to criticize the Times for requesting that the company retain users’ ChatGPT conversations as potential evidence.

Megan Crouse

Megan Crouse has a decade of experience in business-to-business news and feature writing, including as first a writer and then the editor of Manufacturing.net. Her news and feature stories have appeared in Military & Aerospace Electronics, Fierce Wireless, TechRepublic, and eWeek. She copyedited cybersecurity news and features at Security Intelligence. She holds a degree in English Literature and minored in Creative Writing at Fairleigh Dickinson University.

eWeek Logo

eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site's focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.