Reddit sues Perplexity for scraping of posts, expanding user data battle with AI industry




Social media giant Reddit has launched a lawsuit against artificial intelligence company Perplexity, alleging that it illegally scraped user posts to train its AI model, marking the latest data-rights clash between content owners and the AI industry. 

The complaint, filed in New York federal court on Wednesday, also names three other defendants that Reddit says helped Perplexity collect its data: Lithuanian data scraper Oxylabs, “former Russian botnet” AWMProxy, and Texas startup SerpApi.

Reddit alleged that the three smaller entities were able to extract its copyrighted content “by masking their identities, hiding their locations and disguising their web scrapers as regular people.”

Perplexity, which runs an AI-powered search engine, denied the allegations and accused Reddit of “extortion” and opposition to an open internet, while SerpApi told CNBC it “strongly disagrees” with Reddit’s claims and intends to defend itself in court. CNBC was unable to reach Oxylabs and AWMProxy.

The case is one of many filed by content owners accusing AI firms of using copyrighted material without permission to train their large language models. Reddit, in particular, has been on the front lines of that battle, having launched a similar, ongoing lawsuit against AI startup Anthropic in June.

In a statement shared with CNBC, Ben Lee, Chief Legal Officer at Reddit, said that AI companies are “locked in an arms race for quality human content” and that pressure has fueled an “industrial-scale ‘data laundering’ economy.”

Scrapers bypass technological protections to steal data, then sell it to clients hungry for training material. Reddit is a prime target because it’s one of the largest and most dynamic collections of human conversation ever created.

Reddit — which hosts over 100,000 interest-based “subreddit” communities — said in its lawsuit that its user posts had become the most commonly cited source for AI-generated answers on Perplexity. 

Reddit added that it sent Perplexity a cease-and-desist letter, after which Perplexity “increased the volume of citations to Reddit forty-fold.”

AI researchers have previously noted that Reddit’s large volume of moderated conversations can help make AI chatbots produce more natural-sounding responses.

In the age of artificial intelligence, Reddit has worked to leverage its massive data pool, permitting access to it only through AI-related licensing agreements. The social media company has signed such agreements with OpenAI and Alphabet’s Google.

Responding to the lawsuit in a post on Reddit, Perplexity argued that it does not train AI models on content but merely summarizes and cites public Reddit discussions. Therefore, it said, it is “impossible” to sign a license agreement.

“A year ago, after explaining this, Reddit insisted we pay anyway, despite lawfully accessing Reddit data. Bowing to strong arm tactics just isn’t how we do business,” the statement read, going on to describe the suit as a “show of force in Reddit’s training data negotiations with Google and OpenAI.” 

“Perplexity believes this is a sad example of what happens when public data becomes a big part of a public company’s business model,” Perplexity added, noting that data licensing has become an increasingly important source of revenue for Reddit. 

In February, Reddit’s COO Jen Wong told the trade publication Adweek that AI licensing deals with Google and OpenAI made up nearly 10% of Reddit’s revenue. 


