UK kicks off review into training AI models on copyrighted content

UK kicks off review into training AI models on copyrighted content


On Dec. 9, OpenAI made its artificial intelligence video generation model Sora publicly available in the U.S. and other countries.

Cfoto | Future Publishing | Getty Images

The U.K. is drawing up measures to regulate the use of copyrighted content by tech companies to train their artificial intelligence models.

The British government on Tuesday kicked off a consultation which aims to increase clarity for both the creative industries and AI developers when it comes to both how intellectual property is obtained and then used by AI firms for training purposes.

Some artists and publishers are unhappy with the way their content is being scraped freely by companies like OpenAI and Google to train their large language models — AI models trained on huge quantities of data to generate humanlike responses.

Large language models are the foundational technology behind today’s generative AI systems, including the likes of OpenAI’s ChatGPT, Google’s Gemini and Anthropic’s Claude.

Last year, The New York Times brought a lawsuit against Microsoft and OpenAI accusing the companies of infringing its copyright and abusing intellectual property to train large language models.

In response, OpenAI disputed the NYT’s allegations, stating that the use of open web data for training AI models should be considered “fair use” and that it provides an “opt-out” for rights holders “because it’s the right thing to do.”

Separately, image distribution platform Getty Images sued another generative AI firm, Stability AI, in the U.K., accusing it of scraping millions of images from its websites without consent to train its Stable Diffusion AI model. Stability AI has disputed the suit, noting that the training and development of its model took place outside the U.K.

Proposals to be considered

First, the consultation will consider making an exception to copyright law for AI training when used in the context of commercial purposes but while still allowing rights holders to reserve their rights so they can control the use of their content.

Second, the consultation will put forward proposed measures to help creators license and be remunerated for the use of their content by AI model makers, as well as give AI developers clarity over what material can be used for training their models.

The government said more work needs to be done by both the creative industries and technology firms to ensure any standards and requirements for rights reservation and transparency are effective, accessible and widely adopted.

The government is also considering proposals that would require AI model makers to be more transparent about their model training datasets and how they’re obtained so that rights holders can understand when and how their content has been used to train AI.

That could prove controversial — technology firms aren’t especially forthcoming when it comes to the data that fuels their coveted algorithms or how they train them up, given the commercial sensitivities involved in revealing those secrets to potential competitors.

Previously, under former Prime Minister Rishi Sunak, the government attempted to agree a voluntary AI copyright code of practice.

AI copyright rules: U.K. versus U.S.

In a recent interview with CNBC, the boss of app development software firm Appian said he thinks the U.K. is well placed to be the “global leader on this issue.”

“The U.K. has put a stake in the ground declaring its prioritization of personal intellectual property rights,” Matt Calkins, Appian’s CEO, told CNBC. He cited 2018’s Data Protection Act as an example of how the U.K. is “closely associated with intellectual property rights.”

The U.K. is also not “subject to the same overwhelming lobbying blitz from domestic AI leaders that the U.S. is,” Calkins added — meaning it might not be as prone to bowing down to pressure from tech giants as politicians stateside.

“In the U.S., anybody who writes a law about AI is going to hear from Amazon, Oracle, Microsoft or Google before that bill even reaches the floor,” Calkins said.

“That’s a powerful force stopping anyone from writing sensible legislation or protecting the rights of individuals whose intellectual property is being taken wholesale by these major AI players.”

The issue of potential copyright infringement by AI firms is becoming more notable as tech firms are moving toward a more “multimodal” form of AI — that is, AI systems that can understand and generate content in the form of images and video as well as text.

Last week, OpenAI made its AI video generation model Sora publicly available in the U.S. and “most countries internationally.” The tool allows a user to type out a desired scene and produce a high-definition video clip.



Source

Meta, Google under attack as court cases bypass 30-year-old legal shield
Technology

Meta, Google under attack as court cases bypass 30-year-old legal shield

Meta Platforms CEO Mark Zuckerberg arrives outside court to take the stand at trial in a key test case accusing Meta and Google’s YouTube of harming kids’ mental health through addictive platforms, in Los Angeles, California, U.S., Feb. 18, 2026. Mike Blake | Reuters For the last three decades, internet giants have been able to […]

Read More
The Tech Download: Defense startups eye Iran war windfall as U.S. and Gulf states turn to tech
Technology

The Tech Download: Defense startups eye Iran war windfall as U.S. and Gulf states turn to tech

This report is from this week’s The Tech Download newsletter. Like what you see? You can subscribe here. Once considered a taboo sector to funnel money into by venture capitalists, defense tech has seen a remarkable shift over the past few years. It raised just $869 million globally in 2020, according to deal-counting platform Dealroom — […]

Read More
Chinese chip firms hit record high revenue driven by the AI boom and U.S. curbs
Technology

Chinese chip firms hit record high revenue driven by the AI boom and U.S. curbs

China is focusing on large language models in the artificial intelligence space. Blackdovfx | Istock | Getty Images Chinese semiconductor firms have reported record revenue last year driven by AI demand, a shortage of memory chips and U.S. export restrictions that have pushed Beijing to bolster its homegrown tech industry. Analysts and the companies themselves […]

Read More