What is data poisoning in the context of AI models?

Data poisoning refers to the manipulation of AI language models by introducing misleading or contaminated texts into their training datasets, which can lead to the models producing incorrect or ambiguous outputs.

Why is the data poisoning threat significant for AI models like ChatGPT?

The threat is significant because most AI models are trained on public internet data, meaning that any forged content can negatively influence their behavior, potentially undermining their reliability for sensitive tasks such as medical, legal, and security applications.

Who conducted the research on data poisoning and AI models?

The research was conducted by teams from the UK Centre for AI, the Alan Turing Institute, and Entropic.

What are the recommended measures to combat data poisoning?

Researchers recommend strengthening filtering and validation mechanisms for data sources, developing tools to detect contaminated content, and imposing strong transparency standards in AI model update processes.

What could happen if no action is taken against data poisoning?

If no effective action is taken, it may limit the safe reliance on AI in critical areas, increasing the risk of misinformation and unreliable outputs.

Researchers Warn: 'Poisoning' the Internet Threatens Behavior of Models Like ChatGPT

Q: How many contaminated documents are needed to affect AI model outputs?

Research indicated that introducing about 250 contaminated documents is sufficient to negatively impact the outputs of models like ChatGPT and Gemini.

Researchers have warned that language AI models, such as ChatGPT and Gemini, can be manipulated by introducing misleading texts on the internet — known as 'data poisoning' — leading to the production of incorrect or ambiguous content.

Summary of Findings

Teams from the UK Centre for AI, the Alan Turing Institute, and Entropic conducted a training experiment that showed that introducing about 250 contaminated documents is sufficient to negatively impact the outputs of the models. After that, the models produced vague and unreliable texts, demonstrating the ease with which malicious actors can influence the behavior of systems.

How is the attack carried out?

The attack relies on spreading fake or contaminated articles and posts in public places on the internet (personal websites, blogs, Wikipedia, etc.), making this material part of the dataset that is later used to train or update the models. According to the researchers, creating about 250 contaminated articles may be enough to change the model's behavior.

Why is this dangerous?

Most models are trained on public data from the internet, so any forged content becomes a potential source for learning.

Data poisoning undermines reliance on AI for sensitive tasks (medical, legal, security).

The attack is relatively easy to execute and its risks are widespread because victims may not quickly detect the manipulation.

Recommendations from Researchers and Expected Impacts

Researchers call for:

Strengthening filtering and validation mechanisms for data sources before using them in training.

Developing tools to detect contaminated content and mechanisms to trace the source of data.

Imposing strong transparency standards in AI model update processes.

Researchers point out that failing to take effective action may limit the safe reliance on AI in vital areas.

Researchers Warn: 'Poisoning' the Internet Threatens Behavior of Models Like ChatGPT

Share News

Tags

Latest News

New Recognition of the State of Palestine .. Belgium Joins Britain, Canada, and Australia

Texas Lottery Results September 3, 2025: Powerball Numbers with a $1.4 Billion Jackpot and Complete Pick 3 Results

Closure of the Streameast Platform: The Fall of the World's Largest Illegal Sports Streaming Site

Trump Extends State of Emergency Related to Syria for Another Year

The Egyptian Interior Ministry Denies Claims of 'Suicide Messages' Inside Prisons and Confirms: No Strikes or Suicide Attempts

Related News

International Oversight on Content .. and Countries Taking Action to Protect Minors

How do chatbots affect the human mind?

Researchers Warn: 'Poisoning' the Internet Threatens Behavior of Models Like ChatGPT

Authors File Lawsuit Against "Apple" for Using Pirated Books to Develop Its Artificial Intelligence