AI systems lose their safety awareness as conversations continue, increasing the chance of harmful replies, a report revealed.
A few prompts can override most safety barriers in artificial intelligence tools, the report stated.
Cisco Tests Chatbots for Security Weaknesses
Cisco tested large language models from OpenAI, Mistral, Meta, Google, Alibaba, DeepSeek, and Microsoft to measure how many questions it took to get them to reveal unsafe or illegal information.
Researchers ran 499 conversations using “multi-turn attacks,” in which a user asks a series of related questions designed to gradually slip past safety filters.
Each chat contained five to ten exchanges.
They compared responses to see how likely each model was to share harmful or inappropriate content, including private company data or misinformation.
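Cisco has not published its test harness, but the difference between the two set-ups can be pictured in a short, purely illustrative Python sketch. Everything in it is hypothetical: send_chat stands in for a call to whichever chat model is being tested, and looks_harmful for the judgment step that flags an unsafe reply; neither is a real API from the report or from any of the vendors named above.

from typing import Dict, List

def send_chat(messages: List[Dict[str, str]]) -> str:
    # Hypothetical placeholder for a chat-completion call to the model under test.
    return "[model reply]"

def looks_harmful(reply: str) -> bool:
    # Hypothetical placeholder for the check that flags unsafe or leaked content.
    return False

def single_turn_probe(prompt: str) -> bool:
    # One question, one answer: the model sees the full intent at once.
    reply = send_chat([{"role": "user", "content": prompt}])
    return looks_harmful(reply)

def multi_turn_probe(prompts: List[str]) -> bool:
    # Five to ten smaller steps: each turn carries the earlier context,
    # letting the questioner refine the wording until a safeguard slips.
    history: List[Dict[str, str]] = []
    for prompt in prompts:
        history.append({"role": "user", "content": prompt})
        reply = send_chat(history)
        history.append({"role": "assistant", "content": reply})
        if looks_harmful(reply):
            return True
    return False

The point of the comparison is the growing conversation history: in the multi-turn case the model answers each new question in the context of everything said before, which is where, according to the report, safety rules tend to erode.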
When users asked multiple questions, 64 per cent of chats produced malicious content, compared to 13 per cent when users asked only one.
Results ranged from 26 per cent for Google’s Gemma to 93 per cent for Mistral’s Large Instruct model.
Open Models Shift Safety Responsibility
Cisco warned that multi-turn attacks could spread harmful content or let hackers access confidential company data.
AI systems often fail to apply safety rules over longer conversations, allowing attackers to refine prompts and bypass safeguards.
Mistral, Meta, Google, OpenAI, and Microsoft all offer open-weight models, which allow the public to download them and inspect the safety parameters they were trained with.
Cisco said these open models ship with lighter built-in safety features, shifting responsibility for safeguards onto whoever adapts them.
Google, OpenAI, Meta, and Microsoft say they have strengthened defences against their models being adjusted or fine-tuned by malicious actors.
AI companies face criticism for weak safety measures that enable criminal misuse.
In August, Anthropic said that criminals had exploited its Claude model to steal personal data and demand ransoms exceeding $500,000 (€433,000).

