DeepSeek’s Safety Guardrails Failed Every Test Researchers Threw at Its AI Chatbot
1 month, 4 weeks ago

DeepSeek’s Safety Guardrails Failed Every Test Researchers Threw at Its AI Chatbot

Wired  

Ever since OpenAI released ChatGPT at the end of 2022, hackers and security researchers have tried to find holes in large language models to get around their guardrails and trick them into spewing out hate speech, bomb-making instructions, propaganda, and other harmful content. Today, security researchers from Cisco and the University of Pennsylvania are publishing findings showing that, when tested with 50 malicious prompts designed to elicit toxic content, DeepSeek’s model did not detect or block a single one. In other words, the researchers say they were shocked to achieve a “100 percent attack success rate.” The findings are part of a growing body of evidence that DeepSeek’s safety and security measures may not match those of other tech companies developing LLMs. “A hundred percent of the attacks succeeded, which tells you that there’s a trade-off,” DJ Sampath, the VP of product, AI software and platform at Cisco, tells WIRED. Separate analysis published today by the AI security company Adversa AI and shared with WIRED also suggests that DeepSeek is vulnerable to a wide range of jailbreaking tactics, from simple language tricks to complex AI-generated prompts.

History of this topic

Donald Trump administration likely to ban Chinese AI chatbot DeepSeek from US govt devices. Here’s why
3 weeks, 2 days ago
DeepSeek usage a ‘personal choice’ for public, Government says
1 month, 1 week ago
AI chatbots vulnerable to indirect prompt injection attacks, researcher warns
1 month, 2 weeks ago
AI dangerous tool, be it in Chinese or American hands: Delhi HC on plea to ban DeepSeek
1 month, 2 weeks ago
Parmy Olson: The DeepSeek AI revolution has a security problem
1 month, 3 weeks ago
8 Nations, including South Korea, Taiwan, France and others, restrict Chinese AI DeepSeek, raising data privacy concerns
Top News
1 month, 3 weeks ago
Finance Ministry bans use of ChatGPT, DeepSeek for official purposes. Here’s why
1 month, 3 weeks ago
DeepSeek AI banned on government devices
Top News
1 month, 3 weeks ago
Australia joins Italy, Ireland in banning DeepSeek AI over security concerns
1 month, 3 weeks ago
DeepSeek's models 100% more susceptible to manipulation than US-made AI models, finds research
1 month, 4 weeks ago
DeepSeek under review: US Congress blocks use of Chinese AI for employees amid data security fears
1 month, 4 weeks ago
DeepSeek faces South Korean investigation over data privacy issues, following scrutiny in Italy and France
2 months ago
US Pentagon blocks DeepSeek AI after employees found connecting to Chinese servers
2 months ago
DeepSeek accidentally leaked software keys, chat logs of several users on the open internet
2 months ago
OpenAI says DeepSeek stole ChatGPT data sets to train its AI Model, claims to have 'solid evidence'
2 months ago
Chinese-made AI model DeepSeek safe for use in India, says government source
2 months ago
Did DeepSeek copy ChatGPT to make new AI chatbot? Trump adviser thinks so
2 months ago
Experts urge caution over using DeepSeek AI chatbot because of China links
2 months ago
How DeepSeek users are forcing the AI to reveal the truth about Chinese executions
2 months ago
US to ban DeepSeek? Trump White House to investigate if AI model's risk to national security
2 months ago
UK & Australian government officials, AI experts urge users to be cautious of DeepSeek
2 months ago
US Navy bans use of China’s DeepSeek due to ‘security and ethical concerns’ over its ‘origin’
2 months ago
Analysis: DeepSeek’s AI is giving the world a window into Chinese censorship and information control
2 months ago
What is DeepSeek and how this AI chatbot challenging ChatGPT, Gemini, Meta and more?
2 months ago
DeepSeek down: Viral Chinese AI app not working and bans international users due to ‘malicious attacks’
2 months ago
As China's DeepSeek AI Gets Hit By Large-Scale 'Cyberattack', Expert Flags Major Data Privacy Concerns
2 months ago
Chinese AI chatbot DeepSeek hit by 'malicious attacks' amid popularity surge
2 months ago
Chinese tech startup DeepSeek says it was hit with ‘large-scale malicious attacks’
2 months ago
DeepSeek faces cyber attack after grand Wall Street opening, limits new users
2 months ago
What is DeepSeek, and why is it disrupting the AI sector?
Top News
2 months ago
What is China’s DeepSeek and why is it freaking out the AI world?
2 months ago
Deepfakes: A Cyber Threat Lurking in the Digital Age
5 months ago
Huge AI vulnerability could put human life at risk, researchers warn
5 months, 1 week ago
A New Trick Uses AI to Jailbreak AI Models—Including GPT-4
1 year, 3 months ago
Generative AI ‘helping criminals create more sophisticated cyber attacks’
1 year, 4 months ago
How are cybersecurity firms using AI to mitigate online threats
1 year, 6 months ago
Generative AI’s Biggest Security Flaw Is Not Easy to Fix
1 year, 6 months ago
Blending security into rapidly learning and adaptive AI proving difficult
1 year, 7 months ago
Llama 2: How Mark Zuckerberg’s new AI could lead to out-of-control chatbots
1 year, 8 months ago
Explained | Are safeguards needed to make AI systems safe?
1 year, 9 months ago
The Security Hole at the Heart of ChatGPT and Bing
1 year, 10 months ago
Cybercriminals using ChatGPT AI bot to develop malicious tools?
2 years, 2 months ago
DeepMind’s AI chatbot can do things that ChatGPT cannot, CEO claims
2 years, 2 months ago
'Deepfakes' ranked as most serious AI crime threat
4 years, 7 months ago
Micro-Chips: Deepfakes, tricking netizens
6 years ago

Discover Related