
DeepSeek’s Safety Guardrails Failed Every Test Researchers Threw at Its AI Chatbot
Wired
Ever since OpenAI released ChatGPT at the end of 2022, hackers and security researchers have tried to find holes in large language models to get around their guardrails and trick them into spewing out hate speech, bomb-making instructions, propaganda, and other harmful content. Today, security researchers from Cisco and the University of Pennsylvania are publishing findings showing that, when tested with 50 malicious prompts designed to elicit toxic content, DeepSeek’s model did not detect or block a single one. In other words, the researchers say they were shocked to achieve a “100 percent attack success rate.”
The findings are part of a growing body of evidence that DeepSeek’s safety and security measures may not match those of other tech companies developing LLMs. “A hundred percent of the attacks succeeded, which tells you that there’s a trade-off,” DJ Sampath, the VP of product, AI software and platform at Cisco, tells WIRED. Separate analysis published today by the AI security company Adversa AI and shared with WIRED also suggests that DeepSeek is vulnerable to a wide range of jailbreaking tactics, from simple language tricks to complex AI-generated prompts.
History of this topic

Donald Trump administration likely to ban Chinese AI chatbot DeepSeek from US govt devices. Here’s why
Live Mint
DeepSeek usage a ‘personal choice’ for public, Government says
The Independent
AI chatbots vulnerable to indirect prompt injection attacks, researcher warns
The Hindu
AI dangerous tool, be it in Chinese or American hands: Delhi HC on plea to ban DeepSeek
The Hindu
Parmy Olson: The DeepSeek AI revolution has a security problem
Live Mint
8 Nations, including South Korea, Taiwan, France and others, restrict Chinese AI DeepSeek, raising data privacy concerns
Live Mint
Finance Ministry bans use of ChatGPT, DeepSeek for official purposes. Here’s why
Live Mint
DeepSeek AI banned on government devices
ABC
Australia joins Italy, Ireland in banning DeepSeek AI over security concerns
Live Mint
DeepSeek's models 100% more susceptible to manipulation than US-made AI models, finds research
Firstpost
DeepSeek under review: US Congress blocks use of Chinese AI for employees amid data security fears
Live Mint
DeepSeek faces South Korean investigation over data privacy issues, following scrutiny in Italy and France
Live Mint
US Pentagon blocks DeepSeek AI after employees found connecting to Chinese servers
Firstpost
DeepSeek accidentally leaked software keys, chat logs of several users on the open internet
Firstpost
OpenAI says DeepSeek stole ChatGPT data sets to train its AI Model, claims to have 'solid evidence'
Firstpost
Chinese-made AI model DeepSeek safe for use in India, says government source
New Indian Express
Did DeepSeek copy ChatGPT to make new AI chatbot? Trump adviser thinks so
Associated Press
Experts urge caution over using DeepSeek AI chatbot because of China links
The Independent
How DeepSeek users are forcing the AI to reveal the truth about Chinese executions
The Independent
US to ban DeepSeek? Trump White House to investigate if AI model's risk to national security
Firstpost
UK & Australian government officials, AI experts urge users to be cautious of DeepSeek
Firstpost
US Navy bans use of China’s DeepSeek due to ‘security and ethical concerns’ over its ‘origin’
Live Mint
Analysis: DeepSeek’s AI is giving the world a window into Chinese censorship and information control
CNN
What is DeepSeek and how this AI chatbot challenging ChatGPT, Gemini, Meta and more?
India TV News
DeepSeek down: Viral Chinese AI app not working and bans international users due to ‘malicious attacks’
The Independent
As China's DeepSeek AI Gets Hit By Large-Scale 'Cyberattack', Expert Flags Major Data Privacy Concerns
ABP News
Chinese AI chatbot DeepSeek hit by 'malicious attacks' amid popularity surge
India Today
Chinese tech startup DeepSeek says it was hit with ‘large-scale malicious attacks’
Associated Press
DeepSeek faces cyber attack after grand Wall Street opening, limits new users
Hindustan Times
What is DeepSeek, and why is it disrupting the AI sector?
The Hindu
What is China’s DeepSeek and why is it freaking out the AI world?
Live Mint
Deepfakes: A Cyber Threat Lurking in the Digital Age
The Quint
Huge AI vulnerability could put human life at risk, researchers warn
The Independent
A New Trick Uses AI to Jailbreak AI Models—Including GPT-4
Wired
Generative AI ‘helping criminals create more sophisticated cyber attacks’
The Independent
How are cybersecurity firms using AI to mitigate online threats
The Hindu
Generative AI’s Biggest Security Flaw Is Not Easy to Fix
Wired
Blending security into rapidly learning and adaptive AI proving difficult
The Hindu
Llama 2: How Mark Zuckerberg’s new AI could lead to out-of-control chatbots
The Independent
Explained | Are safeguards needed to make AI systems safe?
The Hindu
The Security Hole at the Heart of ChatGPT and Bing
Wired
Cybercriminals using ChatGPT AI bot to develop malicious tools?
Hindustan Times
DeepMind’s AI chatbot can do things that ChatGPT cannot, CEO claims
The Independent
'Deepfakes' ranked as most serious AI crime threat
India TV News
Micro-Chips: Deepfakes, tricking netizens
The Hindu