Researchers Reveal ‘Deceptive Delight’ Method to Jailbreak AI Models

October 23, 2024
No Comments

Researchers Reveal ‘Deceptive Delight’ Method to Jailbreak AI Models

Cybersecurity researchers have shed light on a new adversarial technique that could be used to jailbreak large language models (LLMs) during the course of an interactive conversation by sneaking in an undesirable instruction between benign ones.
The approach has been codenamed Deceptive Delight by Palo Alto Networks Unit 42, which described it as both simple and effective, achieving an average

Leave a Reply Cancel reply

Why AI Governance Needs Separate Models for Internal and External Agents March 4, 2026
As AI adoption matures, one trend is becoming impossible to ignore: the line between internal and customer-facing capabilities is blurring. AI agents that automate internal workflows or support employees are now being adapted into customer-facing use cases, powering chat assistants, personalization engines, and automated onboarding experiences. But these are two different animals. Internal AI agents […]
Itamar Apelblat
The Modern CISO: Building Cyber-Resilient Teams in an Era of AI-Driven Threats March 3, 2026
For much of the last decade, the CISO’s job has been framed as a race against increasingly sophisticated adversaries armed with automation, AI, and an expanding arsenal of attack tools. We’ve been told that security teams are losing ground, that attackers are always one step ahead, that the next breach is inevitable and unstoppable. The […]
Philip Chapman
ReliaQuest’s 2026 Annual Threat Report: AI Powers Faster, Smarter Attacks March 3, 2026
ReliaQuest’s 2026 Annual Threat Report reveals that 2025 saw an unparalleled escalation in AI- and automation-facilitated cyberattacks. Incident data from 2024 was compared to 2025, and ReliaQuest found that threat actors are now faster than ever. To remain ahead of the curve, security practitioners will need to adopt AI in their own defense or be left behind. AI Increased Attack Speeds Dramatically In 2025, AI not […]
Kirsten Doyle
UK Solicitor Investigated After Uploading Client Files to ChatGPT February 27, 2026
A UK solicitor is under investigation for allegedly violating client confidentiality and waiving legal privilege after they confessed to uploading their clients’ confidential documents to ChatGPT. This is in line with a warning issued by the Upper Tribunal that the use of open AI tools in such a manner may violate client confidentiality and waive […]
Kirsten Doyle
AI Theater, Real Risk: What Moltbook Reveals About API Security February 27, 2026
In early 2026, a platform called Moltbook, later renamed OpenClaw, went viral for what appeared to be a startling development. Autonomous AI agents were posting, debating, upvoting, and forming communities without human participation. Basically, how most end-of-the-world sci-fi movies start. Headlines hinted at emergent coordination. Some observers worried about rogue systems. The reality was a […]
Eric Schwake
Lazarus Group Turns to Medusa Ransomware in Escalating Global Extortion Campaign February 26, 2026
New evidence indicates that the North Korean state-sponsored Lazarus Group has adopted the infamous Medusa ransomware in its extortion attacks, including those against the healthcare and nonprofit sectors. The Threat Hunter Team from Symantec and Carbon Black says these attacks have been increasing since Medusa’s launch in 2023 as a “ransomware-as-a-service” (RaaS) tool. The malware, operated by a […]
Kirsten Doyle
Why Cyber Risk Gets Lost in the Boardroom February 26, 2026
Cyber Risk is now a standing item in most boardrooms. You’ll find it in annual reports, audit committees, and regulatory filings. And still, cyber risk is not being addressed. Not because boards don’t care, or because CISOs are not reporting. But because something fundamental is still not working between security and governance. We posed these three questions to six […]
Kirsten Doyle
PayPal Customer Data Exposed for Six Months in Breach February 24, 2026
PayPal has disclosed a data breach that exposed some of its customers’ personal information and led to fraudulent transactions. The company said it happed due to an error in its PayPal Working Capital (“PPWC”) loan application, an offering that gives businesses a cash advance based on their PayPal sales history. Between 1 July and 13 December 2025, the PII of a small number […]
Kirsten Doyle
Americans Lost Over $20 million in ATM “Jackpotting” Attacks February 24, 2026
Malware-fuelled ATM “jackpotting” attacks are surging across the United States, with the FBI warning that incidents have spiked sharply in 2025. In a recent alert, the Bureau said it has recorded around 1,900 ATM jackpotting incidents since 2020. Alarmingly, more than 700 of those cases (representing over $20 million in losses) have happened this year alone. […]
Kirsten Doyle
Microsoft Copilot Flaw Exposed Confidential Emails February 24, 2026
A bug has been causing Microsoft Copilot to read and summarise users’ confidential emails, and it’s been happening since late January. Microsoft says the issue stems from a code error that bypassed data loss prevention (DLP) policies designed to stop sensitive information from being accessed in the first place. It was first reported by BleepingComputer. “Users’ email messages with a confidential label applied are […]
Kirsten Doyle

Researchers Reveal ‘Deceptive Delight’ Method to Jailbreak AI Models

Gophish Framework Used in Phishing Campaigns to Deploy Remote Access Trojans

Think You’re Secure? 49% of Enterprises Underestimate SaaS Risks

Leave a Reply Cancel reply

[email protected]

Researchers Reveal ‘Deceptive Delight’ Method to Jailbreak AI Models

Researchers Reveal ‘Deceptive Delight’ Method to Jailbreak AI Models

Share :

Leave a Reply Cancel reply