{"id":11153,"date":"2026-03-06T10:04:06","date_gmt":"2026-03-06T10:04:06","guid":{"rendered":"https:\/\/serisec.com\/index.php\/2026\/03\/06\/hackers-can-use-indirect-prompt-injection-allows-adversaries-to-manipulate-ai-agents-with-content\/"},"modified":"2026-03-06T10:04:06","modified_gmt":"2026-03-06T10:04:06","slug":"hackers-can-use-indirect-prompt-injection-allows-adversaries-to-manipulate-ai-agents-with-content","status":"publish","type":"post","link":"https:\/\/serisec.com\/index.php\/2026\/03\/06\/hackers-can-use-indirect-prompt-injection-allows-adversaries-to-manipulate-ai-agents-with-content\/","title":{"rendered":"Hackers Can Use Indirect Prompt Injection Allows Adversaries to Manipulate AI Agents with Content"},"content":{"rendered":"<p>    Hackers Can Use Indirect Prompt Injection Allows Adversaries to Manipulate AI Agents with Content<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<p>Artificial intelligence tools are now a core part of everyday workflows \u2014 from browsers that summarize web pages to automated agents that help users make decisions online. <\/p>\n<p>As these tools become more capable, attackers are learning how to turn them against the very people they are designed to serve. <\/p>\n<p>A method called indirect prompt injection (IDPI) allows adversaries to embed hidden instructions inside ordinary-looking web content, tricking AI agents into executing commands they were never authorized to follow.<\/p>\n<p>Unlike direct prompt injection, where a person types a malicious instruction directly into a chatbot, IDPI works entirely behind the scenes. <\/p>\n<p>An attacker hides instructions inside a webpage \u2014 embedded in HTML code, user comments, metadata, or invisible text \u2014 and waits for an AI tool to visit or process that page. <\/p>\n<p>When the AI reads the page as part of a routine task, such as summarizing content or reviewing an advertisement, it may unknowingly interpret those hidden instructions as legitimate commands and act on them.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEhOvDpRZRV_-uwvZw7ojdBfN4G5hj6AQYP2JtUBD9OB595I8L_93cLp1aISbZiPstuibQE3GwnZKE1b-EsA-bJAN32sV7wQjah493lEgtyxr8M3ocMRvzGbIJJYguSvtS5TzrCQjPiQFg8ATo0EGou5ElFRNY7ryyypBpBEyVjQICrW7pdiT84DUdO5KO0\/s16000\/Threat%2520model%2520depiction%2520for%2520web-based%2520IDPI%2520%28Source%2520-%2520Unit42%29.webp?ssl=1\" alt=\"Threat model depiction for web-based IDPI (Source - Unit42)\"><figcaption class=\"wp-element-caption\">Threat model depiction for web-based IDPI (Source \u2013 Unit42)<\/figcaption><\/figure>\n<\/div>\n<p><a href=\"https:\/\/unit42.paloaltonetworks.com\/ai-agent-prompt-injection\/\" id=\"https:\/\/unit42.paloaltonetworks.com\/ai-agent-prompt-injection\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Unit 42 researchers identified that this attack<\/a> is no longer just a theory. Their analysis of large-scale real-world telemetry confirmed that IDPI attacks are actively deployed across live websites, with 22 distinct techniques documented for constructing malicious payloads. <\/p>\n<p>Their findings also revealed previously undocumented attacker goals, including the first known real-world case of IDPI being used to bypass an AI-based advertisement review system.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEjDuCnhxn6RKHicPOnxtfz7wtTPQ4lfP43LTYSFUavZRne1Q1o6KVQ6k14pg_E9xdSU_rdSfY0uK7BwmwcNDwH6wRX2mVDywC2RdKcW4G_yKEBCog3NYjQ0Lp8OcGZjuWwA1VxAMLsI5dZMMseIav90xYMBtezavWamt0Yt08DWoCdnzJcLvEICkafUlqU\/s16000\/Example%2520of%2520Hidden%2520Prompt%2520in%2520Page%2520from%2520reviewerpress%255B.%255Dcom%2520%28Source%2520-%2520Unit42%29.webp?ssl=1\" alt=\"Example of Hidden Prompt in Page from reviewerpress[.]com (Source - Unit42)\"><figcaption class=\"wp-element-caption\">Example of Hidden Prompt in Page from reviewerpress[.]com (Source \u2013 Unit42)<\/figcaption><\/figure>\n<\/div>\n<p>The range of harm these attacks can cause is broad. Attackers have used IDPI to push phishing sites up in search rankings through SEO poisoning, attempt unauthorized financial transactions, force <a href=\"https:\/\/cybersecuritynews.com\/hackers-exploit-ai-tools-misconfiguration\/\" id=\"109466\" target=\"_blank\" rel=\"noreferrer noopener\">AI tools<\/a> to reveal sensitive information, and even issue server-side commands that could destroy entire databases. <\/p>\n<p>In one observed case, a single webpage contained as many as 24 separate injection attempts, stacking multiple delivery methods to raise the odds that at least one would successfully reach the AI.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEjlQQDcPs3BmPg-ua3QyFOO2jpEhhwJIrX4boKuArR2f-0rZBoL9StVNXc2ArEU9zITVWOqxphjsy92C5oMe4WR4orWsSm4DEwzYcWGTvDaUnaHV8ka8DBbek54C9AeFOPACQqh9YjW1md7QQFC8ViyFZv91cO2X3REBvSiPACokFpNz6cCllp94JFukVI\/s16000\/HTML%2520Code%2520Excerpt%2520Showing%2520IDPI%2520from%2520reviewerpress%255B.%255Dcom%2520%28Source%2520-%2520Unit42%29.webp?ssl=1\" alt=\"HTML Code Excerpt Showing IDPI from reviewerpress[.]com (Source - Unit42)\"><figcaption class=\"wp-element-caption\">HTML Code Excerpt Showing IDPI from reviewerpress[.]com (Source \u2013 Unit42)<\/figcaption><\/figure>\n<\/div>\n<p>Across the telemetry reviewed, the most common attacker goal was producing irrelevant or disruptive AI output, accounting for 28.6% of cases, followed by data destruction at 14.2% and AI content moderation bypass at 9.5%. <\/p>\n<p>This shows that attackers are going after <a href=\"https:\/\/cybersecuritynews.com\/48-ai-vulnerabilities-220-percent\/\" id=\"62812\" target=\"_blank\" rel=\"noreferrer noopener\">AI systems<\/a> with a wide range of goals \u2014 from low-level noise generation to serious financial fraud.<\/p>\n<h2 class=\"wp-block-heading\" id=\"how-attackers-conceal-and-deliver-malicious-payloa\"><strong>How Attackers Conceal and Deliver Malicious Payloads<\/strong><\/h2>\n<p>One of the most significant findings in this research is how much effort attackers put into hiding their injected instructions. <\/p>\n<p>Rather than dropping a simple override command into a page, they layer multiple techniques on top of one another to avoid detection by both human reviewers and automated scanners, while still ensuring the <a href=\"https:\/\/cybersecuritynews.com\/codemender-rewrites-vulnerable-code\/\" id=\"129432\" target=\"_blank\" rel=\"noreferrer noopener\">AI agent<\/a> can read and act on the content.<\/p>\n<p>The most frequently observed delivery method, seen in 37.8% of cases, was visible plaintext \u2014 injecting the command directly into a page footer, where most users never look. <\/p>\n<p>HTML attribute cloaking was the second most common method at 19.8%, placing the malicious prompt inside HTML tag attributes where it is invisible in the browser but readable by an AI. <\/p>\n<p>CSS rendering suppression followed at 16.9%, with attackers making text invisible by setting font sizes to zero or pushing content far off-screen.<\/p>\n<p>For jailbreaking \u2014 convincing the AI to obey the injected command despite safety filters \u2014 social engineering dominated, appearing in 85.2% of cases. <\/p>\n<p>Attackers presented their instructions as if they came from a developer or administrator, using triggers like \u201cgod mode\u201d or \u201cdeveloper mode\u201d to make the model believe compliance was valid and urgent.<\/p>\n<p>Security teams and AI developers should treat untrusted web content as a potential attack source and apply input validation wherever AI agents process external data. <\/p>\n<p>Deploying spotlighting techniques \u2014 separating untrusted content from trusted system instructions \u2014 can reduce attack exposure. AI systems should follow least-privilege design, requiring explicit user approval before taking high-impact actions. <\/p>\n<p>Detection tools must move beyond keyword filters to incorporate behavioral analysis and intent classification capable of catching IDPI attempts that rely on encoding schemes, obfuscation, or multilingual methods to bypass defenses.<\/p>\n<p class=\"has-text-align-center has-background\" style=\"background:linear-gradient(180deg,rgb(238,238,238) 91%,rgb(169,184,195) 100%)\"><strong>Follow us on\u00a0<a href=\"https:\/\/news.google.com\/publications\/CAAqMggKIixDQklTR3dnTWFoY0tGV041WW1WeWMyVmpkWEpwZEhsdVpYZHpMbU52YlNnQVAB?hl=en-IN&amp;gl=IN&amp;ceid=IN:en\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Google News<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/company\/cybersecurity-news\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">LinkedIn<\/a>,\u00a0and\u00a0<a href=\"https:\/\/x.com\/cyber_press_org\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X<\/a>\u00a0to Get More Instant Updates<\/strong>,\u00a0<strong>Set CSN as a Preferred Source in\u00a0<a href=\"https:\/\/www.google.com\/preferences\/source?q=cybersecuritynews.com\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Google<\/a>.<\/strong><\/p>\n<p>The post <a href=\"https:\/\/cybersecuritynews.com\/hackers-can-use-indirect-prompt-injection-allows-adversaries\/\">Hackers Can Use Indirect Prompt Injection Allows Adversaries to Manipulate AI Agents with Content<\/a> appeared first on <a href=\"https:\/\/cybersecuritynews.com\/\">Cyber Security News<\/a>.<\/p>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    Tushar Subhra Dutta<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/cybersecuritynews.com\/hackers-can-use-indirect-prompt-injection-allows-adversaries\/\">Go to cyber-security-news<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Hackers Can Use Indirect Prompt Injection Allows Adversaries to Manipulate AI Agents with Content Artificial intelligence tools are now a core part of everyday workflows \u2014 from browsers that summarize web pages to automated agents that help users make decisions online. As these tools become more capable, attackers are learning how to turn them against [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[129,63,649],"tags":[130],"class_list":["post-11153","post","type-post","status-publish","format-standard","hentry","category-cyber-security","category-cyber-security-news","category-threats","tag-cyber-security-news"],"_links":{"self":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/11153"}],"collection":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/comments?post=11153"}],"version-history":[{"count":0,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/11153\/revisions"}],"wp:attachment":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/media?parent=11153"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/categories?post=11153"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/tags?post=11153"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}