{"id":12678,"date":"2026-05-07T10:05:12","date_gmt":"2026-05-07T10:05:12","guid":{"rendered":"https:\/\/serisec.com\/index.php\/2026\/05\/07\/critical-ollama-memory-leak-vulnerability-exposes-300000-servers-globally\/"},"modified":"2026-05-07T10:05:12","modified_gmt":"2026-05-07T10:05:12","slug":"critical-ollama-memory-leak-vulnerability-exposes-300000-servers-globally","status":"publish","type":"post","link":"https:\/\/serisec.com\/index.php\/2026\/05\/07\/critical-ollama-memory-leak-vulnerability-exposes-300000-servers-globally\/","title":{"rendered":"Critical Ollama Memory Leak Vulnerability Exposes 300,000 Servers Globally"},"content":{"rendered":"<p>    Critical Ollama Memory Leak Vulnerability Exposes 300,000 Servers Globally<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<p>A major security flaw has placed Ollama, one of the most widely used platforms for running local AI models, at risk of a high-profile exposure event.<\/p>\n<p>The issue, dubbed \u201cBleeding Llama,\u201d allows unauthenticated <a href=\"https:\/\/cybersecuritynews.com\/hackers-exploit-ollama-model\/\" target=\"_blank\" rel=\"noreferrer noopener\">attackers to access the Ollama process and extract sensitive data<\/a> directly from memory, putting roughly 300,000 internet-facing servers worldwide at risk.<\/p>\n<p>With only three API calls, an attacker can extract prompts, system instructions, and environment variables from exposed deployments, turning AI infrastructure into an unexpected source of data leakage.<\/p>\n<p>Discovered by cybersecurity researchers at Cyera and assigned a critical CVSS score of 9.1 by the Echo CVE Numbering Authority, CVE-2026-7482 represents a massive enterprise risk.<\/p>\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEh0lYoyPGuYCo4cou6esEv-X0LxnQ9zwwFGdbk7xOLfC7EKKZ6SrrF2_k2tidon0euaYiM8PES3S0VyF2yZeAFhpOCHvappwntv-y1RhZTK-yzNBK9Svc9pXZ2sRqkebOtjKkI8G4uUjcazorDadgnn4MceCDOHPYvGX15H-z6QcbZ_ZH3z73dM0UqMZGM\/s1600\/Screenshot%25202026-05-07%2520111242%2520%25281%2529.webp?ssl=1\" alt=\"Ollama uploads models with leaks(source :cyera)\"><figcaption class=\"wp-element-caption\">Ollama uploads models with leaks (source: Cyera)<\/figcaption><\/figure>\n<p>Ollama lets users create model instances from uploaded files, including <a href=\"https:\/\/cybersecuritynews.com\/ollama-vulnerabilities-code-execution\/\" target=\"_blank\" rel=\"noreferrer noopener\">GGUF model files<\/a> used to package tensors, metadata, and other model information for local inference.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-ollama-vulnerability-exposes-servers\"><strong>Ollama Vulnerability Exposes Servers<\/strong><\/h2>\n<p>The vulnerable path is tied to the model-creation flow, where Ollama processes uploaded files via its API and prepares them for conversion and saving.<\/p>\n<p>Researchers found that a crafted GGUF file can abuse this process by declaring a tensor shape that is much larger than the actual data stored in the file, causing the server to read beyond the intended buffer.<\/p>\n<p>The weakness appears during tensor conversion, where Ollama uses Go\u2019s\u00a0unsafe\u00a0functionality for low-level memory operations instead of staying inside normal safety boundaries.<\/p>\n<p>Because the software does not properly validate that the tensor metadata matches the actual file size, the conversion routine can trigger an <a href=\"https:\/\/cybersecuritynews.com\/out-of-bounds-read-and-write\/\" target=\"_blank\" rel=\"noreferrer noopener\">out-of-bounds heap read <\/a>and capture unrelated memory contents nearby.<\/p>\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEgKyvv-I3g0D8YMp19JkjbzOUvb_PBmfpE4_qL08_D-jyaZIjVnGKF1a5vPuk8xPTMdDehgz9GfFKGKM1URtBlzRQwyMJHZA9gdmDwf1OQQ85NA9PrD6aTnurdF5hb-YAQGexaKvREYlQsG0yUA4YBWik_hq2Bc5k9lC6wj3_6QRFqGanKZdU-kZdO323E\/s1600\/Screenshot%25202026-05-07%2520110907%2520%25281%2529.webp?ssl=1\" alt=\"Attacker sends malformed GGUF tensor causing memory overread(source :cyera) \"><figcaption class=\"wp-element-caption\">Attacker sends malformed GGUF tensor, causing memory overread (source: Cyera)<\/figcaption><\/figure>\n<p>That leaked memory is then carried forward into a newly created model file instead of being discarded.<\/p>\n<p>The attack becomes especially dangerous because researchers found a way to preserve the leaked memory in readable form during conversion.<\/p>\n<p>By using a float-16 source tensor and forcing a float-32 destination, the attacker can rely on a lossless conversion path that preserves the stolen bytes rather than corrupting them through lossy quantization.<\/p>\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEjcPrbK06GY_1Q3ugJdducNIlLwuABh5PsYYMB98kEdegx8qSphEi5G3HJLpgGxI2spDEppJ-5_e4PpBcrK-VzBmPiLReCDZIhdzF65GbYRy2T-SwdKgoiR_eLGZCUvWiRYBpEPa8K8lyE3iIqR_OvI2z0uLVmY2Qe8OxCbqYqh2VexjR6_eyTGBAMgh9g\/s1600\/Screenshot%25202026-05-07%2520110917%2520%25281%2529.webp?ssl=1\" alt=\"Quantization reversal exposes heap data(source : cyera)\"><figcaption class=\"wp-element-caption\">Quantization reversal exposes heap data (source: Cyera)<\/figcaption><\/figure>\n<p>Once the malicious model is created, Ollama\u2019s push functionality can upload it to an attacker-controlled server, effectively exfiltrating the leaked memory from the target system.<\/p>\n<p><a href=\"https:\/\/www.cyera.com\/research\/bleeding-llama-critical-unauthenticated-memory-leak-in-ollama\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">According to the Cyera research<\/a>, the leaked heap data can include user prompts, system prompts from other models, and environment variables stored by the host running Ollama.<\/p>\n<p>In enterprise environments, this may expose API keys, internal instructions, proprietary code, customer-related content, and other highly sensitive material processed by AI workflows.<\/p>\n<p>The risk grows further when <a href=\"https:\/\/cybersecuritynews.com\/1100-ollama-ai-servers-exposed\/\" target=\"_blank\" rel=\"noreferrer noopener\">Ollama is connected to external tools or coding assistants<\/a>, because those outputs can also pass through memory and become part of what an attacker steals.<\/p>\n<p>The issue affects Ollama deployments before version 0.17.1, which includes the relevant security fix referenced by the researchers and Echo.<\/p>\n<p>Organizations should upgrade immediately, remove any public exposure, place Ollama behind authentication controls, and restrict access to trusted internal networks only.<\/p>\n<p>Any environment that has been internet-accessible should also review logs, rotate secrets, and assume that prompts and environment data may already have been exposed.<\/p>\n<p class=\"has-text-align-center has-background\" style=\"background:linear-gradient(180deg,rgb(238,238,238) 94%,rgb(169,184,195) 100%)\"><strong>Follow us on <a href=\"https:\/\/news.google.com\/publications\/CAAqMggKIixDQklTR3dnTWFoY0tGV041WW1WeWMyVmpkWEpwZEhsdVpYZHpMbU52YlNnQVAB?hl=en-IN&amp;gl=IN&amp;ceid=IN:en\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Google News<\/a>, <a href=\"https:\/\/www.linkedin.com\/company\/cybersecurity-news\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">LinkedIn<\/a>, and <a href=\"https:\/\/x.com\/cyber_press_org\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X<\/a> for daily cybersecurity updates. <a href=\"https:\/\/cybersecuritynews.com\/contact-us\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Contact us<\/a> to feature your stories.<\/strong><\/p>\n<p>The post <a href=\"https:\/\/cybersecuritynews.com\/ollama-vulnerability-exposes-servers\/\">Critical Ollama Memory Leak Vulnerability Exposes 300,000 Servers Globally<\/a> appeared first on <a href=\"https:\/\/cybersecuritynews.com\/\">Cyber Security News<\/a>.<\/p>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    Abinaya<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/cybersecuritynews.com\/ollama-vulnerability-exposes-servers\/\">Go to cyber-security-news<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Critical Ollama Memory Leak Vulnerability Exposes 300,000 Servers Globally A major security flaw has placed Ollama, one of the most widely used platforms for running local AI models, at risk of a high-profile exposure event. The issue, dubbed \u201cBleeding Llama,\u201d allows unauthenticated attackers to access the Ollama process and extract sensitive data directly from memory, [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[129,63,648],"tags":[130],"class_list":["post-12678","post","type-post","status-publish","format-standard","hentry","category-cyber-security","category-cyber-security-news","category-vulnerability-news","tag-cyber-security-news"],"_links":{"self":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/12678"}],"collection":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/comments?post=12678"}],"version-history":[{"count":0,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/12678\/revisions"}],"wp:attachment":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/media?parent=12678"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/categories?post=12678"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/tags?post=12678"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}