CVE-2024-12055: Ollama: DoS via malicious gguf model file upload
HIGH PoC AVAILABLE CISA: TRACK*Any public-facing Ollama instance on version <=0.3.14 can be crashed with a single unauthenticated model upload, taking down all LLM inference on that server. Upgrade to the latest Ollama release immediately and restrict the model upload API to authenticated users only. If running Ollama in shared or production environments, treat this as critical until patched.
What is the risk?
High severity (CVSS 7.5) with network-accessible attack vector, zero authentication required, and trivial exploitation. Ollama is widely deployed in enterprise AI development and self-hosted inference stacks—often with default configurations that expose the API without authentication on internal networks or inadvertently to the internet. While not in CISA KEV and no confirmed active exploitation at time of writing, the near-zero exploitation barrier and broad deployment footprint make this a high-priority patch target. Impact is limited to availability—no confidentiality or integrity compromise—but sustained DoS of inference services can disrupt AI-dependent business processes.
What systems are affected?
| Package | Ecosystem | Vulnerable Range | Patched |
|---|---|---|---|
| Ollama | pip | — | No patch |
Do you use Ollama? You're affected.
How severe is it?
What is the attack surface?
What should I do?
6 steps-
PATCH
Upgrade Ollama to a version >0.3.14 immediately—this is the primary fix.
-
RESTRICT API ACCESS
Place Ollama behind a reverse proxy (nginx/Caddy) with authentication required before any API call reaches the service.
-
BLOCK MODEL UPLOADS
If model upload/creation is not required, block /api/push and model create endpoints at the network layer or via proxy ACLs.
-
NETWORK SEGMENTATION
Ollama should never be directly internet-exposed; bind to localhost or internal network interfaces only, enforce VPN access.
-
DETECT
Monitor for repeated unexpected Ollama process crashes or restarts as a potential exploitation indicator—alert on more than 2 restarts within 10 minutes.
-
AUDIT
Inventory all Ollama deployments including developer laptops, test environments, and CI/CD pipelines that may run with default insecure configurations.
What does CISA's SSVC say?
Source: CISA Vulnrichment (SSVC v2.0). Decision based on the CISA Coordinator decision tree.
How is it classified?
Which compliance frameworks are affected?
This CVE is relevant to:
Frequently Asked Questions
What is CVE-2024-12055?
Any public-facing Ollama instance on version <=0.3.14 can be crashed with a single unauthenticated model upload, taking down all LLM inference on that server. Upgrade to the latest Ollama release immediately and restrict the model upload API to authenticated users only. If running Ollama in shared or production environments, treat this as critical until patched.
Is CVE-2024-12055 actively exploited?
Proof-of-concept exploit code is publicly available for CVE-2024-12055, increasing the risk of exploitation.
How to fix CVE-2024-12055?
1. PATCH: Upgrade Ollama to a version >0.3.14 immediately—this is the primary fix. 2. RESTRICT API ACCESS: Place Ollama behind a reverse proxy (nginx/Caddy) with authentication required before any API call reaches the service. 3. BLOCK MODEL UPLOADS: If model upload/creation is not required, block /api/push and model create endpoints at the network layer or via proxy ACLs. 4. NETWORK SEGMENTATION: Ollama should never be directly internet-exposed; bind to localhost or internal network interfaces only, enforce VPN access. 5. DETECT: Monitor for repeated unexpected Ollama process crashes or restarts as a potential exploitation indicator—alert on more than 2 restarts within 10 minutes. 6. AUDIT: Inventory all Ollama deployments including developer laptops, test environments, and CI/CD pipelines that may run with default insecure configurations.
What systems are affected by CVE-2024-12055?
This vulnerability affects the following AI/ML architecture patterns: LLM inference serving, self-hosted AI infrastructure, model serving, agent frameworks, RAG pipelines.
What is the CVSS score for CVE-2024-12055?
CVE-2024-12055 has a CVSS v3.1 base score of 7.5 (HIGH). The EPSS exploitation probability is 0.82%.
What is the AI security impact?
Affected AI Architectures
MITRE ATLAS Techniques
AML.T0011.000 Unsafe AI Artifacts AML.T0029 Denial of AI Service AML.T0049 Exploit Public-Facing Application AML.T0076 Corrupt AI Model Compliance Controls Affected
What are the technical details?
Original Advisory
A vulnerability in Ollama versions <=0.3.14 allows a malicious user to create a customized gguf model file that can be uploaded to the public Ollama server. When the server processes this malicious model, it crashes, leading to a Denial of Service (DoS) attack. The root cause of the issue is an out-of-bounds read in the gguf.go file.
Exploitation Scenario
An adversary identifies a public-facing or network-accessible Ollama instance via Shodan/Censys scanning for the default port 11434. Requiring zero credentials, they craft a malformed GGUF model file with an invalid header or tensor metadata section designed to trigger the out-of-bounds read in gguf.go during parsing. They upload the file via the standard Ollama API model creation endpoint. When the server processes the malicious GGUF file, the out-of-bounds read causes an immediate process crash. All LLM inference is unavailable for every user and application depending on that endpoint. On container auto-restart, the attacker simply re-uploads the file to sustain the outage indefinitely. The entire attack requires only basic knowledge of the GGUF file format and a single HTTP request.
Weaknesses (CWE)
CWE-125 — Out-of-bounds Read: The product reads data past the end, or before the beginning, of the intended buffer.
- [Implementation] Assume all input is malicious. Use an "accept known good" input validation strategy, i.e., use a list of acceptable inputs that strictly conform to specifications. Reject any input that does not strictly conform to specifications, or transform it into something that does. When performing input validation, consider all potentially relevant properties, including length, type of input, the full range of acceptable values, missing or extra inputs, syntax, consistency across related fields, and conformance to business rules. As an example of business rule logic, "boat" may be syntactically valid because it only contains alphanumeric characters, but it is not valid if the input is only expected to contain colors such as "red" or "blue." Do not rely exclusively on looking for malicious or malformed inputs. This is likely to miss at least one undesirable input, especially if the code's environment changes. This can give attackers enough room to bypass the intended validation. However, denylis
- [Architecture and Design] Use a language that provides appropriate memory abstractions.
Source: MITRE CWE corpus.
CVSS Vector
CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H References
- huntr.com/bounties/7b111d55-8215-4727-8807-c5ed4cf1bfbe Exploit 3rd Party
- github.com/fkie-cad/nvd-json-data-feeds Exploit
Timeline
Related Vulnerabilities
CVE-2026-46339 10.0 9router: unauthenticated RCE exposes LLM API keys
Same package: ollama CVE-2026-42249 9.8 Ollama: path traversal + unsigned update = silent RCE
Same package: ollama CVE-2026-42248 9.8 Ollama: silent auto-update bypasses signature check on Windows
Same package: ollama CVE-2025-63389 9.8 ollama: Missing Auth allows unauthenticated access
Same package: ollama CVE-2026-7482 9.1 Ollama: heap OOB read leaks API keys and chat data
Same package: ollama