CVE-2025-0315: Ollama: GGUF model upload causes memory exhaustion DoS
HIGH PoC AVAILABLE CISA: TRACK*Any attacker with network access to an Ollama instance can crash it by uploading a crafted GGUF model file—no credentials required by default. If your org runs Ollama for internal LLM inference, patch to >0.3.14 immediately or place the API behind an authenticated reverse proxy. Exposed Ollama instances on internal dev networks are at high risk given the zero-auth, low-complexity exploit path.
What is the risk?
High. The combination of network-accessible vector, zero authentication required, and low attack complexity makes this trivially exploitable by anyone who can reach the Ollama API port (default 11434). Ollama is routinely deployed without network restrictions in AI dev environments and on developer workstations, expanding the attack surface significantly. Availability impact is complete for the instance; confidentiality and integrity are unaffected. No active exploitation confirmed, but CVSS 7.5 and trivial reproducibility make this a priority patch.
What systems are affected?
| Package | Ecosystem | Vulnerable Range | Patched |
|---|---|---|---|
| Ollama | pip | — | No patch |
Do you use Ollama? You're affected.
How severe is it?
What is the attack surface?
What should I do?
6 steps-
Patch: Upgrade Ollama beyond version 0.3.14 as soon as a patched release is available; monitor the official GitHub releases page.
-
Network isolation: Restrict Ollama API (port 11434) to localhost or trusted internal subnets via firewall rules immediately.
-
Auth proxy: Place a reverse proxy with authentication (nginx + OAuth2 proxy, Caddy with auth middleware) in front of any network-accessible Ollama instance.
-
Restrict upload access: Audit and limit who can call /api/create and model push endpoints to administrators only.
-
Detection: Alert on rapid memory growth in Ollama processes, OOM killer events targeting Ollama, or unexpected model creation API calls from non-admin principals.
-
Workaround if patching is delayed: Disable the model creation endpoint at the network layer if untrusted users have API access.
What does CISA's SSVC say?
Source: CISA Vulnrichment (SSVC v2.0). Decision based on the CISA Coordinator decision tree.
How is it classified?
Which compliance frameworks are affected?
This CVE is relevant to:
Frequently Asked Questions
What is CVE-2025-0315?
Any attacker with network access to an Ollama instance can crash it by uploading a crafted GGUF model file—no credentials required by default. If your org runs Ollama for internal LLM inference, patch to >0.3.14 immediately or place the API behind an authenticated reverse proxy. Exposed Ollama instances on internal dev networks are at high risk given the zero-auth, low-complexity exploit path.
Is CVE-2025-0315 actively exploited?
Proof-of-concept exploit code is publicly available for CVE-2025-0315, increasing the risk of exploitation.
How to fix CVE-2025-0315?
1. Patch: Upgrade Ollama beyond version 0.3.14 as soon as a patched release is available; monitor the official GitHub releases page. 2. Network isolation: Restrict Ollama API (port 11434) to localhost or trusted internal subnets via firewall rules immediately. 3. Auth proxy: Place a reverse proxy with authentication (nginx + OAuth2 proxy, Caddy with auth middleware) in front of any network-accessible Ollama instance. 4. Restrict upload access: Audit and limit who can call /api/create and model push endpoints to administrators only. 5. Detection: Alert on rapid memory growth in Ollama processes, OOM killer events targeting Ollama, or unexpected model creation API calls from non-admin principals. 6. Workaround if patching is delayed: Disable the model creation endpoint at the network layer if untrusted users have API access.
What systems are affected by CVE-2025-0315?
This vulnerability affects the following AI/ML architecture patterns: LLM inference servers, model serving, local AI deployments, agent frameworks, RAG pipelines.
What is the CVSS score for CVE-2025-0315?
CVE-2025-0315 has a CVSS v3.1 base score of 7.5 (HIGH). The EPSS exploitation probability is 0.67%.
What is the AI security impact?
Affected AI Architectures
MITRE ATLAS Techniques
AML.T0011.000 Unsafe AI Artifacts AML.T0029 Denial of AI Service AML.T0049 Exploit Public-Facing Application Compliance Controls Affected
What are the technical details?
Original Advisory
A vulnerability in ollama/ollama <=0.3.14 allows a malicious user to create a customized GGUF model file, upload it to the Ollama server, and create it. This can cause the server to allocate unlimited memory, leading to a Denial of Service (DoS) attack.
Exploitation Scenario
An attacker enumerates internal network services and discovers an Ollama instance on port 11434—common in AI dev environments where developers run local LLMs. Using the Ollama REST API (no authentication required by default), the attacker crafts a malicious GGUF file with a manipulated header that declares tensor metadata requiring terabytes of memory allocation. They POST this file to the /api/create endpoint. The Ollama server parses the GGUF header and attempts to allocate the declared memory without bounds validation, triggering OOM and crashing the process. All dependent services—RAG pipelines, agent frameworks, internal chatbots relying on this inference endpoint—go offline instantly. The attacker can automate resubmission to prevent service recovery, creating a sustained DoS condition.
Weaknesses (CWE)
CWE-770 Allocation of Resources Without Limits or Throttling
Primary
CWE-770 Allocation of Resources Without Limits or Throttling CWE-770 — Allocation of Resources Without Limits or Throttling: The product allocates a reusable resource or group of resources on behalf of an actor without imposing any intended restrictions on the size or number of resources that can be allocated.
- [Requirements] Clearly specify the minimum and maximum expectations for capabilities, and dictate which behaviors are acceptable when resource allocation reaches limits.
- [Architecture and Design] Limit the amount of resources that are accessible to unprivileged users. Set per-user limits for resources. Allow the system administrator to define these limits. Be careful to avoid CWE-410.
Source: MITRE CWE corpus.
CVSS Vector
CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H References
- huntr.com/bounties/da414d29-b55a-496f-b135-17e0fcec67bc Exploit 3rd Party
Timeline
Related Vulnerabilities
CVE-2026-46339 10.0 9router: unauthenticated RCE exposes LLM API keys
Same package: ollama CVE-2026-42248 9.8 Ollama: silent auto-update bypasses signature check on Windows
Same package: ollama CVE-2025-63389 9.8 ollama: Missing Auth allows unauthenticated access
Same package: ollama CVE-2026-42249 9.8 Ollama: path traversal + unsigned update = silent RCE
Same package: ollama CVE-2026-7482 9.1 Ollama: heap OOB read leaks API keys and chat data
Same package: ollama