CVE-2026-45667: open-webui: unauth endpoint drains embedding budget/DoS

GHSA-m69w-p7m4-585j MEDIUM CISA: TRACK*
Published May 14, 2026
CISO Take

Open WebUI's `/api/v1/memories/ef` endpoint accepts GET requests without any authentication, causing the application to invoke its configured embedding function — potentially a paid provider such as OpenAI or Azure — on every unauthenticated request. For teams using paid embedding backends, any external attacker who discovers this endpoint can drain embedding API budget indefinitely with a trivial scripted loop; the package's history of 91 tracked CVEs signals systemic security hygiene issues that elevate overall deployment risk. The CVSS 6.5 score materially understates financial blast radius for organizations paying per-token on embedding calls. Upgrade to v0.8.0 immediately — the vulnerable endpoint was removed entirely — or block `/api/v1/memories/ef` at your WAF or reverse proxy and audit your embedding provider's billing dashboard for anomalous spend spikes.

Sources: NVD GitHub Advisory ATLAS

What is the risk?

Risk is MEDIUM-HIGH for deployments using paid embedding providers (OpenAI, Azure OpenAI, Cohere, etc.) due to direct, unbounded financial exposure with zero exploitation complexity — a single unauthenticated GET request suffices. For self-hosted embedding models the risk degrades to availability-only (CPU/GPU DoS), remaining Medium. The attack is trivially automatable, requires no AI/ML expertise, and is detectable only through cost monitoring or access log review. ENABLE_MEMORIES defaults to true, making the majority of default Open WebUI deployments exposed without additional configuration. The package's 91 tracked CVEs indicate a pattern of insufficient security controls that should inform overall deployment posture in production AI stacks.

How does the attack unfold?

Endpoint Discovery
Attacker identifies the unauthenticated `/api/v1/memories/ef` endpoint via active network scanning, public security advisory review, or open-source code inspection of the Open WebUI repository.
AML.T0006
Unauthenticated Exploitation
Attacker sends automated GET requests to the endpoint without credentials, bypassing all authentication controls and triggering the server-side embedding function on each request.
AML.T0049
Cost Harvesting and Service Degradation
Each request generates a paid embedding API call billed to the victim's provider account while simultaneously exhausting server CPU/GPU resources, causing both financial loss and availability degradation.
AML.T0034

What systems are affected?

Package Ecosystem Vulnerable Range Patched
Open WebUI pip <= 0.7.2 0.8.0
143.3K Pushed 8d ago 77% patched ~5d to patch Full package profile →

Do you use Open WebUI? You're affected.

How severe is it?

CVSS 3.1
6.5 / 10
EPSS
0.3%
chance of exploitation in 30 days
Higher than 26% of all CVEs
Exploitation Status
Exploit Available
Exploitation: MEDIUM
Sophistication
Trivial
Exploitation Confidence
medium
CISA SSVC: Public PoC
Composite signal derived from CISA KEV, VulnCheck KEV, CISA SSVC, EPSS, Metasploit, Exploit-DB, trickest/cve, Nuclei templates, and inthewild.io exploitation reports.

What is the attack surface?

AV AC PR UI S C I A
AV Network
AC Low
PR None
UI None
S Unchanged
C None
I Low
A Low

What should I do?

5 steps
  1. PATCH

    Upgrade open-webui to v0.8.0 or later — the /api/v1/memories/ef endpoint was removed entirely, eliminating the attack surface without requiring any configuration change.

  2. INTERIM WORKAROUND

    If immediate upgrade is not possible, block the route /api/v1/memories/ef at your reverse proxy (nginx: location = /api/v1/memories/ef { return 403; }, Caddy: use handle directive with respond 403) or WAF rule.

  3. DETECTION

    Review access logs for GET requests to /api/v1/memories/ef — any such request from an unauthenticated source is anomalous on patched systems and a confirmed attack on unpatched ones.

  4. COST MONITORING

    Check embedding provider billing dashboards (OpenAI Usage, Azure AI Foundry Cost Management) for unexpected spikes; set budget alert thresholds if not already in place.

  5. NETWORK EXPOSURE

    If Open WebUI is not meant to be internet-facing, restrict access to internal networks or VPN at the perimeter.

What does CISA's SSVC say?

Decision Track*
Exploitation poc
Automatable Yes
Technical Impact partial

Source: CISA Vulnrichment (SSVC v2.0). Decision based on the CISA Coordinator decision tree.

How is it classified?

Which compliance frameworks are affected?

This CVE is relevant to:

EU AI Act
Art. 9 - Risk Management System
ISO 42001
A.6.2 - AI System Operational Controls
NIST AI RMF
MANAGE 2.4 - Risk Treatment — Emerging Risks
OWASP LLM Top 10
LLM10 - Unbounded Consumption

Frequently Asked Questions

What is CVE-2026-45667?

Open WebUI's `/api/v1/memories/ef` endpoint accepts GET requests without any authentication, causing the application to invoke its configured embedding function — potentially a paid provider such as OpenAI or Azure — on every unauthenticated request. For teams using paid embedding backends, any external attacker who discovers this endpoint can drain embedding API budget indefinitely with a trivial scripted loop; the package's history of 91 tracked CVEs signals systemic security hygiene issues that elevate overall deployment risk. The CVSS 6.5 score materially understates financial blast radius for organizations paying per-token on embedding calls. Upgrade to v0.8.0 immediately — the vulnerable endpoint was removed entirely — or block `/api/v1/memories/ef` at your WAF or reverse proxy and audit your embedding provider's billing dashboard for anomalous spend spikes.

Is CVE-2026-45667 actively exploited?

No confirmed active exploitation of CVE-2026-45667 has been reported, but organizations should still patch proactively.

How to fix CVE-2026-45667?

1. PATCH: Upgrade open-webui to v0.8.0 or later — the `/api/v1/memories/ef` endpoint was removed entirely, eliminating the attack surface without requiring any configuration change. 2. INTERIM WORKAROUND: If immediate upgrade is not possible, block the route `/api/v1/memories/ef` at your reverse proxy (nginx: `location = /api/v1/memories/ef { return 403; }`, Caddy: use `handle` directive with `respond 403`) or WAF rule. 3. DETECTION: Review access logs for GET requests to `/api/v1/memories/ef` — any such request from an unauthenticated source is anomalous on patched systems and a confirmed attack on unpatched ones. 4. COST MONITORING: Check embedding provider billing dashboards (OpenAI Usage, Azure AI Foundry Cost Management) for unexpected spikes; set budget alert thresholds if not already in place. 5. NETWORK EXPOSURE: If Open WebUI is not meant to be internet-facing, restrict access to internal networks or VPN at the perimeter.

What systems are affected by CVE-2026-45667?

This vulnerability affects the following AI/ML architecture patterns: ML UI deployments, RAG pipelines, model serving.

What is the CVSS score for CVE-2026-45667?

CVE-2026-45667 has a CVSS v3.1 base score of 6.5 (MEDIUM). The EPSS exploitation probability is 0.34%.

What is the AI security impact?

Affected AI Architectures

ML UI deploymentsRAG pipelinesmodel serving

MITRE ATLAS Techniques

AML.T0029 Denial of AI Service
AML.T0034 Cost Harvesting
AML.T0034.000 Excessive Queries
AML.T0048.000 Financial Harm
AML.T0049 Exploit Public-Facing Application

Compliance Controls Affected

EU AI Act: Art. 9
ISO 42001: A.6.2
NIST AI RMF: MANAGE 2.4
OWASP LLM Top 10: LLM10

What are the technical details?

Original Advisory

### Summary GET `/api/v1/memories/ef` is accessible without authentication and executes `request.app.state.EMBEDDING_FUNCTION(...)`. This allows any unauthenticated caller to trigger embedding generation which can lead to direct cost exposure if a paid provider is used. Code reference: `backend/open_webui/routers/memories.py` (@router.get("/ef") -> calls `request.app.state.EMBEDDING_FUNCTION("hello world"))`. ### Details GET `/api/v1/memories/ef` is reachable without authentication and triggers request.app.state.EMBEDDING_FUNCTION("hello world"). This crosses an intended security boundary by allowing unauthenticated users to invoke potentially expensive embedding computation and/or paid upstream embedding APIs. ### PoC 1. Start Open WebUI in default configuration (no special env hardening; default ENABLE_MEMORIES is true). 2. From an unauthenticated client (no cookies/Authorization header), call: curl -i http://\<host\>:\<port\>/api/v1/memories/ef 3. Observe the server performs embedding generation and returns a response like: - HTTP 200 with JSON containing the result. How it can be abused / attacker actions: - Send repeated requests to `/api/v1/memories/ef` to: - consume CPU/GPU resources (DoS) - generate sustained outbound usage to embedding providers if configured (cost + rate-limit exhaustion) - degrade latency/availability for legitimate users ### Impact If embeddings are configured to use paid/remote providers (OpenAI/Azure/etc), an attacker can generate unlimited requests and incur charges. ## Resolution Fixed in commit [e5035ea31](https://github.com/open-webui/open-webui/commit/e5035ea31e179977e805a7032c979ff59a71860a), first released in **v0.8.0** (Feb 2026). The `/api/v1/memories/ef` route was removed entirely. It was a diagnostic/debug-style endpoint that hard-coded `"hello world"` through the embedding function without any authentication dependency; there was no legitimate caller that depended on it, so deletion was the cleaner fix than retrofitting auth. Users on `>= 0.8.0` are not affected.

Exploitation Scenario

An attacker discovers an Open WebUI instance running v0.7.2 via Shodan, automated port scanning, or by finding the instance in public security advisories. Without any credentials or prior access, the attacker writes a one-line loop: `while true; do curl -s http://target/api/v1/memories/ef; done`. Each request causes the Open WebUI backend to call `request.app.state.EMBEDDING_FUNCTION('hello world')` — invoking, for example, `openai.embeddings.create(model='text-embedding-3-small', input='hello world')` — generating a paid API call billed to the victim organization's OpenAI account at ~$0.00002 per call. At a modest 100 requests/second the attacker accumulates $0.002/sec; over 24 hours that reaches $172 — but with parallel workers or higher throughput the damage scales linearly. The attacker incurs zero cost. The victim detects the attack only when a cost alert fires or an engineer notices anomalous embedding usage in the provider dashboard.

Weaknesses (CWE)

CWE-862 — Missing Authorization: The product does not perform an authorization check when an actor attempts to access a resource or perform an action.

  • [Architecture and Design] Divide the product into anonymous, normal, privileged, and administrative areas. Reduce the attack surface by carefully mapping roles with data and functionality. Use role-based access control (RBAC) [REF-229] to enforce the roles at the appropriate boundaries. Note that this approach may not protect against horizontal authorization, i.e., it will not protect a user from attacking others with the same role.
  • [Architecture and Design] Ensure that access control checks are performed related to the business logic. These checks may be different than the access control checks that are applied to more generic resources such as files, connections, processes, memory, and database records. For example, a database may restrict access for medical records to a specific database user, but each record might only be intended to be accessible to the patient and the patient's doctor [REF-7].

Source: MITRE CWE corpus.

CVSS Vector

CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:L/A:L

Timeline

Published
May 14, 2026
Last Modified
May 14, 2026
First Seen
May 15, 2026

Related Vulnerabilities