AI Component

API

AI APIs are the boundary between the application and the model. Self-hosted inference servers (vLLM, Triton, Ollama, TGI) and third-party gateways (LiteLLM, OpenRouter) expose OpenAI-compatible endpoints, and the same web-app vulnerability classes appear here: missing or weak authentication on /v1/chat/completions, broken authorization between tenants, lack of rate limiting that lets an attacker drain quota or burn GPU time, and overly permissive CORS that leaks API keys from browser-side calls. The blast radius is unusual: a single auth-bypass on an inference endpoint exposes both data and compute, and in the case of paid hosted models, directly costs money. We have seen production CVEs across most popular self-hosted servers in the last 18 months. Defenses: require auth on every endpoint, per-tenant rate limits, separate key scopes for read vs admin, and pin server versions aggressively.

325
Total CVEs
17
Pages
Page 13 of 17
Current
Severity CVE CVSS
HIGH GHSA-8372-7vhw-cm6q -
HIGH GHSA-5fw2-mwhh-9947 -
HIGH GHSA-w47f-j8rh-wx87 -
LOW CVE-2026-6597 2.7
MEDIUM CVE-2026-6608 5.3
MEDIUM CVE-2026-41495 5.3
HIGH CVE-2026-41279 7.5
HIGH CVE-2026-41266 7.5
MEDIUM CVE-2026-6393 4.3
CRITICAL GHSA-r75f-5x8p-qvmc -
HIGH GHSA-xqmj-j6mv-4862 -
HIGH GHSA-v4p8-mg3p-g94g -
MEDIUM GHSA-h2vw-ph2c-jvwf -
LOW GHSA-v8qf-fr4g-28p2 -
UNKNOWN CVE-2026-41686 -
UNKNOWN CVE-2026-42226 -
UNKNOWN CVE-2026-42227 -
UNKNOWN CVE-2026-42230 -
MEDIUM GHSA-93rg-2xm5-2p9v -
LOW CVE-2026-7846 2.6

Page 13 of 17