AI Component

API

AI APIs are the boundary between the application and the model. Self-hosted inference servers (vLLM, Triton, Ollama, TGI) and third-party gateways (LiteLLM, OpenRouter) expose OpenAI-compatible endpoints, and the same web-app vulnerability classes appear here: missing or weak authentication on /v1/chat/completions, broken authorization between tenants, lack of rate limiting that lets an attacker drain quota or burn GPU time, and overly permissive CORS that leaks API keys from browser-side calls. The blast radius is unusual: a single auth-bypass on an inference endpoint exposes both data and compute, and in the case of paid hosted models, directly costs money. We have seen production CVEs across most popular self-hosted servers in the last 18 months. Defenses: require auth on every endpoint, per-tenant rate limits, separate key scopes for read vs admin, and pin server versions aggressively.

325
Total CVEs
17
Pages
Page 12 of 17
Current
Severity CVE CVSS
HIGH GHSA-69x8-hrgq-fjj8 -
MEDIUM CVE-2026-39411 5.0
MEDIUM CVE-2026-5803 6.3
MEDIUM CVE-2026-1163 4.1
HIGH CVE-2026-40116 7.5
HIGH CVE-2026-40217 8.8
HIGH CVE-2026-40114 7.2
MEDIUM CVE-2026-40151 5.3
MEDIUM CVE-2026-35657 -
CRITICAL CVE-2026-1115 9.6
MEDIUM CVE-2026-40086 5.3
LOW GHSA-r7w7-9xr2-qq2r 3.1
HIGH GHSA-gqqj-85qm-8qhf 8.7
HIGH GHSA-6f7g-v4pp-r667 -
HIGH GHSA-4jpm-cgx2-8h37 -
HIGH GHSA-48m6-ch88-55mj 8.1
MEDIUM GHSA-m7mq-85xj-9x33 5.6
MEDIUM GHSA-6pcv-j4jx-m4vx 5.3
HIGH GHSA-xmxx-7p24-h892 -
LOW GHSA-gc9r-867r-j85f -

Page 12 of 17