AI Component

API

AI APIs are the boundary between the application and the model. Self-hosted inference servers (vLLM, Triton, Ollama, TGI) and third-party gateways (LiteLLM, OpenRouter) expose OpenAI-compatible endpoints, and the same web-app vulnerability classes appear here: missing or weak authentication on /v1/chat/completions, broken authorization between tenants, lack of rate limiting that lets an attacker drain quota or burn GPU time, and overly permissive CORS that leaks API keys from browser-side calls. The blast radius is unusual: a single auth-bypass on an inference endpoint exposes both data and compute, and in the case of paid hosted models, directly costs money. We have seen production CVEs across most popular self-hosted servers in the last 18 months. Defenses: require auth on every endpoint, per-tenant rate limits, separate key scopes for read vs admin, and pin server versions aggressively.

325
Total CVEs
17
Pages
Page 11 of 17
Current
Severity CVE CVSS
LOW CVE-2026-4993 3.3
CRITICAL GHSA-955r-262c-33jc -
MEDIUM GHSA-68f8-9mhj-h2mp -
HIGH CVE-2026-29872 8.2
HIGH CVE-2026-4399 7.5
HIGH CVE-2026-22561 7.8
MEDIUM CVE-2026-34451 -
MEDIUM CVE-2026-34450 -
MEDIUM CVE-2026-34452 -
HIGH CVE-2026-34936 7.7
MEDIUM CVE-2026-34756 6.5
CRITICAL CVE-2026-35030 9.1
UNKNOWN CVE-2026-35029 -
MEDIUM CVE-2026-34755 6.5
MEDIUM GHSA-mvv8-v4jj-g47j 6.5
MEDIUM CVE-2026-5530 6.3
HIGH CVE-2026-35020 8.4
CRITICAL CVE-2026-35022 9.8
HIGH CVE-2026-34511 -
MEDIUM GHSA-rxmx-g7hr-8mx4 -

Page 11 of 17