AI Component

API

AI APIs are the boundary between the application and the model. Self-hosted inference servers (vLLM, Triton, Ollama, TGI) and third-party gateways (LiteLLM, OpenRouter) expose OpenAI-compatible endpoints, and the same web-app vulnerability classes appear here: missing or weak authentication on /v1/chat/completions, broken authorization between tenants, lack of rate limiting that lets an attacker drain quota or burn GPU time, and overly permissive CORS that leaks API keys from browser-side calls. The blast radius is unusual: a single auth-bypass on an inference endpoint exposes both data and compute, and in the case of paid hosted models, directly costs money. We have seen production CVEs across most popular self-hosted servers in the last 18 months. Defenses: require auth on every endpoint, per-tenant rate limits, separate key scopes for read vs admin, and pin server versions aggressively.

325
Total CVEs
17
Pages
Page 16 of 17
Current
Severity CVE CVSS
MEDIUM CVE-2026-45386 4.3
MEDIUM CVE-2026-45385 4.3
MEDIUM CVE-2026-45365 5.4
MEDIUM CVE-2026-45351 6.5
HIGH CVE-2026-45349 7.1
MEDIUM CVE-2026-45347 4.3
MEDIUM CVE-2026-45345 6.5
MEDIUM CVE-2026-45339 6.5
HIGH CVE-2026-45338 7.7
HIGH CVE-2026-45331 8.5
MEDIUM CVE-2026-45318 5.4
LOW CVE-2026-45316 3.5
HIGH CVE-2026-45314 -
HIGH CVE-2026-45315 8.7
HIGH CVE-2026-45303 7.7
HIGH CVE-2026-45301 8.1
MEDIUM CVE-2026-45299 5.4
HIGH CVE-2026-45665 8.1
CRITICAL GHSA-wx9m-wx4f-4cmg 9.6
LOW GHSA-jgg6-4rpr-wfh7 -

Page 16 of 17