AI Component

Inference

Inference servers are the most actively-exploited component of the AI stack because they sit between the model and the public internet and they hold the GPU. The shape of the bugs is mostly web-app classes magnified by the cost of compute: missing auth on /v1 endpoints, SSRF that escapes the sandbox onto the platform's control plane, unsafe deserialization on model-loading paths, and path traversal in artifact-management endpoints. vLLM, Triton, TGI, BentoML, Ray Serve, and Ollama have each shipped multiple high-severity CVEs since 2023; CVE-2024-11041 in vLLM was a notable example combining prompt injection with code execution. Multi-tenant deployments are particularly exposed because a single bug typically crosses tenant boundaries. Defenses: aggressive patching, mandatory auth, network segmentation between inference and control plane, and per-tenant resource quotas to bound abuse.

698

Total CVEs

Pages

Page 1 of 35

Current

Severity	CVE	Headline	Package	CVSS
UNKNOWN	CVE-2026-25083	GROWI: Missing Auth allows unauthorized operations		-
CRITICAL	CVE-2026-25960	vllm: SSRF allows internal network access	vllm	9.8
CRITICAL	CVE-2026-30824	Flowise: auth bypass exposes NVIDIA NIM container endpoints	flowise	9.8
UNKNOWN	CVE-2018-7576	TensorFlow: NPD in 1.6.x crashes ML runtime	tensorflow	-
HIGH	CVE-2018-8825	TensorFlow 1.7: Buffer overflow enables arbitrary code exec	tensorflow	8.8
UNKNOWN	CVE-2018-7577	TensorFlow: Snappy memcpy overlap crash/mem disclosure	tensorflow	-
UNKNOWN	CVE-2019-9635	TensorFlow: NULL ptr deref DoS via malformed GIF input	tensorflow	-
CRITICAL	CVE-2019-16778	TensorFlow: heap overflow in UnsortedSegmentSum op	tensorflow	9.8
HIGH	CVE-2020-5215	TensorFlow: type confusion DoS crashes eager mode inference	tensorflow	7.5
MEDIUM	CVE-2018-21233	TensorFlow: integer overflow leaks process memory via BMP	tensorflow	6.5
MEDIUM	CVE-2020-15190	TensorFlow: null ptr deref DoS via Switch op eager runtime	tensorflow	5.3
MEDIUM	CVE-2020-15191	TensorFlow: null ptr deref in dlpack causes remote DoS	tensorflow	5.3
MEDIUM	CVE-2020-15192	TensorFlow: memory leak in dlpack DoS via low-priv input	tensorflow	4.3
HIGH	CVE-2020-15193	TensorFlow: uninitialized memory corruption via dlpack	tensorflow	7.1
MEDIUM	CVE-2020-15194	TensorFlow: DoS via SparseFillEmptyRowsGrad assertion	tensorflow	5.3
HIGH	CVE-2020-15195	TensorFlow: heap overflow in sparse gradient op	tensorflow	8.8
CRITICAL	CVE-2020-15196	TensorFlow: heap OOB read in sparse/ragged count ops	tensorflow	9.9
MEDIUM	CVE-2020-15197	TensorFlow: DoS via malformed sparse tensor input	tensorflow	6.3
MEDIUM	CVE-2020-15198	TensorFlow: heap OOB in SparseCountSparseOutput ops	tensorflow	5.4
MEDIUM	CVE-2020-15199	TensorFlow: DoS via malformed ragged tensor input	tensorflow	5.9

Page 1 of 35

Related AI Components

Framework API Agent Model Training Data RAG

Inference

Related AI Components

Weekly CISO Take + top threats