AI Security Research

2,560+ academic papers on AI security, attacks, and defenses

Total: 2,560
Attack: 982
Benchmark: 736
Defense: 350
Tool: 275
Survey: 144

Showing 161–180 of 442 papers

Benchmark MEDIUM

Moral Sycophancy in Vision Language Models

Shadman Rabby, Md. Hefzul Hossain Papon, Sabbir Ahmed +3 more

Sycophancy in Vision-Language Models (VLMs) refers to their tendency to align with user opinions, often at the expense of moral or factual accuracy....

3 months ago cs.AI
Benchmark MEDIUM

Trust The Typical

Debargha Ganguly, Sreehari Sankar, Biyao Zhang +8 more

Current approaches to LLM safety fundamentally rely on a brittle cat-and-mouse game of identifying and blocking known threats via guardrails. We...

3 months ago cs.CL cs.AI cs.DC
