AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total: 2,077
Attack: 809
Benchmark: 603
Defense: 272
Tool: 226
Survey: 113


Benchmark MEDIUM

ceLLMate: Sandboxing Browser AI Agents

Luoxi Meng, Henry Feng, Ilia Shumailov +1 more

Browser-using agents (BUAs) are an emerging class of AI agents that interact with web browsers in human-like ways, including clicking, scrolling,...

3 months ago cs.CR cs.LG PDF
Benchmark MEDIUM

Auditing Games for Sandbagging

Jordan Taylor, Sid Black, Dillon Bowen +10 more

Future AI systems could conceal their capabilities ('sandbagging') during evaluations, potentially misleading developers and auditors. We...

3 months ago cs.AI PDF
