AI Security Research

2,560+ academic papers on AI security, attacks, and defenses

Total
2,560
Attack
982
Benchmark
736
Defense
350
Tool
275
Survey
144

Showing 241–260 of 558 papers

Clear filters
Benchmark MEDIUM

Large-scale online deanonymization with LLMs

Simon Lermen, Daniel Paleka, Joshua Swanson +3 more

We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News...

2 months ago cs.CR cs.AI cs.LG PDF
Benchmark LOW

Towards a Science of AI Agent Reliability

Stephan Rabanser, Sayash Kapoor, Peter Kirgis +3 more

AI agents are increasingly deployed to execute important tasks. While rising accuracy scores on standard benchmarks suggest rapid progress, many...

2 months ago cs.AI cs.CY cs.LG PDF
Benchmark MEDIUM

Backdooring Bias in Large Language Models

Anudeep Das, Prach Chantasantitam, Gurjot Singh +3 more

Large language models (LLMs) are increasingly deployed in settings where inducing a bias toward a certain topic can have significant consequences,...

2 months ago cs.CR cs.AI PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial