AI Security Research

2,560+ academic papers on AI security, attacks, and defenses

Total
2,560
Attack
982
Benchmark
736
Defense
350
Tool
275
Survey
144

Showing 141–160 of 442 papers

Clear filters
Benchmark MEDIUM

Large-scale online deanonymization with LLMs

Simon Lermen, Daniel Paleka, Joshua Swanson +3 more

We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News...

2 months ago cs.CR cs.AI cs.LG PDF
Benchmark MEDIUM

Backdooring Bias in Large Language Models

Anudeep Das, Prach Chantasantitam, Gurjot Singh +3 more

Large language models (LLMs) are increasingly deployed in settings where inducing a bias toward a certain topic can have significant consequences,...

2 months ago cs.CR cs.AI PDF
Benchmark MEDIUM

Optimizing Agent Planning for Security and Autonomy

Aashish Kolluri, Rishi Sharma, Manuel Costa +5 more

Indirect prompt injection attacks threaten AI agents that execute consequential actions, motivating deterministic system-level defenses. Such...

3 months ago cs.CR cs.LG PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial