AI Security Research

2,529+ academic papers on AI security, attacks, and defenses

Total
2,529
Attack
969
Benchmark
729
Defense
345
Tool
272
Survey
142

Showing 201–220 of 345 papers

Clear filters
Defense MEDIUM

Safety Alignment of LMs via Non-cooperative Games

Anselm Paulus, Ilia Kulikov, Brandon Amos +4 more

Ensuring the safety of language models (LMs) while maintaining their usefulness remains a critical challenge in AI alignment. Current approaches rely...

4 months ago cs.AI PDF
Defense LOW

Distributional AGI Safety

Nenad Tomašev, Matija Franklin, Julian Jacobs +2 more

AI safety and alignment research has predominantly been focused on methods for safeguarding individual AI systems, resting on the assumption of an...

4 months ago cs.AI PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial