AI Security Research

2,583+ academic papers on AI security, attacks, and defenses

Total
2,583
Attack
994
Benchmark
740
Defense
355
Tool
275
Survey
146

Showing 681–700 of 1,228 papers

Clear filters
Defense MEDIUM

What Matters For Safety Alignment?

Xing Li, Hui-Ling Zhen, Lihao Yin +3 more

This paper presents a comprehensive empirical study on the safety alignment capabilities. We evaluate what matters for safety alignment in LLMs and...

4 months ago cs.CL cs.AI cs.CR PDF
Attack MEDIUM

Extracting books from production language models

Ahmed Ahmed, A. Feder Cooper, Sanmi Koyejo +1 more

Many unresolved legal questions over LLMs and copyright center on memorization: whether specific training data have been encoded in the model's...

4 months ago cs.CL cs.AI cs.LG PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial