AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total
2,077
Attack
809
Benchmark
603
Defense
272
Tool
226
Survey
113

Showing 501–520 of 701 papers

Clear filters
Attack HIGH

Black-Box Guardrail Reverse-engineering Attack

Hongwei Yao, Yun Xia, Shuo Shao +3 more

Large language models (LLMs) increasingly employ guardrails to enforce ethical, legal, and application-specific constraints on their outputs. While...

4 months ago cs.CR cs.CL PDF
Attack HIGH

Jailbreaking in the Haystack

Rishi Rajesh Shah, Chen Henry Wu, Shashwat Saxena +3 more

Recent advances in long-context language models (LMs) have enabled million-token inputs, expanding their capabilities across complex tasks like...

4 months ago cs.CR cs.AI cs.CL PDF
Attack HIGH

Optimizing AI Agent Attacks With Synthetic Data

Chloe Loughridge, Paul Colognese, Avery Griffin +3 more

As AI deployments become more complex and high-stakes, it becomes increasingly important to be able to estimate their risk. AI control is one...

4 months ago cs.AI PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial