AI Security Research

2,560+ academic papers on AI security, attacks, and defenses

Total: 2,560
Attack: 982
Benchmark: 736
Defense: 350
Tool: 275
Survey: 144

Showing 481–500 of 635 papers

Attack HIGH

Black-Box Guardrail Reverse-engineering Attack

Hongwei Yao, Yun Xia, Shuo Shao +3 more

Large language models (LLMs) increasingly employ guardrails to enforce ethical, legal, and application-specific constraints on their outputs. While...

6 months ago cs.CR cs.CL
Attack HIGH

Jailbreaking in the Haystack

Rishi Rajesh Shah, Chen Henry Wu, Shashwat Saxena +3 more

Recent advances in long-context language models (LMs) have enabled million-token inputs, expanding their capabilities across complex tasks like...

6 months ago cs.CR cs.AI cs.CL
Attack HIGH

Optimizing AI Agent Attacks With Synthetic Data

Chloe Loughridge, Paul Colognese, Avery Griffin +3 more

As AI deployments become more complex and high-stakes, it becomes increasingly important to be able to estimate their risk. AI control is one...

6 months ago cs.AI
