AI Security Research

2,583+ academic papers on AI security, attacks, and defenses

Total: 2,583
Attack: 994
Benchmark: 740
Defense: 355
Tool: 275
Survey: 146

Showing 2021–2040 of 2,583 papers

Attack MEDIUM

Large Language Models for Cyber Security

Raunak Somani, Aswani Kumar Cherukuri

This paper studies the integration of Large Language Models into cybersecurity tools and protocols. The main issue discussed in this paper is how...

6 months ago cs.CR PDF

Attack HIGH

Black-Box Guardrail Reverse-engineering Attack

Hongwei Yao, Yun Xia, Shuo Shao +3 more

Large language models (LLMs) increasingly employ guardrails to enforce ethical, legal, and application-specific constraints on their outputs. While...

6 months ago cs.CR cs.CL PDF

Attack HIGH

Jailbreaking in the Haystack

Rishi Rajesh Shah, Chen Henry Wu, Shashwat Saxena +3 more

Recent advances in long-context language models (LMs) have enabled million-token inputs, expanding their capabilities across complex tasks like...

6 months ago cs.CR cs.AI cs.CL PDF

Attack HIGH

Optimizing AI Agent Attacks With Synthetic Data

Chloe Loughridge, Paul Colognese, Avery Griffin +3 more

As AI deployments become more complex and high-stakes, it becomes increasingly important to be able to estimate their risk. AI control is one...

6 months ago cs.AI PDF

Survey LOW

A Criminology of Machines

Gian Maria Campedelli

While the possibility of reaching human-like Artificial Intelligence (AI) remains controversial, the likelihood that the future will be characterized...

6 months ago cs.CY cs.AI cs.HC PDF
