AI Security Research

AI Threat Alert indexes 3,082+ peer-reviewed and preprint papers on AI/ML security — covering adversarial attacks, model defenses, red-teaming benchmarks, surveys, and security tooling. Papers are sourced from arXiv, classified by type and by relevance to real-world threats, and cross-referenced with the CVEs and incidents they relate to.

Adversarial attacks
Model defenses
Red-teaming benchmarks
Surveys
Security tooling

Total

3,082
Attack

1,196
Benchmark

883
Defense

421
Tool

321
Survey

181

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 1921–1940 of 3,082 papers

Other MEDIUM

Conformity and Social Impact on AI Agents

Alessandro Bellina, Giordano De Marzo, David Garcia

As AI agents increasingly operate in multi-agent environments, understanding their collective behavior becomes critical for predicting the dynamics...

5 months ago cs.AI cs.CL cs.CY PDF

Attack HIGH

Multi-turn Jailbreaking Attack in Multi-Modal Large Language Models

Badhan Chandra Das, Md Tasnim Jawad, Joaquin Molto +2 more

In recent years, the security vulnerabilities of Multi-modal Large Language Models (MLLMs) have become a serious concern in the Generative Artificial...

5 months ago cs.CR cs.AI PDF

Tool HIGH

Cyber Threat Detection and Vulnerability Assessment System using Generative AI and Large Language Model

Keerthi Kumar. M, Swarun Kumar Joginpelly, Sunil Khemka +2 more

Background: Cyber-attacks have evolved rapidly in recent years, many individuals and business owners have been affected by cyber-attacks in various...

5 months ago cs.CR cs.AI cs.LG PDF

Defense LOW

Robust Reasoning as a Symmetry-Protected Topological Phase

Ilmo Sung

Large language models suffer from "hallucinations"-logical inconsistencies induced by semantic noise. We propose that current architectures operate...

5 months ago cs.LG cond-mat.dis-nn cs.AI PDF

Benchmark MEDIUM

From Understanding to Engagement: Personalized pharmacy Video Clips via Vision Language Models (VLMs)

Suyash Mishra, Qiang Li, Srikanth Patil +1 more

Vision Language Models (VLMs) are poised to revolutionize the digital transformation of pharmacyceutical industry by enabling intelligent, scalable,...

5 months ago cs.CV cs.LG PDF

Benchmark MEDIUM

Knowledge-to-Data: LLM-Driven Synthesis of Structured Network Traffic for Testbed-Free IDS Evaluation

Konstantinos E. Kampourakis, Vyron Kampourakis, Efstratios Chatzoglou +2 more

Realistic, large-scale, and well-labeled cybersecurity datasets are essential for training and evaluating Intrusion Detection Systems (IDS). However,...

5 months ago cs.CR PDF

Attack MEDIUM

Effects of personality steering on cooperative behavior in Large Language Model agents

Mizuki Sakai, Mizuki Yokoyama, Wakaba Tateishi +1 more

Large language models (LLMs) are increasingly used as autonomous agents in strategic and social interactions. Although recent studies suggest that...

5 months ago cs.AI PDF

Tool HIGH

Defense Against Indirect Prompt Injection via Tool Result Parsing

Qiang Yu, Xinran Cheng, Chuanyi Liu

As LLM agents transition from digital assistants to physical controllers in autonomous systems and robotics, they face an escalating threat from...

5 months ago cs.AI cs.CL cs.CR PDF

Benchmark LOW

Tool-MAD: A Multi-Agent Debate Framework for Fact Verification with Diverse Tool Augmentation and Adaptive Retrieval

Seyeon Jeong, Yeonjun Choi, JongWook Kim +1 more

Large Language Models (LLMs) suffer from hallucinations and factual inaccuracies, especially in complex reasoning and fact verification tasks....

5 months ago cs.CL PDF

Benchmark MEDIUM

StealthGraph: Exposing Domain-Specific Risks in LLMs through Knowledge-Graph-Guided Harmful Prompt Generation

Huawei Zheng, Xinqi Jiang, Sen Yang +3 more

Large language models (LLMs) are increasingly applied in specialized domains such as finance and healthcare, where they introduce unique safety...

5 months ago cs.CL cs.AI PDF

Defense MEDIUM

AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs

Han Zhu, Jiale Chen, Chengkun Cai +8 more

Multi-modal Large Language Models (MLLMs) are increasingly deployed in interactive applications. However, their safety vulnerabilities become...

5 months ago cs.CL PDF

Tool HIGH

Unified Framework for Qualifying Security Boundary of PUFs Against Machine Learning Attacks

Hongming Fei, Zilong Hu, Prosanta Gope +1 more

Physical Unclonable Functions (PUFs) serve as lightweight, hardware-intrinsic entropy sources widely deployed in IoT security applications. However,...

5 months ago cs.CR PDF

Tool MEDIUM

ResMAS: Resilience Optimization in LLM-based Multi-agent Systems

Zhilun Zhou, Zihan Liu, Jiahe Liu +5 more

Large Language Model-based Multi-Agent Systems (LLM-based MAS), where multiple LLM agents collaborate to solve complex tasks, have shown impressive...

5 months ago cs.AI PDF

Attack HIGH

Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning

Zhiyuan Chang, Mingyang Li, Yuekai Huang +6 more

Large language model (LLM)-integrated applications have become increasingly prevalent, yet face critical security vulnerabilities from prompt...

5 months ago cs.AI cs.CR PDF

Attack HIGH

Constitutional Classifiers++: Efficient Production-Grade Defenses against Universal Jailbreaks

Hoagy Cunningham, Jerry Wei, Zihan Wang +26 more

We introduce enhanced Constitutional Classifiers that deliver production-grade jailbreak robustness with dramatically reduced computational costs and...

5 months ago cs.CR cs.AI PDF

Survey MEDIUM

Autonomous Agents on Blockchains: Standards, Execution Models, and Trust Boundaries

Saad Alqithami

Advances in large language models have enabled agentic AI systems that can reason, plan, and interact with external tools to execute multi-step...

5 months ago cs.AI cs.MA PDF

Tool HIGH

BackdoorAgent: A Unified Framework for Backdoor Attacks on LLM-based Agents

Yunhao Feng, Yige Li, Yutao Wu +6 more

Large language model (LLM) agents execute tasks through multi-step workflows that combine planning, memory, and tool use. While this design enables...

5 months ago cs.AI cs.CL PDF

Attack MEDIUM

Deep Dive into the Abuse of DL APIs To Create Malicious AI Models and How to Detect Them

Mohamed Nabeel, Oleksii Starov

According to Gartner, more than 70% of organizations will have integrated AI models into their workflows by the end of 2025. In order to reduce cost...

5 months ago cs.CR PDF

Survey MEDIUM

A Survey of Agentic AI and Cybersecurity: Challenges, Opportunities and Use-case Prototypes

Sahaya Jestus Lazer, Kshitiz Aryal, Maanak Gupta +1 more

Agentic AI marks an important transition from single-step generative models to systems capable of reasoning, planning, acting, and adapting over...

5 months ago cs.CR cs.AI PDF

Benchmark LOW

TSSR: Two-Stage Swap-Reward-Driven Reinforcement Learning for Character-Level SMILES Generation

Jacob Ede Levine, Yun Lyan Luo, Sai Chandra Kosaraju

The design of reliable, valid, and diverse molecules is fundamental to modern drug discovery, as improved molecular generation supports efficient...

5 months ago cs.LG cs.AI PDF

Frequently asked questions

What is AI security research?

AI security research studies how AI and machine-learning systems can be attacked and defended — covering adversarial examples, prompt injection, model poisoning, training-data extraction, and the mitigations against them. AI Threat Alert curates this research from academic sources so security teams can track the threats behind emerging AI risks.

How many AI security papers does AI Threat Alert track?

AI Threat Alert indexes 3,082+ papers on AI/ML security, classified across attack, defense, benchmark, survey, and tool categories and updated continuously.

Where do the research papers come from?

Papers are sourced from arXiv, then classified by type and by relevance to real-world AI/ML threats, and cross-referenced with the CVEs and incidents they relate to.

What topics does the AI security research cover?

Coverage spans adversarial attacks, model and system defenses, red-teaming benchmarks, literature surveys, and security tooling for LLMs, ML libraries, AI agents, and inference pipelines.

How is this different from a generic paper search?

Every paper is filtered for AI security relevance and linked to the vulnerabilities, vendors, and incidents it relates to, so the research connects directly to operational threat intelligence.

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial