AI Security Research

AI Threat Alert indexes 3,023+ peer-reviewed and preprint papers on AI/ML security — covering adversarial attacks, model defenses, red-teaming benchmarks, surveys, and security tooling. Papers are sourced from arXiv, classified by type and by relevance to real-world threats, and cross-referenced with the CVEs and incidents they relate to.

Adversarial attacks
Model defenses
Red-teaming benchmarks
Surveys
Security tooling

Total

3,023
Attack

1,175
Benchmark

866
Defense

407
Tool

319
Survey

176

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 661–680 of 866 papers

Clear filters

Benchmark MEDIUM

Building Browser Agents: Architecture, Security, and Practical Solutions

Aram Vardanyan

Browser agents enable autonomous web interaction but face critical reliability and security challenges in production. This paper presents findings...

7 months ago cs.SE PDF

Benchmark HIGH

ReVul-CoT: Towards Effective Software Vulnerability Assessment with Retrieval-Augmented Generation and Chain-of-Thought Prompting

Zhijie Chen, Xiang Chen, Ziming Li +2 more

Context: Software Vulnerability Assessment (SVA) plays a vital role in evaluating and ranking vulnerabilities in software systems to ensure their...

7 months ago cs.SE PDF

Benchmark MEDIUM

Vision Language Models are Confused Tourists

Patrick Amadeus Irawan, Ikhlasul Akmal Hanif, Muhammad Dehan Al Kautsar +3 more

Although the cultural dimension has been one of the key aspects in evaluating Vision-Language Models (VLMs), their ability to remain stable across...

7 months ago cs.CV cs.CL PDF

Benchmark MEDIUM

Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism

Yinjie Zhao, Heng Zhao, Bihan Wen +1 more

As the development of AI-generated contents (AIGC), multi-modal Large Language Models (LLM) struggle to identify generated visual inputs from real...

7 months ago cs.AI PDF

Benchmark MEDIUM

AssurAI: Experience with Constructing Korean Socio-cultural Datasets to Discover Potential Risks of Generative AI

Chae-Gyun Lim, Seung-Ho Han, EunYoung Byun +51 more

The rapid evolution of generative AI necessitates robust safety evaluations. However, current safety datasets are predominantly English-centric,...

7 months ago cs.AI cs.CY cs.LG PDF

Benchmark HIGH

The Shawshank Redemption of Embodied AI: Understanding and Benchmarking Indirect Environmental Jailbreaks

Chunyang Li, Zifeng Kang, Junwei Zhang +4 more

The adoption of Vision-Language Models (VLMs) in embodied AI agents, while being effective, brings safety concerns such as jailbreaking. Prior work...

7 months ago cs.CR cs.CY cs.RO PDF

Benchmark MEDIUM

Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security

Wei Zhao, Zhe Li, Yige Li +1 more

Multimodal Large Language Models (MLLMs) have demonstrated impressive capabilities in cross-modal understanding, but remain vulnerable to adversarial...

7 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Can MLLMs Detect Phishing? A Comprehensive Security Benchmark Suite Focusing on Dynamic Threats and Multimodal Evaluation in Academic Environments

Jingzhuo Zhou

The rapid proliferation of Multimodal Large Language Models (MLLMs) has introduced unprecedented security challenges, particularly in phishing...

7 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Critical Evaluation of Quantum Machine Learning for Adversarial Robustness

Saeefa Rubaiyet Nowmi, Jesus Lopez, Md Mahmudul Alam Imon +2 more

Quantum Machine Learning (QML) integrates quantum computational principles into learning algorithms, offering improved representational capacity and...

7 months ago cs.CR PDF

Benchmark MEDIUM

Harmful Traits of AI Companions

W. Bradley Knox, Katie Bradford, Samanta Varela Castro +6 more

Amid the growing prevalence of human-AI interaction, large language models and other AI-based entities increasingly provide forms of companionship to...

7 months ago cs.HC cs.AI PDF

Benchmark HIGH

Attacking Autonomous Driving Agents with Adversarial Machine Learning: A Holistic Evaluation with the CARLA Leaderboard

Henry Wong, Clement Fung, Weiran Lin +3 more

To autonomously control vehicles, driving agents use outputs from a combination of machine-learning (ML) models, controller logic, and custom...

7 months ago cs.CR cs.CV cs.LG PDF

Benchmark MEDIUM

FLARE: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated Learning

Abolfazl Younesi, Leon Kiss, Zahra Najafabadi Samani +2 more

Federated learning (FL) enables collaborative model training while preserving data privacy. However, it remains vulnerable to malicious clients who...

7 months ago cs.LG cs.AI cs.CR PDF

Benchmark MEDIUM

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Hongwei Liu, Junnan Liu, Shudong Liu +33 more

The rapid advancement of Large Language Models (LLMs) has led to performance saturation on many established benchmarks, questioning their ability to...

7 months ago cs.CL PDF

Benchmark LOW

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

Huiyi Chen, Jiawei Peng, Dehai Min +5 more

Evaluating the robustness of Large Vision-Language Models (LVLMs) is essential for their continued development and responsible deployment in...

7 months ago cs.CV PDF

Benchmark MEDIUM

Tight and Practical Privacy Auditing for Differentially Private In-Context Learning

Yuyang Xia, Ruixuan Liu, Li Xiong

Large language models (LLMs) perform in-context learning (ICL) by adapting to tasks from prompt demonstrations, which in practice often contain...

7 months ago cs.CR PDF

Benchmark MEDIUM

SmartPoC: Generating Executable and Validated PoCs for Smart Contract Bug Reports

Longfei Chen, Ruibin Yan, Taiyu Wong +2 more

Smart contracts are prone to vulnerabilities and are analyzed by experts as well as automated systems, such as static analysis and AI-assisted...

7 months ago cs.SE cs.CR PDF

Benchmark LOW

Concept Regions Matter: Benchmarking CLIP with a New Cluster-Importance Approach

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

Contrastive vision-language models (VLMs) such as CLIP achieve strong zero-shot recognition yet remain vulnerable to spurious correlations,...

7 months ago cs.CV PDF

Benchmark MEDIUM

Privacy-Preserving Federated Learning from Partial Decryption Verifiable Threshold Multi-Client Functional Encryption

Minjie Wang, Jinguang Han, Weizhi Meng

In federated learning, multiple parties can cooperate to train the model without directly exchanging their own private data, but the gradient leakage...

7 months ago cs.CR cs.AI PDF

Benchmark LOW

GenSIaC: Toward Security-Aware Infrastructure-as-Code Generation with Large Language Models

Yikun Li, Matteo Grella, Daniel Nahmias +5 more

In recent years, Infrastructure as Code (IaC) has emerged as a critical approach for managing and provisioning IT infrastructure through code and...

7 months ago cs.CR cs.SE PDF

Benchmark HIGH

AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models

Jiayu Li, Yunhan Zhao, Xiang Zheng +4 more

Vision-Language-Action (VLA) models enable robots to interpret natural-language instructions and perform diverse tasks, yet their integration of...

7 months ago cs.CR cs.AI cs.CV PDF

Frequently asked questions

What is AI security research?

AI security research studies how AI and machine-learning systems can be attacked and defended — covering adversarial examples, prompt injection, model poisoning, training-data extraction, and the mitigations against them. AI Threat Alert curates this research from academic sources so security teams can track the threats behind emerging AI risks.

How many AI security papers does AI Threat Alert track?

AI Threat Alert indexes 3,023+ papers on AI/ML security, classified across attack, defense, benchmark, survey, and tool categories and updated continuously.

Where do the research papers come from?

Papers are sourced from arXiv, then classified by type and by relevance to real-world AI/ML threats, and cross-referenced with the CVEs and incidents they relate to.

What topics does the AI security research cover?

Coverage spans adversarial attacks, model and system defenses, red-teaming benchmarks, literature surveys, and security tooling for LLMs, ML libraries, AI agents, and inference pipelines.

How is this different from a generic paper search?

Every paper is filtered for AI security relevance and linked to the vulnerabilities, vendors, and incidents it relates to, so the research connects directly to operational threat intelligence.

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial