AI Security Research

AI Threat Alert indexes 3,023+ peer-reviewed and preprint papers on AI/ML security — covering adversarial attacks, model defenses, red-teaming benchmarks, surveys, and security tooling. Papers are sourced from arXiv, classified by type and by relevance to real-world threats, and cross-referenced with the CVEs and incidents they relate to.

Total: 3,023
Attack: 1,175
Benchmark: 866
Defense: 407
Tool: 319
Survey: 176

Showing 541–560 of 866 papers

Benchmark LOW

Towards Faithful Reasoning in Comics for Small MLLMs

Chengcheng Feng, Haojie Yin, Yucheng Jin +1 more

Comic-based visual question answering (CVQA) poses distinct challenges to multimodal large language models (MLLMs) due to its reliance on symbolic...

5 months ago cs.CV cs.AI PDF

Benchmark LOW

BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models

Sunny Gupta, Shounak Das, Amit Sethi

Vision language foundation models such as CLIP exhibit impressive zero-shot generalization yet remain vulnerable to spurious correlations across...

5 months ago cs.CV cs.AI cs.LG PDF

Benchmark MEDIUM

Exploring Approaches for Detecting Memorization of Recommender System Data in Large Language Models

Antonio Colacicco, Vito Guida, Dario Di Palma +2 more

Large Language Models (LLMs) are increasingly applied in recommendation scenarios due to their strong natural language understanding and generation...

5 months ago cs.IR cs.AI cs.CL PDF

Benchmark LOW

AI Agent Systems: Architectures, Applications, and Evaluation

Bin Xu

AI agents -- systems that combine foundation models with reasoning, planning, memory, and tool use -- are rapidly becoming a practical interface...

5 months ago cs.AI PDF

Benchmark MEDIUM

Lying with Truths: Open-Channel Multi-Agent Collusion for Belief Manipulation via Generative Montage

Jinwei Hu, Xinmiao Huang, Youcheng Sun +2 more

As large language models (LLMs) transition to autonomous agents synthesizing real-time information, their reasoning capabilities introduce an...

5 months ago cs.CL cs.AI cs.MA PDF

Benchmark MEDIUM

JMedEthicBench: A Multi-Turn Conversational Benchmark for Evaluating Medical Safety in Japanese Large Language Models

Junyu Liu, Zirui Li, Qian Niu +7 more

As Large Language Models (LLMs) are increasingly deployed in healthcare field, it becomes essential to carefully evaluate their medical safety before...

5 months ago cs.CL cs.AI PDF

Benchmark HIGH

How Real is Your Jailbreak? Fine-grained Jailbreak Evaluation with Anchored Reference

Songyang Liu, Chaozhuo Li, Rui Pu +5 more

Jailbreak attacks present a significant challenge to the safety of Large Language Models (LLMs), yet current automated evaluation methods largely...

5 months ago cs.CR cs.CL PDF

Benchmark MEDIUM

Adaptive Hierarchical Evaluation of LLMs and SAST tools for CWE Prediction in Python

Muntasir Adnan, Carlos C. N. Kuhn

Large Language Models have become integral to software development, yet they frequently generate vulnerable code. Existing code vulnerability...

5 months ago cs.SE cs.AI PDF

Benchmark MEDIUM

MCP-SandboxScan: WASM-based Secure Execution and Runtime Analysis for MCP Tools

Zhuoran Tan, Run Hao, Jeremy Singer +2 more

Tool-augmented LLM agents raise new security risks: tool executions can introduce runtime-only behaviors, including prompt injection and unintended...

5 months ago cs.CR cs.SE PDF

Benchmark MEDIUM

Byzantine-Robust Federated Learning Framework with Post-Quantum Secure Aggregation for Real-Time Threat Intelligence Sharing in Critical IoT Infrastructure

Milad Rahmati, Nima Rahmati

The proliferation of Internet of Things devices in critical infrastructure has created unprecedented cybersecurity challenges, necessitating...

5 months ago cs.CR cs.LG PDF

Benchmark MEDIUM

NOS-Gate: Queue-Aware Streaming IDS for Consumer Gateways under Timing-Controlled Evasion

Muhammad Bilal, Omer Tariq, Hasan Ahmed

Timing and burst patterns can leak through encryption, and an adaptive adversary can exploit them. This undermines metadata-only detection in a...

5 months ago cs.CR cs.LG cs.NI PDF

Benchmark LOW

ClinicalReTrial: A Self-Evolving AI Agent for Clinical Trial Protocol Optimization

Sixue Xing, Xuanye Xia, Kerui Wu +3 more

Clinical trial failure remains a central bottleneck in drug development, where minor protocol design flaws can irreversibly compromise outcomes...

5 months ago cs.AI cs.MA PDF

Benchmark HIGH

An Empirical Evaluation of LLM-Based Approaches for Code Vulnerability Detection: RAG, SFT, and Dual-Agent Systems

Md Hasan Saju, Maher Muhtadi, Akramul Azim

The rapid advancement of Large Language Models (LLMs) presents new opportunities for automated software vulnerability detection, a crucial task in...

5 months ago cs.SE cs.AI PDF

Benchmark MEDIUM

Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

Yiming Liang, Yizhi Li, Yantao Du +14 more

Benchmarks play a crucial role in tracking the rapid advancement of large language models (LLMs) and identifying their capability boundaries....

6 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

PriceSeer: Evaluating Large Language Models in Real-Time Stock Prediction

Bohan Liang, Zijian Chen, Qi Jia +3 more

Stock prediction, a subject closely related to people's investment activities in fully dynamic and live environments, has been widely studied....

6 months ago q-fin.ST cs.LG PDF

Benchmark MEDIUM

Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs

Muhammad Abdullahi Said, Muhammad Sammani Sani

As Large Language Models (LLMs) integrate into critical global infrastructure, the assumption that safety alignment transfers zero-shot from English...

6 months ago cs.CL cs.AI cs.CY PDF

Benchmark HIGH

Language Model Agents Under Attack: A Cross Model-Benchmark of Profit-Seeking Behaviors in Customer Service

Jingyu Zhang

Customer-service LLM agents increasingly make policy-bound decisions (refunds, rebooking, billing disputes), but the same "helpful" interaction...

6 months ago cs.CR cs.HC PDF

Benchmark MEDIUM

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Zhe Huang, Hao Wen, Aiming Hao +6 more

Multimodal Large Language Models (MLLMs) have made remarkable progress in video understanding. However, they suffer from a critical vulnerability: an...

6 months ago cs.CV cs.AI PDF

Benchmark MEDIUM

Enhanced Web Payload Classification Using WAMM: An AI-Based Framework for Dataset Refinement and Model Evaluation

Heba Osama, Omar Elebiary, Youssef Qassim +4 more

Web applications increasingly face evasive and polymorphic attack payloads, yet traditional web application firewalls (WAFs) based on static rule...

6 months ago cs.CR PDF

Benchmark HIGH

Prompt-Induced Over-Generation as Denial-of-Service: A Black-Box Attack-Side Benchmark

Manu, Yi Guo, Kanchana Thilakarathna +5 more

Large Language Models (LLMs) can be driven into over-generation, emitting thousands of tokens before producing an end-of-sequence (EOS) token. This...

6 months ago cs.CR cs.AI cs.LG PDF

Frequently asked questions

What is AI security research?

AI security research studies how AI and machine-learning systems can be attacked and defended — covering adversarial examples, prompt injection, model poisoning, training-data extraction, and the mitigations against them. AI Threat Alert curates this research from academic sources so security teams can track the threats behind emerging AI risks.

How many AI security papers does AI Threat Alert track?

AI Threat Alert indexes 3,023+ papers on AI/ML security, classified across attack, defense, benchmark, survey, and tool categories and updated continuously.

Where do the research papers come from?

Papers are sourced from arXiv, then classified by type and by relevance to real-world AI/ML threats, and cross-referenced with the CVEs and incidents they relate to.
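As a rough illustration of classification by type and relevance, the sketch below models a paper record and a filter over it. It is not AI Threat Alert's actual schema or code: the `Paper` class, its field names, and `filter_papers` are hypothetical, and the two sample records are copied from the listing on this page.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass(frozen=True)
class Paper:
    """Hypothetical record for one indexed paper (field names are assumptions)."""
    title: str
    paper_type: str                    # "attack", "defense", "benchmark", "survey", or "tool"
    relevance: str                     # "high", "medium", or "low"
    arxiv_categories: Tuple[str, ...]  # e.g. ("cs.CR", "cs.CL")


def filter_papers(
    papers: Iterable[Paper],
    paper_type: Optional[str] = None,
    relevance: Optional[str] = None,
) -> List[Paper]:
    """Keep papers matching every filter that is set; None means no filter."""
    return [
        p
        for p in papers
        if (paper_type is None or p.paper_type == paper_type)
        and (relevance is None or p.relevance == relevance)
    ]


# Two entries taken from the listing on this page.
SAMPLE = [
    Paper(
        title="How Real is Your Jailbreak? Fine-grained Jailbreak Evaluation with Anchored Reference",
        paper_type="benchmark",
        relevance="high",
        arxiv_categories=("cs.CR", "cs.CL"),
    ),
    Paper(
        title="Towards Faithful Reasoning in Comics for Small MLLMs",
        paper_type="benchmark",
        relevance="low",
        arxiv_categories=("cs.CV", "cs.AI"),
    ),
]

high_relevance_benchmarks = filter_papers(SAMPLE, paper_type="benchmark", relevance="high")
```

Here `high_relevance_benchmarks` holds only the jailbreak-evaluation paper, since the other sample entry is tagged low relevance.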

What topics does the AI security research cover?

Coverage spans adversarial attacks, model and system defenses, red-teaming benchmarks, literature surveys, and security tooling for LLMs, ML libraries, AI agents, and inference pipelines.

How is this different from a generic paper search?

Every paper is filtered for AI security relevance and linked to the vulnerabilities, vendors, and incidents it relates to, so the research connects directly to operational threat intelligence.
