AI Security Research

AI Threat Alert indexes 3,023+ peer-reviewed and preprint papers on AI/ML security — covering adversarial attacks, model defenses, red-teaming benchmarks, surveys, and security tooling. Papers are sourced from arXiv, classified by type and by relevance to real-world threats, and cross-referenced with the CVEs and incidents they relate to.

Adversarial attacks
Model defenses
Red-teaming benchmarks
Surveys
Security tooling

Total

3,023
Attack

1,175
Benchmark

866
Defense

407
Tool

319
Survey

176

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 281–300 of 407 papers

Clear filters

Defense LOW

Emergent Learner Agency in Implicit Human-AI Collaboration: How AI Personas Reshape Creative-Regulatory Interaction

Yueqiao Jin, Roberto Martinez-Maldonado, Dragan Gašević +1 more

Generative AI is increasingly embedded in collaborative learning, yet little is known about how AI personas shape learner agency when AI teammates...

6 months ago cs.HC PDF

Defense LOW

Distributional AGI Safety

Nenad Tomašev, Matija Franklin, Julian Jacobs +2 more

AI safety and alignment research has predominantly been focused on methods for safeguarding individual AI systems, resting on the assumption of an...

6 months ago cs.AI PDF

Defense LOW

From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI Agents for Recruitment

Himanshu Gharat, Himanshi Agrawal, Gourab K. Patro

Large Language Models (LLMs) have empowered AI agents with advanced capabilities for understanding, reasoning, and interacting across diverse tasks....

6 months ago cs.AI cs.IR PDF

Defense MEDIUM

From Essence to Defense: Adaptive Semantic-aware Watermarking for Embedding-as-a-Service Copyright Protection

Hao Li, Yubing Ren, Yanan Cao +3 more

Benefiting from the superior capabilities of large language models in natural language understanding and generation, Embeddings-as-a-Service (EaaS)...

6 months ago cs.CR cs.CL PDF

Defense LOW

PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations

Vahideh Zolfaghari

Large language models (LLMs) are increasingly consulted by parents for pediatric guidance, yet their safety under real-world adversarial pressures is...

6 months ago cs.AI PDF

Defense MEDIUM

Cloud Security Leveraging AI: A Fusion-Based AISOC for Malware and Log Behaviour Detection

Nnamdi Philip Okonkwo, Lubna Luxmi Dhirani

Cloud Security Operations Center (SOC) enable cloud governance, risk and compliance by providing insights visibility and control. Cloud SOC triages...

6 months ago cs.CR cs.LG PDF

Defense MEDIUM

C-ing Clearly: Enhanced Binary Code Explanations using C code

Teodor Poncu, Ioana Pintilie, Marius Dragoi +2 more

Large Language Models (LLMs) typically excel at coding tasks involving high-level programming languages, as opposed to lower-level programming...

6 months ago cs.CL cs.LG PDF

Defense MEDIUM

Auto-Tuning Safety Guardrails for Black-Box Large Language Models

Perry Abdulkadir

Large language models (LLMs) are increasingly deployed behind safety guardrails such as system prompts and content filters, especially in settings...

6 months ago cs.CR cs.CL cs.LG PDF

Defense MEDIUM

Taint-Based Code Slicing for LLMs-based Malicious NPM Package Detection

Dang-Khoa Nguyen, Gia-Thang Ho, Quang-Minh Pham +5 more

Software supply chain attacks targeting the npm ecosystem have become increasingly sophisticated, leveraging obfuscation and complex logic to evade...

6 months ago cs.CR PDF

Defense MEDIUM

Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously

Andrew Adiletta, Kathryn Adiletta, Kemal Derya +1 more

The rapid deployment of Large Language Models (LLMs) has created an urgent need for enhanced security and privacy measures in Machine Learning (ML)....

6 months ago cs.CR cs.AI PDF

Defense MEDIUM

Challenges of Evaluating LLM Safety for User Welfare

Manon Kempermann, Sai Suresh Macharla Vasu, Mahalakshmi Raveenthiran +2 more

Safety evaluations of large language models (LLMs) typically focus on universal risks like dangerous capabilities or undesirable propensities....

6 months ago cs.AI cs.CY PDF

Defense MEDIUM

Phishing Email Detection Using Large Language Models

Najmul Hasan, Prashanth BusiReddyGari, Haitao Zhao +3 more

Email phishing is one of the most prevalent and globally consequential vectors of cyber intrusion. As systems increasingly deploy Large Language...

6 months ago cs.CR cs.IR PDF

Defense MEDIUM

Black-Box Behavioral Distillation Breaks Safety Alignment in Medical LLMs

Sohely Jahan, Ruimin Sun

As medical large language models (LLMs) become increasingly integrated into clinical workflows, concerns around alignment robustness, and safety are...

6 months ago cs.LG PDF

Defense MEDIUM

Secure and Privacy-Preserving Federated Learning for Next-Generation Underground Mine Safety

Mohamed Elmahallawy, Sanjay Madria, Samuel Frimpong

Underground mining operations depend on sensor networks to monitor critical parameters such as temperature, gas concentration, and miner movement,...

6 months ago cs.CR cs.LG PDF

Defense HIGH

Llama-based source code vulnerability detection: Prompt engineering vs Fine tuning

Dyna Soumhane Ouchebara, Stéphane Dupont

The significant increase in software production, driven by the acceleration of development cycles over the past two decades, has led to a steady rise...

6 months ago cs.SE cs.AI cs.CR PDF

Defense LOW

Amulet: Fast TEE-Shielded Inference for On-Device Model Protection

Zikai Mao, Lingchen Zhao, Lei Xu +4 more

On-device machine learning (ML) introduces new security concerns about model privacy. Storing valuable trained ML models on user devices exposes them...

6 months ago cs.CR PDF

Defense MEDIUM

MINES: Explainable Anomaly Detection through Web API Invariant Inference

Wenjie Zhang, Yun Lin, Chun Fung Amos Kwok +5 more

Detecting the anomalies of web applications, important infrastructures for running modern companies and governments, is crucial for providing...

6 months ago cs.SE cs.CR cs.DB PDF

Defense MEDIUM

CKG-LLM: LLM-Assisted Detection of Smart Contract Access Control Vulnerabilities Based on Knowledge Graphs

Xiaoqi Li, Hailu Kuang, Wenkai Li +2 more

Traditional approaches for smart contract analysis often rely on intermediate representations such as abstract syntax trees, control-flow graphs, or...

6 months ago cs.CR PDF

Defense MEDIUM

GSAE: Graph-Regularized Sparse Autoencoders for Robust LLM Safety Steering

Jehyeok Yeon, Federico Cinus, Yifan Wu +1 more

Large language models (LLMs) face critical safety challenges, as they can be manipulated to generate harmful content through adversarial prompts and...

6 months ago cs.LG cs.AI PDF

Defense MEDIUM

DEFEND: Poisoned Model Detection and Malicious Client Exclusion Mechanism for Secure Federated Learning-based Road Condition Classification

Sheng Liu, Panos Papadimitratos

Federated Learning (FL) has drawn the attention of the Intelligent Transportation Systems (ITS) community. FL can train various models for ITS tasks,...

6 months ago cs.CR cs.AI PDF

Frequently asked questions

What is AI security research?

AI security research studies how AI and machine-learning systems can be attacked and defended — covering adversarial examples, prompt injection, model poisoning, training-data extraction, and the mitigations against them. AI Threat Alert curates this research from academic sources so security teams can track the threats behind emerging AI risks.

How many AI security papers does AI Threat Alert track?

AI Threat Alert indexes 3,023+ papers on AI/ML security, classified across attack, defense, benchmark, survey, and tool categories and updated continuously.

Where do the research papers come from?

Papers are sourced from arXiv, then classified by type and by relevance to real-world AI/ML threats, and cross-referenced with the CVEs and incidents they relate to.

What topics does the AI security research cover?

Coverage spans adversarial attacks, model and system defenses, red-teaming benchmarks, literature surveys, and security tooling for LLMs, ML libraries, AI agents, and inference pipelines.

How is this different from a generic paper search?

Every paper is filtered for AI security relevance and linked to the vulnerabilities, vendors, and incidents it relates to, so the research connects directly to operational threat intelligence.

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial