AI Security Research

2,589+ academic papers on AI security, attacks, and defenses

Total

2,589

Attack

998

Benchmark

740

Defense

355

Tool

276

Survey

147

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 1301–1320 of 1,931 papers

Clear filters

Attack HIGH

Uncovering and Understanding FPR Manipulation Attack in Industrial IoT Networks

Mohammad Shamim Ahsan, Peng Liu

In the network security domain, due to practical issues -- including imbalanced data and heterogeneous legitimate network traffic -- adversarial...

3 months ago cs.CR cs.LG PDF

Defense MEDIUM

The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning

Renmiao Chen, Yida Lu, Shiyao Cui +6 more

As Multimodal Large Language Models (MLLMs) acquire stronger reasoning capabilities to handle complex, multi-image instructions, this advancement may...

3 months ago cs.CV cs.CL PDF

Tool LOW

Zero-shot adaptable task planning for autonomous construction robots: a comparative study of lightweight single and multi-AI agent systems

Hossein Naderi, Alireza Shojaei, Lifu Huang +3 more

Robots are expected to play a major role in the future construction industry but face challenges due to high costs and difficulty adapting to dynamic...

3 months ago cs.RO cs.AI PDF

Defense MEDIUM

Rethinking On-Device LLM Reasoning: Why Analogical Mapping Outperforms Abstract Thinking for IoT DDoS Detection

William Pan, Guiran Liu, Binrong Zhu +4 more

The rapid expansion of IoT deployments has intensified cybersecurity threats, notably Distributed Denial of Service (DDoS) attacks, characterized by...

3 months ago cs.CR eess.SY PDF

Attack HIGH

SecureSplit: Mitigating Backdoor Attacks in Split Learning

Zhihao Dou, Dongfei Cui, Weida Wang +7 more

Split Learning (SL) offers a framework for collaborative model training that respects data privacy by allowing participants to share the same dataset...

3 months ago cs.CR cs.DC cs.LG PDF

Attack MEDIUM

PAC-Private Responses with Adversarial Composition

Xiaochen Zhu, Mayuri Sridhar, Srinivas Devadas

Modern machine learning models are increasingly deployed behind APIs. This renders standard weight-privatization methods (e.g. DP-SGD) unnecessarily...

3 months ago cs.LG cs.CR PDF

Attack MEDIUM

VirtualCrime: Evaluating Criminal Potential of Large Language Models via Sandbox Simulation

Yilin Tang, Yu Wang, Lanlan Qiu +4 more

Large language models (LLMs) have shown strong capabilities in multi-step decision-making, planning and actions, and are increasingly integrated into...

3 months ago cs.CR PDF

Benchmark MEDIUM

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

Yiyang Lu, Jinwen He, Yue Zhao +2 more

Large Language Models (LLMs) are widely integrated into interactive systems such as dialogue agents and task-oriented assistants. This growing...

3 months ago cs.CR cs.LG PDF

Attack MEDIUM

RECAP: A Resource-Efficient Method for Adversarial Prompting in Large Language Models

Rishit Chugh

The deployment of large language models (LLMs) has raised security concerns due to their susceptibility to producing harmful or policy-violating...

3 months ago cs.CL cs.AI cs.CR PDF

Attack HIGH

PINA: Prompt Injection Attack against Navigation Agents

Jiani Liu, Yixin He, Lanlan Fan +5 more

Navigation agents powered by large language models (LLMs) convert natural language instructions into executable plans and actions. Compared to...

3 months ago cs.CR PDF

Benchmark HIGH

Vulnerability of LLMs' Stated Beliefs? LLMs Belief Resistance Check Through Strategic Persuasive Conversation Interventions

Fan Huang, Haewoon Kwak, Jisun An

Large Language Models (LLMs) are increasingly employed in various question-answering tasks. However, recent studies showcase that LLMs are...

3 months ago cs.CL cs.AI PDF

Tool LOW

Motion-to-Response Content Generation via Multi-Agent AI System with Real-Time Safety Verification

HyeYoung Lee

This paper proposes a multi-agent artificial intelligence system that generates response-oriented media content in real time based on audio-derived...

3 months ago cs.AI cs.SD PDF

Benchmark LOW

DRGW: Learning Disentangled Representations for Robust Graph Watermarking

Jiasen Li, Yanwei Liu, Zhuoyi Shang +2 more

Graph-structured data is foundational to numerous web applications, and watermarking is crucial for protecting their intellectual property and...

3 months ago cs.LG cs.CR PDF

Benchmark HIGH

AgenticRed: Optimizing Agentic Systems for Automated Red-teaming

Jiayi Yuan, Jonathan Nöther, Natasha Jaques +1 more

While recent automated red-teaming methods show promise for systematically exposing model vulnerabilities, most existing approaches rely on...

3 months ago cs.AI cs.NE PDF

Attack HIGH

SilentDrift: Exploiting Action Chunking for Stealthy Backdoor Attacks on Vision-Language-Action Models

Bingxin Xu, Yuzhang Shang, Binghui Wang +1 more

Vision-Language-Action (VLA) models are increasingly deployed in safety-critical robotic applications, yet their security vulnerabilities remain...

3 months ago cs.CR cs.AI cs.RO PDF

Benchmark LOW

QERS: Quantum Encryption Resilience Score for Post-Quantum Cryptography in Computer, IoT, and IIoT Systems

Jonatan Rassekhnia

Post-quantum cryptography (PQC) is becoming essential for securing Internet of Things (IoT) and Industrial IoT (IIoT) systems against quantum-enabled...

3 months ago cs.CR cs.NI PDF

Attack HIGH

Sockpuppetting: Jailbreaking LLMs Without Optimization Through Output Prefix Injection

Asen Dotsinski, Panagiotis Eustratiadis

As open-weight large language models (LLMs) increase in capabilities, safeguarding them against malicious prompts and understanding possible attack...

3 months ago cs.CL cs.CR cs.LG PDF

Benchmark HIGH

OI-Bench: An Option Injection Benchmark for Evaluating LLM Susceptibility to Directive Interference

Yow-Fu Liou, Yu-Chien Tang, Yu-Hsiang Liu +1 more

Benchmarking large language models (LLMs) is critical for understanding their capabilities, limitations, and robustness. In addition to interface...

3 months ago cs.CL PDF

Attack HIGH

Prompt Injection Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Caching

Diego Gosmar, Deborah A. Dahl

Prompt injection remains a central obstacle to the safe deployment of large language models, particularly in multi-agent settings where intermediate...

3 months ago cs.AI cs.MA PDF

Attack HIGH

CODE: A Contradiction-Based Deliberation Extension Framework for Overthinking Attacks on Retrieval-Augmented Generation

Xiaolei Zhang, Xiaojun Jia, Liquan Chen +1 more

Introducing reasoning models into Retrieval-Augmented Generation (RAG) systems enhances task performance through step-by-step reasoning, logical...

3 months ago cs.CR PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial