Large language models have gained widespread prominence, yet their vulnerability to prompt injection and other adversarial attacks remains a critical...
Generative models can generate photorealistic images at scale. This raises urgent concerns about the ability to detect synthetically generated images...
Firas Ben Hmida, Abderrahmen Amich, Ata Kaboudi, et al.
Deep neural networks (DNNs) are increasingly being deployed in high-stakes applications, from self-driving cars to biometric authentication. However,...
Marco Zimmerli, Andreas Plesner, Till Aczel, et al.
Deep neural networks remain vulnerable to adversarial examples despite advances in architectures and training paradigms. We investigate how training...
Large language models (LLMs) have become increasingly popular due to their ability to process unstructured content. As such, LLMs are now a key...
Large language models can express values in two main ways: (1) intrinsic expression, reflecting the model's inherent values learned during training,...
Large Reasoning Models (LRMs) have demonstrated remarkable capabilities in complex problem-solving through Chain-of-Thought (CoT) reasoning. However,...
David Benfield, Stefano Coniglio, Phan Tu Vuong, et al.
Adversarial machine learning concerns situations in which learners face attacks from active adversaries. Such scenarios arise in applications such as...
In this paper we show that cryptographic backdoors in a neural network (NN) can be highly effective in two directions, namely mounting the attacks as...