Attack HIGH
Zhixin Xie, Xurui Song, Jun Luo
Despite substantial efforts in safety alignment, recent research indicates that Large Language Models (LLMs) remain highly susceptible to jailbreak...
Attack HIGH
Chinthana Wimalasuriya, Spyros Tragoudas
Adversarial attacks present a significant threat to modern machine learning systems. Yet, existing detection methods often lack the ability to detect...
5 months ago cs.CR cs.CV cs.LG
Attack HIGH
Zhaorun Chen, Xun Liu, Mintong Kang +4 more
As vision-language models (VLMs) gain prominence, their multimodal interfaces also introduce new safety vulnerabilities, making the safety evaluation...
5 months ago cs.AI cs.LG
Benchmark HIGH
Chengquan Guo, Chulin Xie, Yu Yang +6 more
Code agents have gained widespread adoption due to their strong code generation capabilities and integration with code interpreters, enabling dynamic...
Tool HIGH
Jonathan Sneh, Ruomei Yan, Jialin Yu +6 more
As LLMs increasingly power agents that interact with external tools, tool use has become an essential mechanism for extending their capabilities....
5 months ago cs.CR cs.AI
Attack HIGH
Ruohao Guo, Afshin Oroojlooy, Roshan Sridhar +3 more
Despite recent rapid progress in AI safety, current large language models remain vulnerable to adversarial attacks in multi-turn interaction...
5 months ago cs.LG cs.AI cs.CL
Attack HIGH
Kedong Xiu, Churui Zeng, Tianhang Zheng +6 more
Existing gradient-based jailbreak attacks typically optimize an adversarial suffix to induce a fixed affirmative response, e.g., "Sure, here...
5 months ago cs.CR cs.AI
Attack HIGH
Milad Nasr, Yanick Fratantonio, Luca Invernizzi +7 more
As deep learning models become widely deployed as components within larger production systems, their individual shortcomings can create system-level...
5 months ago cs.CR cs.LG
Attack HIGH
John Hawkins, Aditya Pramar, Rodney Beard +1 more
Large Language Models (LLMs) suffer from a range of vulnerabilities that allow malicious users to solicit undesirable responses through manipulation...
5 months ago cs.CL cs.AI cs.CY
Attack HIGH
Isha Gupta, Rylan Schaeffer, Joshua Kazdan +2 more
The field of adversarial robustness has long established that adversarial examples can successfully transfer between image classifiers and that text...
5 months ago cs.LG cs.AI
Tool HIGH
Shoumik Saha, Jifan Chen, Sam Mayers +3 more
Code-capable large language model (LLM) agents are increasingly embedded into software engineering workflows where they can read, write, and execute...
5 months ago cs.CR cs.AI
Benchmark HIGH
Yinuo Liu, Ruohan Xu, Xilong Wang +2 more
Multiple prompt injection attacks have been proposed against web agents. At the same time, various methods have been developed to detect general...
5 months ago cs.CR cs.AI cs.CL
Attack HIGH
Xiangfang Li, Yu Wang, Bo Li
With the rapid advancement of large language models (LLMs), ensuring their safe use becomes increasingly critical. Fine-tuning is a widely used...
Attack HIGH
Alexandrine Fortier, Thomas Thebaud, Jesús Villalba +2 more
Large Language Models (LLMs) and their multimodal extensions are becoming increasingly popular. One common approach to enable multimodality is to...
5 months ago cs.CL cs.CR cs.SD
Defense HIGH
Shojiro Yamabe, Jun Sakuma
Diffusion language models (DLMs) generate tokens in parallel through iterative denoising, which can reduce latency and enable bidirectional...
5 months ago cs.AI cs.LG
Attack HIGH
Raik Dankworth, Gesina Schwalbe
Deep neural networks (NNs) for computer vision are vulnerable to adversarial attacks, i.e., minuscule malicious changes to inputs may induce...
5 months ago cs.CR cs.LG
Attack HIGH
Chenxiang Luo, David K. Y. Yau, Qun Song
Federated learning (FL) enables collaborative model training without sharing raw data but is vulnerable to gradient inversion attacks (GIAs), where...
5 months ago cs.CR cs.LG
Benchmark HIGH
Haoran Xi, Minghao Shao, Brendan Dolan-Gavitt +2 more
Large language models show promise for vulnerability discovery, yet prevailing methods inspect code in isolation, struggle with long contexts, and...
5 months ago cs.SE cs.CR cs.LG
Attack HIGH
Qinjian Zhao, Jiaqi Wang, Zhiqiang Gao +3 more
Large Language Models (LLMs) have achieved impressive performance across diverse natural language processing tasks, but their growing power also...
Attack HIGH
Xiaobao Wang, Ruoxiao Sun, Yujun Zhang +4 more
Graph Neural Networks (GNNs) have demonstrated strong performance across tasks such as node classification, link prediction, and graph...
5 months ago cs.LG cs.CR