AI Security Research

2,529+ academic papers on AI security, attacks, and defenses

Total

2,529

Attack

969

Benchmark

729

Defense

345

Tool

272

Survey

142

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 181–200 of 265 papers

Clear filters

Defense MEDIUM

Rethinking On-Device LLM Reasoning: Why Analogical Mapping Outperforms Abstract Thinking for IoT DDoS Detection

William Pan, Guiran Liu, Binrong Zhu +4 more

The rapid expansion of IoT deployments has intensified cybersecurity threats, notably Distributed Denial of Service (DDoS) attacks, characterized by...

3 months ago cs.CR eess.SY PDF

Defense MEDIUM

In Vino Veritas and Vulnerabilities: Examining LLM Safety via Drunk Language Inducement

Anudeex Shetty, Aditya Joshi, Salil S. Kanhere

Humans are susceptible to undesirable behaviours and privacy leaks under the influence of alcohol. This paper investigates drunk language, i.e., text...

3 months ago cs.CL cs.AI cs.CR PDF

Defense LOW

Cross-reality Location Privacy Protection in 6G-enabled Vehicular Metaverses: An LLM-enhanced Hybrid Generative Diffusion Model-based Approach

Xiaofeng Luo, Jiayi He, Jiawen Kang +4 more

The emergence of 6G-enabled vehicular metaverses enables Autonomous Vehicles (AVs) to operate across physical and virtual spaces through...

3 months ago cs.NI cs.CR cs.HC PDF

Defense HIGH

Multi-Agent Taint Specification Extraction for Vulnerability Detection

Jonah Ghebremichael, Saastha Vasan, Saad Ullah +6 more

Static Application Security Testing (SAST) tools using taint analysis are widely viewed as providing higher-quality vulnerability detection results...

3 months ago cs.CR cs.SE PDF

Defense HIGH

Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay

Hao Wang, Yanting Wang, Hao Li +2 more

Large Language Models (LLMs) have achieved remarkable capabilities but remain vulnerable to adversarial ``jailbreak'' attacks designed to bypass...

3 months ago cs.CR cs.CL PDF

Defense LOW

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Xingjun Ma, Yixu Wang, Hengyuan Xu +18 more

The rapid evolution of Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) has driven major gains in reasoning, perception, and...

3 months ago cs.AI cs.CL cs.CV PDF

Defense MEDIUM

Understanding and Preserving Safety in Fine-Tuned LLMs

Jiawen Zhang, Yangfan Hu, Kejia Chen +7 more

Fine-tuning is an essential and pervasive functionality for applying large language models (LLMs) to downstream tasks. However, it has the potential...

3 months ago cs.LG cs.AI PDF

Defense MEDIUM

Beyond Simulations: What 20,000 Real Conversations Reveal About Mental Health AI Safety

Caitlin A. Stamatis, Jonah Meyerhoff, Richard Zhang +3 more

Large language models (LLMs) are increasingly used for mental health support, yet existing safety evaluations rely primarily on small,...

3 months ago cs.CY cs.CL PDF

Defense MEDIUM

DNF: Dual-Layer Nested Fingerprinting for Large Language Model Intellectual Property Protection

Zhenhua Xu, Yiran Zhao, Mengting Zhong +4 more

The rapid growth of large language models raises pressing concerns about intellectual property protection under black-box deployment. Existing...

4 months ago cs.CR cs.AI PDF

Defense LOW

Subspace Alignment for Vision-Language Model Test-time Adaptation

Zhichen Zeng, Wenxuan Bao, Xiao Lin +8 more

Vision-language models (VLMs), despite their extraordinary zero-shot capabilities, are vulnerable to distribution shifts. Test-time adaptation (TTA)...

4 months ago cs.CV cs.AI PDF

Defense MEDIUM

Safe-FedLLM: Delving into the Safety of Federated Large Language Models

Mingxiang Tao, Yu Tian, Wenxuan Tu +3 more

Federated learning (FL) addresses data privacy and silo issues in large language models (LLMs). Most prior work focuses on improving the training...

4 months ago cs.CR cs.AI PDF

Defense LOW

SafePro: Evaluating the Safety of Professional-Level AI Agents

Kaiwen Zhou, Shreedhar Jangam, Ashwin Nagarajan +7 more

Large language model-based agents are rapidly evolving from simple conversational assistants into autonomous systems capable of performing complex,...

4 months ago cs.AI PDF

Defense MEDIUM

SecureDyn-FL: A Robust Privacy-Preserving Federated Learning Framework for Intrusion Detection in IoT Networks

Imtiaz Ali Soomro, Hamood Ur Rehman, S. Jawad Hussain ID +3 more

The rapid proliferation of Internet of Things (IoT) devices across domains such as smart homes, industrial control systems, and healthcare networks...

4 months ago cs.CR cs.NI PDF

Defense MEDIUM

StriderSPD: Structure-Guided Joint Representation Learning for Binary Security Patch Detection

Qingyuan Li, Chenchen Yu, Chuanyi Li +4 more

Vulnerabilities severely threaten software systems, making the timely application of security patches crucial for mitigating attacks. However,...

4 months ago cs.SE cs.CR PDF

Defense MEDIUM

PII-VisBench: Evaluating Personally Identifiable Information Safety in Vision Language Models Along a Continuum of Visibility

G M Shahariar, Zabir Al Nazi, Md Olid Hasan Bhuiyan +1 more

Vision Language Models (VLMs) are increasingly integrated into privacy-critical domains, yet existing evaluations of personally identifiable...

4 months ago cs.AI cs.CL cs.CR PDF

Defense LOW

Safety Not Found (404): Hidden Risks of LLM-Based Robotics Decision Making

Jua Han, Jaeyoon Seo, Jungbin Min +2 more

One mistake by an AI system in a safety-critical setting can cost lives. As Large Language Models (LLMs) become integral to robotics decision-making,...

4 months ago cs.AI cs.RO PDF

Defense LOW

Robust Reasoning as a Symmetry-Protected Topological Phase

Ilmo Sung

Large language models suffer from "hallucinations"-logical inconsistencies induced by semantic noise. We propose that current architectures operate...

4 months ago cs.LG cond-mat.dis-nn cs.AI PDF

Defense MEDIUM

AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs

Han Zhu, Jiale Chen, Chengkun Cai +8 more

Multi-modal Large Language Models (MLLMs) are increasingly deployed in interactive applications. However, their safety vulnerabilities become...

4 months ago cs.CL PDF

Defense MEDIUM

What Matters For Safety Alignment?

Xing Li, Hui-Ling Zhen, Lihao Yin +3 more

This paper presents a comprehensive empirical study on the safety alignment capabilities. We evaluate what matters for safety alignment in LLMs and...

4 months ago cs.CL cs.AI cs.CR PDF

Defense MEDIUM

STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules

Di Wu, Yanyan Zhao, Xin Lu +2 more

Defending against jailbreak attacks is crucial for the safe deployment of Large Language Models (LLMs). Recent research has attempted to improve...

4 months ago cs.AI cs.CL PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial