AI Security Research

2,589+ academic papers on AI security, attacks, and defenses

Total

2,589

Attack

998

Benchmark

740

Defense

355

Tool

276

Survey

147

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 1581–1600 of 2,589 papers

Benchmark MEDIUM

Optimistic TEE-Rollups: A Hybrid Architecture for Scalable and Verifiable Generative AI Inference on Blockchain

Aaron Chan, Alex Ding, Frank Chen +3 more

The rapid integration of Large Language Models (LLMs) into decentralized physical infrastructure networks (DePIN) is currently bottlenecked by the...

4 months ago cs.CR PDF

Tool HIGH

Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography

Songze Li, Jiameng Cheng, Yiming Li +2 more

By integrating language understanding with perceptual modalities such as images, multimodal large language models (MLLMs) constitute a critical...

4 months ago cs.CR cs.AI cs.LG PDF

Attack MEDIUM

AI Security Beyond Core Domains: Resume Screening as a Case Study of Adversarial Vulnerabilities in Specialized LLM Applications

Honglin Mu, Jinghao Liu, Kaiyang Wan +4 more

Large Language Models (LLMs) excel at text comprehension and generation, making them ideal for automated tasks like code review and content...

4 months ago cs.CL cs.AI PDF

Other MEDIUM

On the Effectiveness of Instruction-Tuning Local LLMs for Identifying Software Vulnerabilities

Sangryu Park, Gihyuk Ko, Homook Cho

Large Language Models (LLMs) show significant promise in automating software vulnerability analysis, a critical task given the impact of security...

4 months ago cs.CR cs.AI PDF

Attack MEDIUM

IoT-based Android Malware Detection Using Graph Neural Network With Adversarial Defense

Rahul Yumlembam, Biju Issac, Seibu Mary Jacob +1 more

Since the Internet of Things (IoT) is widely adopted using Android applications, detecting malicious Android apps is essential. In recent years,...

4 months ago cs.CR cs.AI cs.LG PDF

Tool MEDIUM

ReGAIN: Retrieval-Grounded AI Framework for Network Traffic Analysis

Shaghayegh Shajarian, Kennedy Marsh, James Benson +2 more

Modern networks generate vast, heterogeneous traffic that must be continuously analyzed for security and performance. Traditional network traffic...

4 months ago cs.LG cs.AI cs.CR PDF

Attack MEDIUM

Conditional Adversarial Fragility in Financial Machine Learning under Macroeconomic Stress

Samruddhi Baviskar

Machine learning models used in financial decision systems operate in nonstationary economic environments, yet adversarial robustness is typically...

4 months ago cs.LG cs.AI cs.CR PDF

Benchmark MEDIUM

A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness

Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more

As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems that distinguish between naive and...

4 months ago cs.CL cs.AI cs.HC PDF

Benchmark MEDIUM

GuardEval: A Multi-Perspective Benchmark for Evaluating Safety, Fairness, and Robustness in LLM Moderators

Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more

As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems, distinguishing between naive from...

4 months ago cs.CL cs.AI cs.HC PDF

Attack MEDIUM

SafeMed-R1: Adversarial Reinforcement Learning for Generalizable and Robust Medical Reasoning in Vision-Language Models

A. A. Gde Yogi Pramana, Jason Ray, Anthony Jaya +1 more

Vision--Language Models (VLMs) show significant promise for Medical Visual Question Answering (VQA), yet their deployment in clinical settings is...

4 months ago cs.AI PDF

Attack HIGH

Causal-Guided Detoxify Backdoor Attack of Open-Weight LoRA Models

Linzhi Chen, Yang Sun, Hongru Wei +1 more

Low-Rank Adaptation (LoRA) has emerged as an efficient method for fine-tuning large language models (LLMs) and is widely adopted within the...

4 months ago cs.CR cs.AI PDF

Attack HIGH

GShield: Mitigating Poisoning Attacks in Federated Learning

Sameera K. M., Serena Nicolazzo, Antonino Nocera +2 more

Federated Learning (FL) has recently emerged as a revolutionary approach to collaborative training Machine Learning models. In particular, it enables...

4 months ago cs.CR cs.LG PDF

Other LOW

JEPA-Reasoner: Decoupling Latent Reasoning from Token Generation

Bingyang Kelvin Liu, Ziyu Patrick Chen, David P. Woodruff

Current autoregressive language models couple high-level reasoning and low-level token generation into a single sequential process, making the...

4 months ago cs.CL PDF

Defense MEDIUM

Elevating Intrusion Detection and Security Fortification in Intelligent Networks through Cutting-Edge Machine Learning Paradigms

Md Minhazul Islam Munna, Md Mahbubur Rahman, Jaroslav Frnda +2 more

The proliferation of IoT devices and their reliance on Wi-Fi networks have introduced significant security vulnerabilities, particularly the KRACK...

4 months ago cs.CR cs.LG PDF

Benchmark HIGH

DREAM: Dynamic Red-teaming across Environments for AI Models

Liming Lu, Xiang Gu, Junyu Huang +5 more

Large Language Models (LLMs) are increasingly used in agentic systems, where their interactions with diverse tools and environments create complex,...

4 months ago cs.CR PDF

Attack HIGH

PromptScreen: Efficient Jailbreak Mitigation Using Semantic Linear Classification in a Multi-Staged Pipeline

Akshaj Prashanth Rao, Advait Singh, Saumya Kumaar Saksena +1 more

Prompt injection and jailbreaking attacks pose persistent security challenges to large language model (LLM)-based systems. We present PromptScreen,...

4 months ago cs.CR cs.AI cs.CL PDF

Defense MEDIUM

R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression

Kun Zhao, Siyuan Dai, Yingying Zhang +9 more

Early detection of Alzheimer's disease (AD) requires models capable of integrating macro-scale neuroanatomical alterations with micro-scale genetic...

4 months ago cs.LG cs.AI PDF

Benchmark HIGH

Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language Models

Zhang Wei, Peilu Hu, Zhenyuan Wei +16 more

The increasing deployment of large language models (LLMs) in safety-critical applications raises fundamental challenges in systematically evaluating...

4 months ago cs.CR cs.CL PDF

Defense LOW

"Even GPT Can Reject Me": Conceptualizing Abrupt Refusal Secondary Harm (ARSH) and Reimagining Psychological AI Safety with Compassionate Completion Standard (CCS)

Yang Ni, Tong Yang

Large Language Models (LLMs) and AI chatbots are increasingly used for emotional and mental health support due to their low cost, immediacy, and...

4 months ago cs.CY cs.HC PDF

Attack HIGH

MEEA: Mere Exposure Effect-Driven Confrontational Optimization for LLM Jailbreaking

Jianyi Zhang, Shizhao Liu, Ziyin Zhou +1 more

The rapid advancement of large language models (LLMs) has intensified concerns about the robustness of their safety alignment. While existing...

4 months ago cs.AI PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial