Multimodal Large Language Models (MLLMs) have demonstrated impressive capabilities in cross-modal understanding, but remain vulnerable to adversarial...
The growing misuse of Vision-Language Models (VLMs) has led providers to deploy multiple safeguards, including alignment tuning, system prompts, and...
Strahinja Janjusevic, Anna Baron Garcia, Sohrob Kazerounian
Generative AI is reshaping offensive cybersecurity by enabling autonomous red team agents that can plan, execute, and adapt during penetration tests....
Large language model safety is usually assessed with static benchmarks, but key failures are dynamic: value drift under distribution shift, jailbreak...
Recent research on large language model (LLM) jailbreaks has primarily focused on techniques that bypass safety mechanisms to elicit overtly harmful...
Retrieval-augmented generation (RAG) systems have become widely used for enhancing large language model capabilities, but they introduce significant...
W. Bradley Knox, Katie Bradford, Samanta Varela Castro, et al.
Amid the growing prevalence of human-AI interaction, large language models and other AI-based entities increasingly provide forms of companionship to...
Adaptive optimizers with decoupled weight decay, such as AdamW, are the de facto standard for pre-training large transformer-based generative models....
Large Language Models (LLMs) are increasingly integrated into educational applications. However, they remain vulnerable to jailbreak and fine-tuning...