AI Security Research

2,529+ academic papers on AI security, attacks, and defenses

Total

2,529

Attack

969

Benchmark

729

Defense

345

Tool

272

Survey

142

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 41–60 of 272 papers

Clear filters

Tool HIGH

Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks

Chong Xiang, Drew Zagieboylo, Shaona Ghosh +5 more

AI agents, predominantly powered by large language models (LLMs), are vulnerable to indirect prompt injection, in which malicious instructions...

1 months ago cs.CR cs.AI PDF

Tool HIGH

CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks

KrishnaSaiReddy Patil

LLM-based chatbots in government services face critical security gaps. Multi-turn adversarial attacks achieve over 90% success against current...

1 months ago cs.CR cs.AI PDF

Tool HIGH

ORACAL: A Robust and Explainable Multimodal Framework for Smart Contract Vulnerability Detection with Causal Graph Enrichment

Tran Duong Minh Dai, Triet Huynh Minh Le, M. Ali Babar +2 more

Although Graph Neural Networks (GNNs) have shown promise for smart contract vulnerability detection, they still face significant limitations....

1 months ago cs.LG cs.CR PDF

Tool LOW

Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

Cole Walsh, Rodica Ivan

Automated systems have been widely adopted across the educational testing industry for open-response assessment and essay scoring. These systems...

1 months ago cs.CL cs.AI cs.CY PDF

Tool HIGH

The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

Ron Litvak

System prompt configuration can make the difference between near-total phishing blindness and near-perfect detection in LLM email agents. We present...

1 months ago cs.CR cs.AI PDF

Tool MEDIUM

Toward a Multi-Layer ML-Based Security Framework for Industrial IoT

Aymen Bouferroum, Valeria Loscri, Abderrahim Benslimane

The Industrial Internet of Things (IIoT) introduces significant security challenges as resource-constrained devices become increasingly integrated...

1 months ago cs.CR cs.LG PDF

Tool HIGH

Are AI-assisted Development Tools Immune to Prompt Injection?

Charoes Huang, Xin Huang, Amin Milani Fard

Prompt injection is listed as the number-one vulnerability class in the OWASP Top 10 for LLM Applications that can subvert LLM guardrails, disclose...

1 months ago cs.CR cs.SE PDF

Tool LOW

Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains

Octavian Untila

An autonomous AI ecosystem (SUBSTRATE S3), generating product specifications without explicit instructions about formal methods, independently...

1 months ago cs.SE cs.AI PDF

Tool MEDIUM

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

Uchi Uchibeke

AI agents today have passwords but no permission slips. They execute tool calls (fund transfers, database queries, shell commands, sub-agent...

1 months ago cs.CR cs.AI PDF

Tool MEDIUM

A Framework for Formalizing LLM Agent Security

Vincent Siu, Jingxuan He, Kyle Montgomery +4 more

Security in LLM agents is inherently contextual. For example, the same action taken by an agent may represent legitimate behavior or a security...

1 months ago cs.CR cs.AI PDF

Tool HIGH

Prompt Control-Flow Integrity: A Priority-Aware Runtime Defense Against Prompt Injection in LLM Systems

Md Takrim Ul Alam, Akif Islam, Mohd Ruhul Ameen +2 more

Large language models (LLMs) deployed behind APIs and retrieval-augmented generation (RAG) stacks are vulnerable to prompt injection attacks that may...

1 months ago cs.CR PDF

Tool MEDIUM

Security Assessment and Mitigation Strategies for Large Language Models: A Comprehensive Defensive Framework

Taiwo Onitiju, Iman Vakilinia

Large Language Models increasingly power critical infrastructure from healthcare to finance, yet their vulnerability to adversarial manipulation...

1 months ago cs.CR cs.AI PDF

Tool MEDIUM

SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment

Zhouwei Zhai, Mengxiang Chen, Anmeng Zhang

Large language models offer transformative potential for e-commerce search by enabling intent-aware recommendations. However, their industrial...

1 months ago cs.CL PDF

Tool LOW

From Workflow Automation to Capability Closure: A Formal Framework for Safe and Revenue-Aware Customer Service AI

Cosimo Spera

Customer service automation is undergoing a structural transformation. The dominant paradigm is shifting from scripted chatbots and single-agent...

1 months ago cs.AI PDF

Tool HIGH

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

Yihao Zhang, Zeming Wei, Xiaokun Luan +7 more

Autonomous LLM-based agents increasingly operate as long-running processes forming densely interconnected multi-agent ecosystems, whose security...

1 months ago cs.CR cs.AI cs.LG PDF

Tool HIGH

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

Yihao Zhang, Zeming Wei, Xiaokun Luan +7 more

Autonomous LLM-based agents increasingly operate as long-running processes forming densely interconnected multi-agent ecosystems, whose security...

1 months ago cs.CR cs.AI cs.LG PDF

Tool MEDIUM

Rethinking LLM Watermark Detection in Black-Box Settings: A Non-Intrusive Third-Party Framework

Zhuoshang Wang, Yubing Ren, Yanan Cao +3 more

While watermarking serves as a critical mechanism for LLM provenance, existing secret-key schemes tightly couple detection with injection, requiring...

1 months ago cs.CR cs.CL PDF

Tool MEDIUM

Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use

Ziling Zhou

AI agents dynamically acquire capabilities at runtime via MCP and A2A, yet no framework detects when capabilities change post-authorization. We term...

1 months ago cs.CR PDF

Tool MEDIUM

Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use

Ziling Zhou

AI agents dynamically acquire tools, orchestrate sub-agents, and transact across organizational boundaries, yet no existing security layer verifies...

1 months ago cs.CR PDF

Tool MEDIUM

ChainFuzzer: Greybox Fuzzing for Workflow-Level Multi-Tool Vulnerabilities in LLM Agents

Jiangrong Wu, Zitong Yao, Yuhong Nan +1 more

Tool-augmented LLM agents increasingly rely on multi-step, multi-tool workflows to complete real tasks. This design expands the attack surface,...

2 months ago cs.SE cs.CR PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial