AI Security Research

2,583+ academic papers on AI security, attacks, and defenses

Total

2,583

Attack

994

Benchmark

740

Defense

355

Tool

275

Survey

146

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 2021–2040 of 2,583 papers

Attack MEDIUM

Large Language Models for Cyber Security

Raunak Somani, Aswani Kumar Cherukuri

This paper studies the integration off Large Language Models into cybersecurity tools and protocols. The main issue discussed in this paper is how...

6 months ago cs.CR PDF

Attack MEDIUM

Adversarially Robust and Interpretable Magecart Malware Detection

Pedro Pereira, José Gouveia, João Vitorino +2 more

Magecart skimming attacks have emerged as a significant threat to client-side security and user trust in online payment systems. This paper addresses...

6 months ago cs.CR PDF

Tool MEDIUM

AdversariaLLM: A Unified and Modular Toolbox for LLM Robustness Research

Tim Beyer, Jonas Dornbusch, Jakob Steimle +3 more

The rapid expansion of research on Large Language Model (LLM) safety and robustness has produced a fragmented and oftentimes buggy ecosystem of...

6 months ago cs.AI cs.SE PDF

Attack HIGH

Black-Box Guardrail Reverse-engineering Attack

Hongwei Yao, Yun Xia, Shuo Shao +3 more

Large language models (LLMs) increasingly employ guardrails to enforce ethical, legal, and application-specific constraints on their outputs. While...

6 months ago cs.CR cs.CL PDF

Defense MEDIUM

Explaining Software Vulnerabilities with Large Language Models

Oshando Johnson, Alexandra Fomina, Ranjith Krishnamurthy +3 more

The prevalence of security vulnerabilities has prompted companies to adopt static application security testing (SAST) tools for vulnerability...

6 months ago cs.SE cs.AI PDF

Other MEDIUM

Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models

Hirohane Takagi, Gouki Minegishi, Shota Kizawa +2 more

Although behavioral studies have documented numerical reasoning errors in large language models (LLMs), the underlying representational mechanisms...

6 months ago cs.AI PDF

Defense HIGH

Specification-Guided Vulnerability Detection with Large Language Models

Hao Zhu, Jia Li, Cuiyun Gao +7 more

Large language models (LLMs) have achieved remarkable progress in code understanding tasks. However, they demonstrate limited performance in...

6 months ago cs.SE cs.CR PDF

Benchmark MEDIUM

Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback

Shiyin Lin

Software fuzzing has become a cornerstone in automated vulnerability discovery, yet existing mutation strategies often lack semantic awareness,...

6 months ago cs.CR cs.AI PDF

Defense MEDIUM

STARS: Synchronous Token Alignment for Robust Supervision in Large Language Models

Mohammad Atif Quamar, Mohammad Areeb, Mikhail Kuznetsov +2 more

Aligning large language models (LLMs) with human values is crucial for safe deployment. Inference-time techniques offer granular control over...

6 months ago cs.CL PDF

Attack HIGH

Whisper Leak: a side-channel attack on Large Language Models

Geoff McDonald, Jonathan Bar Or

Large Language Models (LLMs) are increasingly deployed in sensitive domains including healthcare, legal services, and confidential communications,...

6 months ago cs.CR cs.AI PDF

Other LOW

AnchorTP: Resilient LLM Inference with State-Preserving Elastic Tensor Parallelism

Wendong Xu, Chujie Chen, He Xiao +8 more

Large Language Model (LLM) inference services demand exceptionally high availability and low latency, yet multi-GPU Tensor Parallelism (TP) makes...

6 months ago cs.DC PDF

Attack MEDIUM

Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond

Botao 'Amber' Hu, Helena Rong

As the "agentic web" takes shape-billions of AI agents (often LLM-powered) autonomously transacting and collaborating-trust shifts from human...

6 months ago cs.HC cs.AI cs.MA PDF

Attack HIGH

Let the Bees Find the Weak Spots: A Path Planning Perspective on Multi-Turn Jailbreak Attacks against LLMs

Yize Liu, Yunyun Hou, Aina Sui

Large Language Models (LLMs) have been widely deployed across various applications, yet their potential security and ethical risks have raised...

6 months ago cs.CR cs.CL PDF

Attack HIGH

Death by a Thousand Prompts: Open Model Vulnerability Analysis

Amy Chang, Nicholas Conley, Harish Santhanalakshmi Ganesan +1 more

Open-weight models provide researchers and developers with accessible foundations for diverse downstream applications. We tested the safety and...

6 months ago cs.CR cs.LG PDF

Defense LOW

Approximating the Mathematical Structure of Psychodynamics

Bryce-Allen Bagley, Navin Khoshnan

The complexity of human cognition has meant that psychology makes more use of theory and conceptual models than perhaps any other biomedical field....

6 months ago q-bio.NC cs.CL cs.CY PDF

Tool LOW

DeepKnown-Guard: A Proprietary Model-Based Safety Response Framework for AI Agents

Qi Li, Jianjun Xu, Pingtao Wei +8 more

With the widespread application of Large Language Models (LLMs), their associated security issues have become increasingly prominent, severely...

6 months ago cs.AI PDF

Attack HIGH

Jailbreaking in the Haystack

Rishi Rajesh Shah, Chen Henry Wu, Shashwat Saxena +3 more

Recent advances in long-context language models (LMs) have enabled million-token inputs, expanding their capabilities across complex tasks like...

6 months ago cs.CR cs.AI cs.CL PDF

Benchmark MEDIUM

Evaluating Control Protocols for Untrusted AI Agents

Jon Kutasov, Chloe Loughridge, Yuqi Sun +4 more

As AI systems become more capable and widely deployed as agents, ensuring their safe operation becomes critical. AI control offers one approach to...

6 months ago cs.AI PDF

Attack HIGH

Optimizing AI Agent Attacks With Synthetic Data

Chloe Loughridge, Paul Colognese, Avery Griffin +3 more

As AI deployments become more complex and high-stakes, it becomes increasingly important to be able to estimate their risk. AI control is one...

6 months ago cs.AI PDF

Survey LOW

A Criminology of Machines

Gian Maria Campedelli

While the possibility of reaching human-like Artificial Intelligence (AI) remains controversial, the likelihood that the future will be characterized...

6 months ago cs.CY cs.AI cs.HC PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial