Knowing oneself with and through AI: From self-tracking to chatbots
Lucy Osler
This chapter examines how algorithms and artificial intelligence are transforming our practices of self-knowledge, self-understanding, and...
Showing 1281–1300 of 2,047 papers
Hanxiu Zhang, Yue Zheng
The protection of Intellectual Property (IP) in Large Language Models (LLMs) represents a critical challenge in contemporary AI research. While...
Ruichao Liang, Le Yin, Jing Chen +5 more
LLM-based multi-agent systems (MASs) have reshaped the digital landscape with their emergent coordination and problem-solving capabilities. However,...
Jun Leng, Yu Liu, Litian Zhang +3 more
Large Language Models (LLMs) serve as the backbone of modern AI systems, yet they remain susceptible to adversarial jailbreak attacks. Consequently,...
Songwen Zhao, Danqing Wang, Kexun Zhang +3 more
Vibe coding is a new programming paradigm in which human engineers instruct large language model (LLM) agents to complete complex coding tasks with...
Rafflesia Khan, Declan Joyce, Mansura Habiba
The rapid deployment of large language model (LLM)-based agents introduces a new class of risks, driven by their capacity for autonomous planning,...
Thomas Rivasseau
Current research on operator control of Large Language Models improves model robustness against adversarial attacks and misbehavior by training on...
Yuan Xiong, Ziqi Miao, Lijun Li +3 more
While Multimodal Large Language Models (MLLMs) show remarkable capabilities, their safety alignments are susceptible to jailbreak attacks. Existing...
Afshin Khadangi, Hanna Marxen, Amir Sartipi +2 more
Frontier large language models (LLMs) such as ChatGPT, Grok and Gemini are increasingly used for mental-health support with anxiety, trauma and...
Xinzheng Wu, Junyi Chen, Naiting Zhong +1 more
The safe deployment of autonomous driving systems (ADSs) relies on comprehensive testing and evaluation. However, safety-critical scenarios that can...
Yixuan Tang, Yi Yang
Aligning Large Language Models (LLMs) with human preferences typically relies on external supervision, which faces critical limitations: human...
Ziyi Tong, Feifei Sun, Le Minh Nguyen
Multimodal Large Language Models (MLLMs) are emerging as one of the foundational tools in an expanding range of applications. Consequently,...
Yepeng Ding, Ahmed Twabi, Junwei Yu +3 more
The emergence of Large Language Models (LLMs) is rapidly accelerating the development of autonomous multi-agent systems (MAS), paving the way for the...
Weiwei Wang
Catastrophic forgetting remains a fundamental challenge in continual learning for large language models. Recent work revealed that performance...
Yuanhe Zhang, Weiliu Wang, Zhenhong Zhou +5 more
Large Language Model (LLM)-based agents have demonstrated remarkable capabilities in reasoning, planning, and tool usage. The recently proposed Model...
Junyu Wang, Changjia Zhu, Yuanbo Zhou +3 more
This paper studies how multimodal large language models (MLLMs) undermine the security guarantees of visual CAPTCHA. We identify the attack surface...
Han Luo, Guy Laban
Large language models (LLMs) now mediate many web-based mental-health, crisis, and other emotionally sensitive services, yet their psychosocial...
Adel Chehade, Edoardo Ragusa, Paolo Gastaldo +1 more
Traffic classification (TC) plays a critical role in cybersecurity, particularly in IoT and embedded contexts, where inspection must often occur...
Zixia Wang, Gaojie Jin, Jia Hu +1 more
Recent advancements in Large Language Models (LLMs) have led to their widespread adoption in daily applications. Despite their impressive...
Alexander Boyd, Franz Nowak, David Hyland +2 more
World models have been recently proposed as sandbox environments in which AI agents can be trained and evaluated before deployment. Although...