AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total

2,077

Attack

809

Benchmark

603

Defense

272

Tool

226

Survey

113

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 561–580 of 598 papers

Clear filters

Benchmark LOW

FalseCrashReducer: Mitigating False Positive Crashes in OSS-Fuzz-Gen Using Agentic AI

Paschal C. Amusuo, Dongge Liu, Ricardo Andres Calvo Mendez +3 more

Fuzz testing has become a cornerstone technique for identifying software bugs and security vulnerabilities, with broad adoption in both industry and...

5 months ago cs.SE cs.CR cs.MA PDF

Benchmark MEDIUM

Are LLMs Better GNN Helpers? Rethinking Robust Graph Learning under Deficiencies with Iterative Refinement

Zhaoyan Wang, Zheng Gao, Arogya Kharel +1 more

Graph Neural Networks (GNNs) are widely adopted in Web-related applications, serving as a core technique for learning from graph-structured data,...

5 months ago cs.LG cs.AI PDF

Benchmark LOW

Human-AI Teaming Co-Learning in Military Operations

Clara Maathuis, Kasper Cools

In a time of rapidly evolving military threats and increasingly complex operational environments, the integration of AI into military operations...

5 months ago cs.AI PDF

Benchmark MEDIUM

POLAR: Automating Cyber Threat Prioritization through LLM-Powered Assessment

Luoxi Tang, Yuqiao Meng, Ankita Patra +3 more

Large Language Models (LLMs) are intensively used to assist security analysts in counteracting the rapid exploitation of cyber threats, wherein LLMs...

5 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models

Luca Cotti, Idilio Drago, Anisa Rula +2 more

System logs represent a valuable source of Cyber Threat Intelligence (CTI), capturing attacker behaviors, exploited vulnerabilities, and traces of...

5 months ago cs.AI PDF

Benchmark HIGH

WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents

Yinuo Liu, Ruohan Xu, Xilong Wang +2 more

Multiple prompt injection attacks have been proposed against web agents. At the same time, various methods have been developed to detect general...

5 months ago cs.CR cs.AI cs.CL PDF

Benchmark LOW

Social Welfare Function Leaderboard: When LLM Agents Allocate Social Welfare

Zhengliang Shi, Ruotian Ma, Jen-tse Huang +14 more

Large language models (LLMs) are increasingly entrusted with high-stakes decisions that affect human welfare. However, the principles and values that...

5 months ago cs.CL cs.AI cs.CY PDF

Benchmark MEDIUM

Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning

Yicheng Lang, Yihua Zhang, Chongyu Fan +3 more

Large language model (LLM) unlearning aims to surgically remove the influence of undesired data or knowledge from an existing model while preserving...

5 months ago cs.LG PDF

Benchmark LOW

When Silence Matters: The Impact of Irrelevant Audio on Text Reasoning in Large Audio-Language Models

Chen-An Li, Tzu-Han Lin, Hung-yi Lee

Large audio-language models (LALMs) unify speech and text processing, but their robustness in noisy real-world settings remains underexplored. We...

5 months ago cs.SD cs.CL PDF

Benchmark MEDIUM

Sentry: Authenticating Machine Learning Artifacts on the Fly

Andrew Gan, Zahra Ghodsi

Machine learning systems increasingly rely on open-source artifacts such as datasets and models that are created or hosted by other parties. The...

5 months ago cs.CR PDF

Benchmark HIGH

From Trace to Line: LLM Agent for Real-World OSS Vulnerability Localization

Haoran Xi, Minghao Shao, Brendan Dolan-Gavitt +2 more

Large language models show promise for vulnerability discovery, yet prevailing methods inspect code in isolation, struggle with long contexts, and...

5 months ago cs.SE cs.CR cs.LG PDF

Benchmark MEDIUM

SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence

Ehsan Aghaei, Sarthak Jain, Prashanth Arun +1 more

Effective analysis of cybersecurity and threat intelligence data demands language models that can interpret specialized terminology, complex document...

5 months ago cs.CR cs.AI cs.LG PDF

Benchmark MEDIUM

Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models

Matheus Vinicius da Silva de Oliveira, Jonathan de Andrade Silva, Awdren de Lima Fontao

Large Language Models (LLMs) are widely used across multiple domains but continue to raise concerns regarding security and fairness. Beyond known...

5 months ago cs.AI cs.IR cs.LG PDF

Benchmark LOW

Towards Reliable Benchmarking: A Contamination Free, Controllable Evaluation Framework for Multi-step LLM Function Calling

Seiji Maekawa, Jackson Hassell, Pouya Pezeshkpour +2 more

Existing benchmarks for tool-augmented language models (TaLMs) lack fine-grained control over task difficulty and remain vulnerable to data...

5 months ago cs.CL cs.PL PDF

Benchmark LOW

SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From

Yao Tong, Haonan Wang, Siquan Li +2 more

Fingerprinting Large Language Models (LLMs) is essential for provenance verification and model attribution. Existing methods typically extract...

5 months ago cs.CR cs.AI cs.CL PDF

Benchmark LOW

Sandbagging in a Simple Survival Bandit Problem

Joel Dyer, Daniel Jarne Ornia, Nicholas Bishop +2 more

Evaluating the safety of frontier AI systems is an increasingly important concern, helping to measure the capabilities of such models and identify...

5 months ago cs.LG cs.AI stat.ML PDF

Benchmark LOW

SafeEvalAgent: Toward Agentic and Self-Evolving Safety Evaluation of LLMs

Yixu Wang, Xin Wang, Yang Yao +4 more

The rapid integration of Large Language Models (LLMs) into high-stakes domains necessitates reliable safety and compliance evaluation. However,...

5 months ago cs.AI PDF

Benchmark HIGH

Red Teaming Program Repair Agents: When Correct Patches can Hide Vulnerabilities

Simin Chen, Yixin He, Suman Jana +1 more

LLM-based agents are increasingly deployed for software maintenance tasks such as automated program repair (APR). APR agents automatically fetch...

5 months ago cs.SE PDF

Benchmark LOW

SafeMind: Benchmarking and Mitigating Safety Risks in Embodied LLM Agents

Ruolin Chen, Yinqian Sun, Jihang Wang +3 more

Embodied agents powered by large language models (LLMs) inherit advanced planning capabilities; however, their direct interaction with the physical...

5 months ago cs.AI PDF

Benchmark LOW

Rotation Control Unlearning: Quantifying and Controlling Continuous Unlearning for LLM with The Cognitive Rotation Space

Xiang Zhang, Kun Wei, Xu Yang +3 more

As Large Language Models (LLMs) become increasingly prevalent, their security vulnerabilities have already drawn attention. Machine unlearning is...

5 months ago cs.LG cs.CL PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial