AI Security Research

2,529+ academic papers on AI security, attacks, and defenses

Total
2,529
Attack
969
Benchmark
729
Defense
345
Tool
272
Survey
142

Showing 701–720 of 729 papers

Clear filters
Benchmark LOW

Sandbagging in a Simple Survival Bandit Problem

Joel Dyer, Daniel Jarne Ornia, Nicholas Bishop +2 more

Evaluating the safety of frontier AI systems is an increasingly important concern, helping to measure the capabilities of such models and identify...

7 months ago cs.LG cs.AI stat.ML PDF
Benchmark MEDIUM

Binary Diff Summarization using Large Language Models

Meet Udeshi, Venkata Sai Charan Putrevu, Prashanth Krishnamurthy +4 more

Security of software supply chains is necessary to ensure that software updates do not contain maliciously injected code or introduce vulnerabilities...

7 months ago cs.CR PDF
Benchmark MEDIUM

How LLMs Learn to Reason: A Complex Network Perspective

Sihan Hu, Xiansheng Cai, Yuan Huang +5 more

Training large language models with Reinforcement Learning with Verifiable Rewards (RLVR) exhibits a set of distinctive and puzzling behaviors that...

7 months ago cs.AI cond-mat.dis-nn cond-mat.stat-mech PDF
Benchmark MEDIUM

AutoML in Cybersecurity: An Empirical Study

Sherif Saad, Kevin Shi, Mohammed Mamun +1 more

Automated machine learning (AutoML) has emerged as a promising paradigm for automating machine learning (ML) pipeline design, broadening AI adoption....

7 months ago cs.CR PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial