AI Security Research

2,077+ academic papers on AI security, attacks, and defenses

Total
2,077
Attack
809
Benchmark
603
Defense
272
Tool
226
Survey
113

Showing 241–259 of 259 papers

Clear filters
Attack MEDIUM

MOLM: Mixture of LoRA Markers

Samar Fares, Nurbek Tastan, Noor Hussein +1 more

Generative models can generate photorealistic images at scale. This raises urgent concerns about the ability to detect synthetically generated images...

5 months ago cs.CV cs.CR cs.LG PDF
Attack MEDIUM

CHAI: Command Hijacking against embodied AI

Luis Burbano, Diego Ortiz, Qi Sun +5 more

Embodied Artificial Intelligence (AI) promises to handle edge cases in robotic vehicle systems where data is scarce by using common-sense reasoning...

5 months ago cs.CR cs.AI cs.LG PDF
Attack MEDIUM

Are Robust LLM Fingerprints Adversarially Robust?

Anshul Nasery, Edoardo Contente, Alkin Kaz +2 more

Model fingerprinting has emerged as a promising paradigm for claiming model ownership. However, robustness evaluations of these schemes have mostly...

5 months ago cs.CR cs.AI cs.LG PDF
Attack MEDIUM

LLM Watermark Evasion via Bias Inversion

Jeongyeon Hwang, Sangdon Park, Jungseul Ok

Watermarking offers a promising solution for detecting LLM-generated content, yet its robustness under realistic query-free (black-box) evasion...

5 months ago cs.CR cs.AI PDF
Attack MEDIUM

Adversarial training with restricted data manipulation

David Benfield, Stefano Coniglio, Phan Tu Vuong +1 more

Adversarial machine learning concerns situations in which learners face attacks from active adversaries. Such scenarios arise in applications such as...

6 months ago cs.LG cs.CR PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial