AI Security Research

2,529+ academic papers on AI security, attacks, and defenses

Total: 2,529

Attack: 969

Benchmark: 729

Defense: 345

Tool: 272

Survey: 142

Showing 2461–2480 of 2,529 papers

Benchmark MEDIUM

WAREX: Web Agent Reliability Evaluation on Existing Benchmarks

Su Kara, Fazle Faisal, Suman Nath

Recent advances in browser-based LLM agents have shown promise for automating tasks ranging from simple form filling to hotel booking or online...

7 months ago cs.AI cs.CR cs.LG PDF

Benchmark MEDIUM

An Ensemble Framework for Unbiased Language Model Watermarking

Yihan Wu, Ruibo Chen, Georgios Milis +1 more

As large language models become increasingly capable and widely deployed, verifying the provenance of machine-generated content is critical to...

7 months ago cs.CR PDF

Benchmark HIGH

Automated Vulnerability Validation and Verification: A Large Language Model Approach

Alireza Lotfi, Charalampos Katsis, Elisa Bertino

Software vulnerabilities remain a critical security challenge, providing entry points for attackers into enterprise networks. Despite advances in...

7 months ago cs.CR PDF

Defense MEDIUM

Policy-as-Prompt: Turning AI Governance Rules into Guardrails for AI Agents

Gauri Kholkar, Ratinder Ahuja

As autonomous AI agents are used in regulated and safety-critical settings, organizations need effective ways to turn policy into enforceable...

7 months ago cs.CL cs.AI PDF

Benchmark MEDIUM

Binary Diff Summarization using Large Language Models

Meet Udeshi, Venkata Sai Charan Putrevu, Prashanth Krishnamurthy +4 more

Security of software supply chains is necessary to ensure that software updates do not contain maliciously injected code or introduce vulnerabilities...

7 months ago cs.CR PDF

Benchmark MEDIUM

Quant Fever, Reasoning Blackholes, Schrodinger's Compliance, and More: Probing GPT-OSS-20B

Shuyi Lin, Tian Lu, Zikai Wang +3 more

OpenAI's GPT-OSS family provides open-weight language models with explicit chain-of-thought (CoT) reasoning and a Harmony prompt format. We summarize...

7 months ago cs.AI cs.CR PDF

Benchmark LOW

GroupCoOp: Group-robust Fine-tuning via Group Prompt Learning

Nayeong Kim, Seong Joon Oh, Suha Kwak

Parameter-efficient fine-tuning (PEFT) of vision-language models (VLMs) excels in various vision tasks thanks to the rich knowledge and...

7 months ago cs.CV cs.AI PDF

Benchmark HIGH

SafeSearch: Automated Red-Teaming of LLM-Based Search Agents

Jianshuo Dong, Sheng Guo, Hao Wang +6 more

Search agents connect LLMs to the Internet, enabling them to access broader and more up-to-date information. However, this also introduces a new...

7 months ago cs.AI cs.CL cs.CR PDF

Benchmark MEDIUM

How LLMs Learn to Reason: A Complex Network Perspective

Sihan Hu, Xiansheng Cai, Yuan Huang +5 more

Training large language models with Reinforcement Learning with Verifiable Rewards (RLVR) exhibits a set of distinctive and puzzling behaviors that...

7 months ago cs.AI cond-mat.dis-nn cond-mat.stat-mech PDF

Benchmark MEDIUM

AutoML in Cybersecurity: An Empirical Study

Sherif Saad, Kevin Shi, Mohammed Mamun +1 more

Automated machine learning (AutoML) has emerged as a promising paradigm for automating machine learning (ML) pipeline design, broadening AI adoption....

7 months ago cs.CR PDF

Attack HIGH

StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data

Yixu Wang, Yan Teng, Yingchun Wang +1 more

Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have transformed vision model adaptation, enabling the rapid deployment of customized...

7 months ago cs.CR cs.CV PDF

Defense MEDIUM

Uncovering Vulnerabilities of LLM-Assisted Cyber Threat Intelligence

Yuqiao Meng, Luoxi Tang, Feiyang Yu +4 more

Large language models (LLMs) are increasingly used to help security analysts manage the surge of cyber threats, automating tasks from vulnerability...

7 months ago cs.CR cs.AI PDF

Attack HIGH

Formalization Driven LLM Prompt Jailbreaking via Reinforcement Learning

Zhaoqi Wang, Daqing He, Zijian Zhang +4 more

Large language models (LLMs) have demonstrated remarkable capabilities, yet they also introduce novel security challenges. For instance, prompt...

7 months ago cs.AI cs.CR PDF

Other MEDIUM

Contrastive Learning Enhances Language Model Based Cell Embeddings for Low-Sample Single Cell Transcriptomics

Luxuan Zhang, Douglas Jiang, Qinglong Wang +2 more

Large language models (LLMs) have shown strong ability in generating rich representations across domains such as natural language processing and...

7 months ago q-bio.GN cs.NE q-bio.MN PDF

Defense MEDIUM

ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search

Zeyu Shen, Basileal Imana, Tong Wu +3 more

Retrieval-Augmented Generation (RAG) enhances Large Language Models by grounding their outputs in external documents. These systems, however, remain...

7 months ago cs.CR cs.AI PDF

Defense MEDIUM

Beyond Embeddings: Interpretable Feature Extraction for Binary Code Similarity

Charles E. Gagnon, Steven H. H. Ding, Philippe Charland +1 more

Binary code similarity detection is a core task in reverse engineering. It supports malware analysis and vulnerability discovery by identifying...

7 months ago cs.AI cs.CR cs.SE PDF

Attack MEDIUM

Dual-Space Smoothness for Robust and Balanced LLM Unlearning

Han Yan, Zheyuan Liu, Meng Jiang

With the rapid advancement of large language models, Machine Unlearning has emerged to address growing concerns around user privacy, copyright...

7 months ago cs.CL cs.AI PDF

Attack HIGH

Preventing Robotic Jailbreaking via Multimodal Domain Adaptation

Francesco Marchiori, Rohan Sinha, Christopher Agia +4 more

Large Language Models (LLMs) and Vision-Language Models (VLMs) are increasingly deployed in robotic environments but remain vulnerable to...

7 months ago cs.RO PDF

Benchmark MEDIUM

Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models

Xiaotian Zou

Multimodal Large Language Models (MLLMs) have transformed text-to-image workflows, allowing designers to create novel visual concepts with...

7 months ago cs.CV cs.AI PDF

Defense LOW

Towards Quantum-Ready Blockchain Fraud Detection via Ensemble Graph Neural Networks

M. Z. Haider, Tayyaba Noreen, M. Salman

Blockchain business applications and cryptocurrencies enable secure, decentralized value transfer, yet their pseudonymous nature creates...

7 months ago cs.LG cs.AI cs.CR PDF
