AI Security Research

2,583+ academic papers on AI security, attacks, and defenses

Total: 2,583

Attack: 994

Benchmark: 740

Defense: 355

Tool: 275

Survey: 146

Showing 581–600 of 927 papers

Benchmark MEDIUM

$\alpha^3$-SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks

Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah

Autonomous unmanned aerial vehicle (UAV) systems are increasingly deployed in safety-critical, networked environments where they must operate...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

A Generative AI-Driven Reliability Layer for Action-Oriented Disaster Resilience

Geunsik Lim

As climate-related hazards intensify, conventional early warning systems (EWS) disseminate alerts rapidly but often fail to trigger timely protective...

3 months ago cs.AI cs.SI eess.SY PDF

Benchmark MEDIUM

From Transcripts to AI Agents: Knowledge Extraction, RAG Integration, and Robust Evaluation of Conversational AI Assistants

Krittin Pachtrachai, Petmongkon Pornpichitsuwan, Wachiravit Modecrua +1 more

Building reliable conversational AI assistants for customer-facing industries remains challenging due to noisy conversational data, fragmented...

3 months ago cs.CL PDF

Benchmark MEDIUM

MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLs

Dezhang Kong, Zhuxi Wu, Shiqi Liu +8 more

LLM-based web agents have become increasingly popular for their utility in daily life and work. However, they exhibit critical vulnerabilities when...

3 months ago cs.CR cs.AI PDF

Defense MEDIUM

When Personalization Legitimizes Risks: Uncovering Safety Vulnerabilities in Personalized Dialogue Agents

Jiahe Guo, Xiangran Guo, Yulin Hu +8 more

Long-term memory enables large language model (LLM) agents to support personalized and sustained interactions. However, most work on personalized...

3 months ago cs.AI PDF

Benchmark MEDIUM

An Effective and Cost-Efficient Agentic Framework for Ethereum Smart Contract Auditing

Xiaohui Hu, Wun Yu Chan, Yuejie Shi +5 more

Smart contract security is paramount, but identifying intricate business logic vulnerabilities remains a persistent challenge because existing...

3 months ago cs.CR PDF

Benchmark MEDIUM

Improving User Privacy in Personalized Generation: Client-Side Retrieval-Augmented Modification of Server-Side Generated Speculations

Alireza Salemi, Hamed Zamani

Personalization is crucial for aligning Large Language Model (LLM) outputs with individual user preferences and background knowledge....

3 months ago cs.CL cs.AI cs.CR PDF

Benchmark MEDIUM

Unintended Memorization of Sensitive Information in Fine-Tuned Language Models

Marton Szep, Jorge Marin Ruiz, Georgios Kaissis +4 more

Fine-tuning Large Language Models (LLMs) on sensitive datasets carries a substantial risk of unintended memorization and leakage of Personally...

3 months ago cs.LG cs.AI cs.CL PDF

Attack MEDIUM

Robust Privacy: Inference-Time Privacy through Certified Robustness

Jiankai Jin, Xiangzheng Zhang, Zhao Liu +2 more

Machine learning systems can produce personalized outputs that allow an adversary to infer sensitive input attributes at inference time. We introduce...

3 months ago cs.LG cs.AI cs.CR PDF

Tool MEDIUM

Learning to Collaborate: An Orchestrated-Decentralized Framework for Peer-to-Peer LLM Federation

Inderjeet Singh, Eleonore Vissol-Gaudin, Andikan Otung +1 more

Fine-tuning Large Language Models (LLMs) for specialized domains is constrained by a fundamental challenge: the need for diverse,...

3 months ago cs.LG cs.AI cs.CR PDF

Attack MEDIUM

GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints

Andy Zhu, Rongzhe Wei, Yupu Gu +1 more

Machine unlearning (MU) for large language models has become critical for AI safety, yet existing methods fail to generalize to Mixture-of-Experts...

3 months ago cs.LG cs.AI PDF

Benchmark MEDIUM

SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters for Emergency Care

Dongshen Peng, Yi Wang, Austin Schoeffler +2 more

Large language models (LLMs) show promise in clinical decision support yet risk acquiescing to patient pressure for inappropriate care. We introduce...

3 months ago cs.AI cs.HC PDF

Defense MEDIUM

SafeThinker: Reasoning about Risk to Deepen Safety Beyond Shallow Alignment

Xianya Fang, Xianying Luo, Yadong Wang +8 more

Despite the intrinsic risk-awareness of Large Language Models (LLMs), current defenses often result in shallow safety alignment, rendering models...

3 months ago cs.CR cs.AI PDF

Tool MEDIUM

Bridging Expert Reasoning and LLM Detection: A Knowledge-Driven Framework for Malicious Packages

Wenbo Guo, Shiwen Song, Jiaxun Guo +5 more

Open-source ecosystems such as NPM and PyPI are increasingly targeted by supply chain attacks, yet existing detection methods either depend on...

3 months ago cs.SE cs.CR PDF

Benchmark MEDIUM

NOIR: Privacy-Preserving Generation of Code with Open-Source LLMs

Khoa Nguyen, Khiem Ton, NhatHai Phan +6 more

Although boosting software development performance, large language model (LLM)-powered code generation introduces intellectual property and data...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Machine-Assisted Grading of Nationwide School-Leaving Essay Exams with LLMs and Statistical NLP

Andres Karjus, Kais Allkivi, Silvia Maine +3 more

Large language models (LLMs) enable rapid and consistent automated evaluation of open-ended exam responses, including dimensions of content and...

3 months ago cs.CL cs.AI PDF

Attack MEDIUM

Feature-Space Adversarial Robustness Certification for Multimodal Large Language Models

Song Xia, Meiwen Ding, Chenqi Kong +2 more

Multimodal large language models (MLLMs) exhibit strong capabilities across diverse applications, yet remain vulnerable to adversarial perturbations...

3 months ago cs.LG cs.CV PDF

Benchmark MEDIUM

Improving Methodologies for LLM Evaluations Across Global Languages

Akriti Vij, Benjamin Chua, Darshini Ramiah +43 more

As frontier AI models are deployed globally, it is essential that their behaviour remains safe and reliable across diverse linguistic and cultural...

3 months ago cs.AI PDF

Benchmark MEDIUM

TempoNet: Learning Realistic Communication and Timing Patterns for Network Traffic Simulation

Kristen Moore, Diksha Goel, Cody James Christopher +5 more

Realistic network traffic simulation is critical for evaluating intrusion detection systems, stress-testing network protocols, and constructing...

3 months ago cs.CR cs.AI cs.LG PDF

Tool MEDIUM

Securing LLM-as-a-Service for Small Businesses: An Industry Case Study of a Distributed Chatbot Deployment Platform

Jiazhu Xie, Bowen Li, Heyu Fu +3 more

Large Language Model (LLM)-based question-answering systems offer significant potential for automating customer support and internal knowledge access...

3 months ago cs.DC cs.CR PDF
