AI Security Research

2,560+ academic papers on AI security, attacks, and defenses

Total

2,560

Attack

982

Benchmark

736

Defense

350

Tool

275

Survey

144

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 181–200 of 442 papers

Clear filters

Benchmark MEDIUM

Benchmarking Large Language Models for Zero-shot and Few-shot Phishing URL Detection

Najmul Hasan, Prashanth BusiReddyGari

The Uniform Resource Locator (URL), introduced in a connectivity-first era to define access and locate resources, remains historically limited,...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Trustworthy Blockchain-based Federated Learning for Electronic Health Records: Securing Participant Identity with Decentralized Identifiers and Verifiable Credentials

Rodrigo Tertulino, Ricardo Almeida, Laercio Alencar

The digitization of healthcare has generated massive volumes of Electronic Health Records (EHRs), offering unprecedented opportunities for training...

3 months ago cs.CR cs.AI cs.LG PDF

Benchmark MEDIUM

Expected Harm: Rethinking Safety Evaluation of (Mis)Aligned LLMs

Yen-Shan Chen, Zhi Rui Tam, Cheng-Kuang Wu +1 more

Current evaluations of LLM safety predominantly rely on severity-based taxonomies to assess the harmfulness of malicious queries. We argue that this...

3 months ago cs.CR cs.CL cs.CY PDF

Benchmark MEDIUM

CIPHER: Cryptographic Insecurity Profiling via Hybrid Evaluation of Responses

Max Manolov, Tony Gao, Siddharth Shukla +2 more

Large language models (LLMs) are increasingly used to assist developers with code, yet their implementations of cryptographic functionality often...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Don't Judge a Book by its Cover: Testing LLMs' Robustness Under Logical Obfuscation

Abhilekh Borah, Shubhra Ghosh, Kedar Joshi +2 more

Tasks such as solving arithmetic equations, evaluating truth tables, and completing syllogisms are handled well by large language models (LLMs) in...

3 months ago cs.CL PDF

Benchmark MEDIUM

Protecting Private Code in IDE Autocomplete using Differential Privacy

Evgeny Grigorenko, David Stanojević, David Ilić +2 more

Modern Integrated Development Environments (IDEs) increasingly leverage Large Language Models (LLMs) to provide advanced features like code...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

Evaluating Large Language Models for Security Bug Report Prediction

Farnaz Soltaniani, Shoaib Razzaq, Mohammad Ghafari

Early detection of security bug reports (SBRs) is critical for timely vulnerability mitigation. We present an evaluation of prompt-based engineering...

3 months ago cs.CR cs.AI cs.LG PDF

Benchmark MEDIUM

AlienLM: Alienization of Language for API-Boundary Privacy in Black-Box LLMs

Jaehee Kim, Pilsung Kang

Modern LLMs are increasingly accessed via black-box APIs, requiring users to transmit sensitive prompts, outputs, and fine-tuning data to external...

3 months ago cs.CR cs.CL PDF

Benchmark MEDIUM

Hair-Trigger Alignment: Black-Box Evaluation Cannot Guarantee Post-Update Alignment

Yavuz Bakman, Duygu Nur Yaldiz, Salman Avestimehr +1 more

Large Language Models (LLMs) are rarely static and are frequently updated in practice. A growing body of alignment research has shown that models...

3 months ago cs.LG PDF

Benchmark MEDIUM

FIT: Defying Catastrophic Forgetting in Continual LLM Unlearning

Xiaoyu Xu, Minxin Du, Kun Fang +6 more

Large language models (LLMs) demonstrate impressive capabilities across diverse tasks but raise concerns about privacy, copyright, and harmful...

3 months ago cs.CL cs.AI cs.CR PDF

Benchmark MEDIUM

The Compliance Paradox: Semantic-Instruction Decoupling in Automated Academic Code Evaluation

Devanshu Sahoo, Manish Prasad, Vasudev Majhi +5 more

The rapid integration of Large Language Models (LLMs) into educational assessment rests on the unverified assumption that instruction following...

3 months ago cs.CL cs.AI cs.ET PDF

Benchmark MEDIUM

VoxMorph: Scalable Zero-shot Voice Identity Morphing via Disentangled Embeddings

Bharath Krishnamurthy, Ajita Rattani

Morphing techniques generate artificial biometric samples that combine features from multiple individuals, allowing each contributor to be verified...

3 months ago cs.SD cs.CR cs.LG PDF

Benchmark MEDIUM

Benchmarking LLAMA Model Security Against OWASP Top 10 For LLM Applications

Nourin Shahin, Izzat Alsmadi

As large language models (LLMs) move from research prototypes to enterprise systems, their security vulnerabilities pose serious risks to data...

3 months ago cs.CR cs.LG PDF

Benchmark MEDIUM

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Xiangyang Zhu, Yuan Tian, Zicheng Zhang +6 more

Large vision-language models (LVLMs) exhibit remarkable capabilities in cross-modal tasks but face significant safety challenges, which undermine...

3 months ago cs.CL PDF

Benchmark MEDIUM

Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection

Quy-Anh Dang, Chris Ngo

Despite significant progress in alignment, large language models (LLMs) remain vulnerable to adversarial attacks that elicit harmful behaviors....

3 months ago cs.LG cs.AI PDF

Benchmark MEDIUM

VoxPrivacy: A Benchmark for Evaluating Interactional Privacy of Speech Language Models

Yuxiang Wang, Hongyu Liu, Dekun Chen +2 more

As Speech Language Models (SLMs) transition from personal devices to shared, multi-user environments such as smart homes, a new challenge emerges:...

3 months ago eess.AS cs.AI cs.SD PDF

Benchmark MEDIUM

Malicious Repurposing of Open Science Artefacts by Using Large Language Models

Zahra Hashemi, Zhiqiang Zhong, Jun Pang +1 more

The rapid evolution of large language models (LLMs) has fuelled enthusiasm about their role in advancing scientific discovery, with studies exploring...

3 months ago cs.CL PDF

Benchmark MEDIUM

$α^3$-SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks

Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah

Autonomous unmanned aerial vehicle (UAV) systems are increasingly deployed in safety-critical, networked environments where they must operate...

3 months ago cs.CR cs.AI PDF

Benchmark MEDIUM

A Generative AI-Driven Reliability Layer for Action-Oriented Disaster Resilience

Geunsik Lim

As climate-related hazards intensify, conventional early warning systems (EWS) disseminate alerts rapidly but often fail to trigger timely protective...

3 months ago cs.AI cs.SI eess.SY PDF

Benchmark MEDIUM

From Transcripts to AI Agents: Knowledge Extraction, RAG Integration, and Robust Evaluation of Conversational AI Assistants

Krittin Pachtrachai, Petmongkon Pornpichitsuwan, Wachiravit Modecrua +1 more

Building reliable conversational AI assistants for customer-facing industries remains challenging due to noisy conversational data, fragmented...

3 months ago cs.CL PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial