AI Security Research

2,529+ academic papers on AI security, attacks, and defenses

Total

2,529

Attack

969

Benchmark

729

Defense

345

Tool

272

Survey

142

Type

All Attack Defense Survey Benchmark Tool

Relevance

All High Medium

Date

All time 7 days 30 days 6 months

Showing 261–280 of 345 papers

Clear filters

Defense MEDIUM

SGuard-v1: Safety Guardrail for Large Language Models

JoonHo Lee, HyeonMin Cho, Jaewoong Yun +3 more

We present SGuard-v1, a lightweight safety guardrail for Large Language Models (LLMs), which comprises two specialized models to detect harmful...

5 months ago cs.CL cs.AI cs.CR PDF

Defense HIGH

Multi-Agent Collaborative Fuzzing with Continuous Reflection for Smart Contracts Vulnerability Detection

Jie Chen, Liangmin Wang

Fuzzing is a widely used technique for detecting vulnerabilities in smart contracts, which generates transaction sequences to explore the execution...

5 months ago cs.CR cs.SE PDF

Defense MEDIUM

Rethinking Deep Alignment Through The Lens Of Incomplete Learning

Thong Bach, Dung Nguyen, Thao Minh Le +1 more

Large language models exhibit systematic vulnerabilities to adversarial attacks despite extensive safety alignment. We provide a mechanistic analysis...

5 months ago cs.LG PDF

Defense MEDIUM

EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment

Ruoxi Cheng, Haoxuan Ma, Teng Ma +1 more

Large Vision-Language Models (LVLMs) exhibit powerful reasoning capabilities but suffer sophisticated jailbreak vulnerabilities. Fundamentally,...

5 months ago cs.AI PDF

Defense HIGH

Prompt Engineering vs. Fine-Tuning for LLM-Based Vulnerability Detection in Solana and Algorand Smart Contracts

Biagio Boi, Christian Esposito

Smart contracts have emerged as key components within decentralized environments, enabling the automation of transactions through self-executing...

5 months ago cs.CR PDF

Defense MEDIUM

EnchTable: Unified Safety Alignment Transfer in Fine-tuned Large Language Models

Jialin Wu, Kecen Li, Zhicong Huang +3 more

Many machine learning models are fine-tuned from large language models (LLMs) to achieve high performance in specialized domains like code...

6 months ago cs.CL cs.CR PDF

Defense MEDIUM

Slice-Aware Spoofing Detection in 5G Networks Using Lightweight Machine Learning

Daniyal Ganiuly, Nurzhau Bolatbek

The increasing virtualization of fifth generation (5G) networks expands the attack surface of the user plane, making spoofing a persistent threat to...

6 months ago cs.CR cs.NI PDF

Defense LOW

Patching LLM Like Software: A Lightweight Method for Improving Safety Policy in Large Language Models

Huzaifa Arif, Keerthiram Murugesan, Ching-Yun Ko +3 more

We propose patching for large language models (LLMs) like software versions, a lightweight and modular approach for addressing safety...

6 months ago cs.AI PDF

Defense MEDIUM

HybridGuard: Enhancing Minority-Class Intrusion Detection in Dew-Enabled Edge-of-Things Networks

Binayak Kara, Ujjwal Sahua, Ciza Thomas +1 more

Securing Dew-Enabled Edge-of-Things (EoT) networks against sophisticated intrusions is a critical challenge. This paper presents HybridGuard, a...

6 months ago cs.CR cs.AI cs.LG PDF

Defense MEDIUM

A Self-Improving Architecture for Dynamic Safety in Large Language Models

Tyler Slater

Context: The integration of Large Language Models (LLMs) into core software systems is accelerating. However, existing software architecture patterns...

6 months ago cs.SE cs.AI cs.CR PDF

Defense MEDIUM

EASE: Practical and Efficient Safety Alignment for Small Language Models

Haonan Shi, Guoli Wang, Tu Ouyang +1 more

Small language models (SLMs) are increasingly deployed on edge devices, making their safety alignment crucial yet challenging. Current shallow...

6 months ago cs.CR cs.LG PDF

Defense LOW

Alignment-Constrained Dynamic Pruning for LLMs: Identifying and Preserving Alignment-Critical Circuits

Dev Patel, Gabrielle Gervacio, Diekola Raimi +5 more

Large Language Models require substantial computational resources for inference, posing deployment challenges. While dynamic pruning offers superior...

6 months ago cs.LG cs.AI cs.CL PDF

Defense MEDIUM

Explaining Software Vulnerabilities with Large Language Models

Oshando Johnson, Alexandra Fomina, Ranjith Krishnamurthy +3 more

The prevalence of security vulnerabilities has prompted companies to adopt static application security testing (SAST) tools for vulnerability...

6 months ago cs.SE cs.AI PDF

Defense HIGH

Specification-Guided Vulnerability Detection with Large Language Models

Hao Zhu, Jia Li, Cuiyun Gao +7 more

Large language models (LLMs) have achieved remarkable progress in code understanding tasks. However, they demonstrate limited performance in...

6 months ago cs.SE cs.CR PDF

Defense MEDIUM

STARS: Synchronous Token Alignment for Robust Supervision in Large Language Models

Mohammad Atif Quamar, Mohammad Areeb, Mikhail Kuznetsov +2 more

Aligning large language models (LLMs) with human values is crucial for safe deployment. Inference-time techniques offer granular control over...

6 months ago cs.CL PDF

Defense LOW

Approximating the Mathematical Structure of Psychodynamics

Bryce-Allen Bagley, Navin Khoshnan

The complexity of human cognition has meant that psychology makes more use of theory and conceptual models than perhaps any other biomedical field....

6 months ago q-bio.NC cs.CL cs.CY PDF

Defense LOW

Federated Attention: A Distributed Paradigm for Collaborative LLM Inference over Edge Networks

Xiumei Deng, Zehui Xiong, Binbin Chen +3 more

Large language models (LLMs) are proliferating rapidly at the edge, delivering intelligent capabilities across diverse application scenarios....

6 months ago cs.DC cs.AI cs.LG PDF

Defense LOW

LM-Fix: Lightweight Bit-Flip Detection and Rapid Recovery Framework for Language Models

Ahmad Tahmasivand, Noureldin Zahran, Saba Al-Sayouri +2 more

This paper presents LM-Fix, a lightweight detection and rapid recovery framework for faults in large language models (LLMs). Existing integrity...

6 months ago cs.SE cs.AI cs.AR PDF

Defense LOW

Seed-Induced Uniqueness in Transformer Models: Subspace Alignment Governs Subliminal Transfer

Ayşe Selin Okatan, Mustafa İlhan Akbaş, Laxima Niure Kandel +1 more

We analyze subliminal transfer in Transformer models, where a teacher embeds hidden traits that can be linearly decoded by a student without...

6 months ago eess.SP cs.AI cs.CR PDF

Defense MEDIUM

Reimagining Safety Alignment with An Image

Yifan Xia, Guorui Chen, Wenqian Yu +3 more

Large language models (LLMs) excel in diverse applications but face dual challenges: generating harmful content under jailbreak attacks and...

6 months ago cs.AI cs.CR PDF

Track AI security vulnerabilities in real time

Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.

Start 14-Day Free Trial