Attack HIGH
Víctor Mayoral-Vilches, María Sanz-Gómez, Francesco Balassone +6 more
AI-driven penetration testing now executes thousands of actions per hour but still lacks the strategic intuition humans apply in competitive...
Attack HIGH
Ahmad Alobaid, Martí Jordà Roca, Carlos Castillo +1 more
The availability of Large Language Models (LLMs) has led to a new generation of powerful chatbots that can be developed at relatively low cost. As...
2 months ago cs.CR cs.AI
PDF
Attack HIGH
Balachandra Devarangadi Sunil, Isheeta Sinha, Piyush Maheshwari +3 more
Large language model agents equipped with persistent memory are vulnerable to memory poisoning attacks, where adversaries inject malicious...
2 months ago cs.CR cs.MA
PDF
Attack HIGH
Songze Li, Ruishi He, Xiaojun Jia +2 more
Large Language Models (LLMs) face a significant threat from multi-turn jailbreak attacks, where adversaries progressively steer conversations to...
2 months ago cs.CR cs.LG
PDF
Attack HIGH
Badhan Chandra Das, Md Tasnim Jawad, Joaquin Molto +2 more
In recent years, the security vulnerabilities of Multi-modal Large Language Models (MLLMs) have become a serious concern in the Generative Artificial...
2 months ago cs.CR cs.AI
PDF
Attack HIGH
Zhiyuan Chang, Mingyang Li, Yuekai Huang +6 more
Large language model (LLM)-integrated applications have become increasingly prevalent, yet face critical security vulnerabilities from prompt...
2 months ago cs.AI cs.CR
PDF
Attack HIGH
Hoagy Cunningham, Jerry Wei, Zihan Wang +26 more
We introduce enhanced Constitutional Classifiers that deliver production-grade jailbreak robustness with dramatically reduced computational costs and...
2 months ago cs.CR cs.AI
PDF
Attack HIGH
Ahmad Mohammad Saber, Saeed Jafari, Zhengmao Ouyang +3 more
This paper presents a large language model (LLM)-based framework that adapts and fine-tunes compact LLMs for detecting cyberattacks on transformer...
2 months ago cs.CR cs.LG eess.SP
PDF
Attack HIGH
Iago Alves Brito, Walcy Santos Rezende Rios, Julia Soares Dollis +2 more
Current safety evaluations of large language models (LLMs) create a dangerous illusion of universality, aggregating "Identity Hate" into scalar...
2 months ago cs.CL cs.AI
PDF
Attack HIGH
Yu Yan, Sheng Sun, Mingfeng Li +6 more
Recently, people have suffered from LLM hallucination and have become increasingly aware of the reliability gap of LLMs in open and...
Attack HIGH
Siyuan Li, Xi Lin, Jun Wu +5 more
Jailbreak attacks pose significant threats to large language models (LLMs), enabling attackers to bypass safeguards. However, existing reactive...
2 months ago cs.CR cs.AI
PDF
Attack HIGH
Ji Guo, Wenbo Jiang, Yansong Lin +7 more
Vision-Language-Action (VLA) models are widely deployed in safety-critical embodied AI applications such as robotics. However, their complex...
2 months ago cs.CR cs.LG
PDF
Attack HIGH
Hang Fu, Wanli Peng, Yinghan Zhou +3 more
The widespread adoption of Large Language Model (LLM) in commercial and research settings has intensified the need for robust intellectual property...
Attack HIGH
Binh Nguyen, Thai Le
Audio Language Models (ALMs) offer a promising shift towards explainable audio deepfake detections (ADDs), moving beyond \textit{black-box}...
2 months ago cs.CL cs.SD eess.AS
PDF
Attack HIGH
Xiao Lin, Philip Li, Zhichen Zeng +6 more
Despite rich safety alignment strategies, large language models (LLMs) remain highly susceptible to jailbreak attacks, which compromise safety...
2 months ago cs.LG cs.AI cs.IR
PDF
Attack HIGH
Zhakshylyk Nurlanov, Frank R. Schmidt, Florian Bernard
As Large Language Models (LLMs) are increasingly deployed in safety-critical domains, rigorously evaluating their robustness against adversarial...
2 months ago cs.LG cs.AI cs.CR
PDF
Attack HIGH
Xi Wang, Songlei Jian, Shasha Li +5 more
Despite extensive safety alignment, Large Language Models (LLMs) often fail against jailbreak attacks. While machine unlearning has emerged as a...
2 months ago cs.CR cs.AI
PDF
Attack HIGH
Yuetian Chen, Yuntao Du, Kaiyuan Zhang +4 more
Most membership inference attacks (MIAs) against Large Language Models (LLMs) rely on global signals, like average loss, to identify training data....
2 months ago cs.CL cs.AI cs.CR
PDF
Attack HIGH
Dinghong Song, Zhiwei Xu, Hai Wan +3 more
Model quantization is critical for deploying large language models (LLMs) on resource-constrained hardware, yet recent work has revealed severe...
2 months ago cs.CR cs.LG
PDF
Attack HIGH
Scott Thornton
Large language models remain vulnerable to jailbreak attacks, and single-layer defenses often trade security for usability. We present TRYLOCK, the...
2 months ago cs.CR cs.LG
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial