Benchmark LOW
Jian Wang, Xiaofei Xie, Qiang Hu +4 more
Automated Program Repair (APR) plays a critical role in enhancing the quality and reliability of software systems. While substantial progress has...
Tool HIGH
Hyeseon An, Shinwoo Park, Suyeon Woo +1 more
The promise of LLM watermarking rests on a core assumption that a specific watermark proves authorship by a specific model. We demonstrate that this...
5 months ago cs.CR cs.AI
PDF
Benchmark LOW
Jidong Li, Lingyong Fang, Haodong Zhao +2 more
Multimodal large language models (MLLMs) have witnessed astonishing advancements in recent years. Despite these successes, MLLMs remain vulnerable to...
5 months ago cs.CL cs.AI
PDF
Tool MEDIUM
Qizhou Peng, Yang Zheng, Yu Wen +2 more
Reinforcement learning (RL) has been an important machine learning paradigm for solving long-horizon sequential decision-making problems under...
5 months ago cs.LG cs.CR
PDF
Attack HIGH
Zonghuan Xu, Jiayu Li, Yunhan Zhao +3 more
Vision-Language-Action (VLA) models map multimodal perception and language instructions to executable robot actions, making them particularly...
5 months ago cs.CR cs.AI cs.RO
PDF
Attack MEDIUM
Zaixi Zhang, Souradip Chakraborty, Amrit Singh Bedi +16 more
The rapid adoption of generative artificial intelligence (GenAI) in the biosciences is transforming biotechnology, medicine, and synthetic biology....
5 months ago cs.CR q-bio.BM
PDF
Attack MEDIUM
Tiarnaigh Downey-Webb, Olamide Jogunola, Oluwaseun Ajao
This paper presents a systematic security assessment of four prominent Large Language Models (LLMs) against diverse adversarial attack vectors. We...
5 months ago cs.CR cs.AI cs.CY
PDF
Benchmark LOW
Norbert Tihanyi, Bilel Cherif, Richard A. Dubniczky +2 more
In this paper, we present the first large-scale study exploring whether JavaScript code generated by Large Language Models (LLMs) can reveal which...
5 months ago cs.CR cs.LG
PDF
Benchmark MEDIUM
Mohan Zhang, Yihua Zhang, Jinghan Jia +3 more
Modern large reasoning models (LRMs) exhibit impressive multi-step problem-solving via chain-of-thought (CoT) reasoning. However, this iterative...
5 months ago cs.LG cs.AI cs.CR
PDF
Attack HIGH
Ming Tan, Wei Li, Hu Tao +4 more
Open-source large language models (LLMs) have demonstrated considerable dominance over proprietary LLMs in resolving neural processing tasks, thanks...
5 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Shaolun Liu, Sina Marefat, Omar Tsai +4 more
GraphQL's flexible query model and nested data dependencies expose APIs to complex, context-dependent vulnerabilities that are difficult to uncover...
5 months ago cs.CR cs.SE
PDF
Other LOW
Shingo Kodama, Haya Diwan, Lucas Rosenblatt +2 more
The rapid spread of text generated by large language models (LLMs) makes it increasingly difficult to distinguish authentic human writing from...
5 months ago cs.CR cs.LG
PDF
Attack HIGH
Guan-Yan Yang, Tzu-Yu Cheng, Ya-Wen Teng +2 more
The integration of Large Language Models (LLMs) into computer applications has introduced transformative capabilities but also significant security...
5 months ago cs.CR cs.AI cs.CL
PDF
Attack HIGH
Wentian Zhu, Zhen Xiang, Wei Niu +1 more
Unlike regular tokens derived from existing text corpora, special tokens are artificially created to annotate structured conversations during the...
5 months ago cs.CR cs.AI
PDF
Attack HIGH
Yutao Wu, Xiao Liu, Yinghui Li +5 more
Knowledge poisoning poses a critical threat to Retrieval-Augmented Generation (RAG) systems by injecting adversarial content into knowledge bases,...
5 months ago cs.CL cs.AI cs.CR
PDF
Attack HIGH
Mengyao Zhao, Kaixuan Li, Lyuye Zhang +4 more
Recent advances in Large Language Models (LLMs) have brought remarkable progress in code understanding and reasoning, creating new opportunities and...
Attack HIGH
Yue Deng, Francisco Santos, Pang-Ning Tan +1 more
Deep learning based weather forecasting (DLWF) models leverage past weather observations to generate future forecasts, supporting a wide range of...
5 months ago cs.LG cs.CR stat.ML
PDF
Benchmark MEDIUM
Zonghao Ying, Yangguang Shao, Jianle Gan +9 more
Large vision-language model (LVLM)-based web agents are emerging as powerful tools for automating complex online tasks. However, when deployed in...
5 months ago cs.CR cs.CV
PDF
Defense MEDIUM
Yuyi Huang, Runzhe Zhan, Lidia S. Chao +2 more
As large language models (LLMs) are increasingly deployed for complex reasoning tasks, Long Chain-of-Thought (Long-CoT) prompting has emerged as a...
Attack HIGH
Ruizhe Zhu
The widespread application of large vision language models has significantly raised safety concerns. In this project, we investigate text prompt...
5 months ago cs.CL cs.CV
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial