Attack MEDIUM
Feng Zhang, Shijia Li, Chunmao Zhang +7 more
User simulators serve as the critical interactive environment for agent post-training, and an ideal user simulator generalizes across domains and...
Defense MEDIUM
Caitlin A. Stamatis, Jonah Meyerhoff, Richard Zhang +3 more
Large language models (LLMs) are increasingly used for mental health support, yet existing safety evaluations rely primarily on small,...
2 months ago cs.CY cs.CL
Attack MEDIUM
Renyang Liu, Kangjie Chen, Han Qiu +4 more
Image generation models (IGMs), while capable of producing impressive and creative content, often memorize a wide range of undesirable concepts from...
2 months ago cs.CV cs.AI cs.CR
Benchmark MEDIUM
Ziqi Ding, Yunfeng Wan, Wei Song +7 more
CAPTCHAs are widely used by websites to block bots and spam by presenting challenges that are easy for humans but difficult for automated programs to...
2 months ago cs.SD cs.CY eess.AS
Benchmark MEDIUM
Seong-Gyu Park, Sohee Park, Jisu Lee +2 more
Recent LLMs increasingly integrate reasoning mechanisms like Chain-of-Thought (CoT). However, this explicit reasoning exposes a new attack surface...
2 months ago cs.CL cs.CR cs.LG
Benchmark MEDIUM
Erin Feiglin, Nir Hutnik, Raz Lapid
We investigate a failure mode of large language models (LLMs) in which plain-text prompts elicit excessive outputs, a phenomenon we term Overflow....
2 months ago cs.CL cs.AI
Attack MEDIUM
Abdelaziz Bounhar, Rania Hossam Elmohamady Elbadry, Hadi Abdine +3 more
Steering Large Language Models (LLMs) through activation interventions has emerged as a lightweight alternative to fine-tuning for alignment and...
Defense MEDIUM
Zhenhua Xu, Yiran Zhao, Mengting Zhong +4 more
The rapid growth of large language models raises pressing concerns about intellectual property protection under black-box deployment. Existing...
2 months ago cs.CR cs.AI
Benchmark MEDIUM
Dongryeol Lee, Yerin Hwang, Taegwan Kang +3 more
While large language models (LLMs) are increasingly used as automatic judges for question answering (QA) and other reference-conditioned evaluation...
Attack MEDIUM
Ruiqi Li, Zhiqiang Wang, Yunhao Yao +1 more
To standardize interactions between LLM-based agents and their environments, the Model Context Protocol (MCP) was proposed and has since been widely...
2 months ago cs.CR cs.AI
Benchmark MEDIUM
Weipeng Jiang, Xiaoyu Zhang, Juan Zhai +3 more
Emoticons are widely used in digital communication to convey affective intent, yet their safety implications for Large Language Models (LLMs) remain...
2 months ago cs.CR cs.AI cs.SE
Defense MEDIUM
Mingxiang Tao, Yu Tian, Wenxuan Tu +3 more
Federated learning (FL) addresses data privacy and silo issues in large language models (LLMs). Most prior work focuses on improving the training...
2 months ago cs.CR cs.AI
Tool MEDIUM
Yixiao Peng, Hao Hu, Feiyang Li +5 more
While virtualization and resource pooling empower cloud networks with structural flexibility and elastic scalability, they inevitably expand the...
2 months ago cs.CR cs.AI cs.LG
Benchmark MEDIUM
Ying Zhou, Jiacheng Wei, Yu Qi +2 more
Large language models (LLMs) demonstrate remarkable capabilities in natural language understanding and generation. Despite being trained on...
2 months ago cs.CR cs.AI
Survey MEDIUM
Huihui Huang, Jieke Shi, Junkai Chen +6 more
Penetration testing is essential for identifying vulnerabilities in web applications before real adversaries can exploit them. Recent work has...
Survey MEDIUM
Takaaki Toda, Tatsuya Mori
Modern software package registries like PyPI have become critical infrastructure for software development, but are increasingly exploited by threat...
2 months ago cs.CR cs.SE
Benchmark MEDIUM
Vasanth Iyer, Leonardo Bobadilla, S. S. Iyengar
Large Language Models (LLMs) such as Gemma-2B have shown strong performance in various natural language processing tasks. However, general-purpose...
Benchmark MEDIUM
Qiang Zhang, Elena Emma Wang, Jiaming Li +1 more
This study presents a Secure Multi-Tenant Architecture (SMTA) combined with a novel Burn-After-Use (BAU) mechanism for enterprise LLM...
2 months ago cs.CR cs.AI
Defense MEDIUM
Imtiaz Ali Soomro, Hamood Ur Rehman, S. Jawad Hussain +3 more
The rapid proliferation of Internet of Things (IoT) devices across domains such as smart homes, industrial control systems, and healthcare networks...
2 months ago cs.CR cs.NI
Attack MEDIUM
Chao Liu, Ngai-Man Cheung
3D Vision-Language Models (VLMs), such as PointLLM and GPT4Point, have shown strong reasoning and generalization abilities in 3D understanding tasks....