Benchmark MEDIUM
Dachuan Lin, Guobin Shen, Zihao Yang +3 more
Safety evaluation of large language models (LLMs) increasingly relies on LLM-as-a-judge pipelines, but strong judges can still be expensive to use at...
4 months ago cs.AI cs.CR
PDF
Benchmark LOW
Azanzi Jiomekong, Jean Bikim, Patricia Negoue +1 more
Evaluating semantic tables interpretation (STI) systems, (particularly, those based on Large Language Models- LLMs) especially in domain-specific...
Tool HIGH
Seif Ikbarieh, Kshitiz Aryal, Maanak Gupta
The rapid expansion of the Internet of Things (IoT) is reshaping communication and operational practices across industries, but it also broadens the...
4 months ago cs.CR cs.AI
PDF
Attack MEDIUM
Dilli Prasad Sharma, Liang Xue, Xiaowei Sun +2 more
The rapid proliferation of Internet of Things (IoT) devices has transformed numerous industries by enabling seamless connectivity and data-driven...
4 months ago cs.CR cs.AI cs.CL
PDF
Attack HIGH
Alina Fastowski, Bardh Prenkaj, Yuxiao Li +1 more
LLMs are now an integral part of information retrieval. As such, their role as question answering chatbots raises significant concerns due to their...
4 months ago cs.CR cs.AI cs.CL
PDF
Tool MEDIUM
Jiayi Fu, Yuansen Zhang, Yinggui Wang
Large Language Models (LLMs) demonstrate strong capabilities in solving complex tasks when integrated with external tools. The Model Context Protocol...
4 months ago cs.CR cs.CL
PDF
Attack MEDIUM
Viet Nguyen, Vishal M. Patel
Recent advancements in large-scale generative models have enabled the creation of high-quality images and videos, but have also raised significant...
4 months ago cs.CV cs.AI cs.CR
PDF
Attack HIGH
Yigitcan Kaya, Anton Landerer, Stijn Pletinckx +3 more
Prompt injection attacks pose a critical threat to large language models (LLMs), with prior work focusing on cutting-edge LLM applications like...
4 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Amr Gomaa, Ahmed Salem, Sahar Abdelnabi
As language models evolve into autonomous agents that act and communicate on behalf of users, ensuring safety in multi-agent ecosystems becomes a...
4 months ago cs.CR cs.CL cs.CY
PDF
Attack HIGH
Janet Jenq, Hongda Shen
Multimodal product retrieval systems in e-commerce platforms rely on effectively combining visual and textual signals to improve search relevance and...
Benchmark MEDIUM
Ishan Kavathekar, Hemang Jain, Ameya Rathod +2 more
Large Language Models (LLMs) have demonstrated strong capabilities as autonomous agents through tool use, planning, and decision-making abilities,...
4 months ago cs.MA cs.AI
PDF
Attack HIGH
Mohammad Karami, Mohammad Reza Nemati, Aidin Kazemi +3 more
Artificial intelligence (AI) has shown great potential in medical imaging, particularly for brain tumor detection using Magnetic Resonance Imaging...
4 months ago cs.LG cs.AI cs.CR
PDF
Benchmark MEDIUM
Hadi Reisizadeh, Jiajun Ruan, Yiwei Chen +3 more
Unlearning in large language models (LLMs) is critical for regulatory compliance and for building ethical generative AI systems that avoid producing...
Benchmark MEDIUM
Cyril Vallez, Alexander Sternfeld, Andrei Kucharavy +1 more
As the role of Large Language Models (LLM)-based coding assistants in software development becomes more critical, so does the role of the bugs they...
Attack MEDIUM
Raunak Somani, Aswani Kumar Cherukuri
This paper studies the integration off Large Language Models into cybersecurity tools and protocols. The main issue discussed in this paper is how...
Attack MEDIUM
Pedro Pereira, José Gouveia, João Vitorino +2 more
Magecart skimming attacks have emerged as a significant threat to client-side security and user trust in online payment systems. This paper addresses...
Tool MEDIUM
Tim Beyer, Jonas Dornbusch, Jakob Steimle +3 more
The rapid expansion of research on Large Language Model (LLM) safety and robustness has produced a fragmented and oftentimes buggy ecosystem of...
4 months ago cs.AI cs.SE
PDF
Attack HIGH
Hongwei Yao, Yun Xia, Shuo Shao +3 more
Large language models (LLMs) increasingly employ guardrails to enforce ethical, legal, and application-specific constraints on their outputs. While...
4 months ago cs.CR cs.CL
PDF
Defense MEDIUM
Oshando Johnson, Alexandra Fomina, Ranjith Krishnamurthy +3 more
The prevalence of security vulnerabilities has prompted companies to adopt static application security testing (SAST) tools for vulnerability...
4 months ago cs.SE cs.AI
PDF
Other MEDIUM
Hirohane Takagi, Gouki Minegishi, Shota Kizawa +2 more
Although behavioral studies have documented numerical reasoning errors in large language models (LLMs), the underlying representational mechanisms...
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial