Tool MEDIUM
Joel Rorseth, Parke Godfrey, Lukasz Golab +2 more
This paper demonstrates RUBEN, an interactive tool for discovering minimal rules to explain the outputs of retrieval-augmented large language models...
Tool MEDIUM
Michael A. Riegler, Inga Strümke
We present swarm-attack, an open-source adversarial testing framework in which multiple lightweight LLM agents coordinate through shared memory,...
2 days ago cs.CR cs.AI cs.LG
PDF
Tool MEDIUM
Chengjie Wang, Jingzheng Wu, Xiang Ling +2 more
Large language models (LLMs) are now largely involved in software development workflows, and the code they generate routinely includes third-party...
5 days ago cs.SE cs.AI
PDF
Tool MEDIUM
Kerri Prinos, Lilianne Brush, Cameron Denton +5 more
Agentic systems involved in high-stake decision-making under adversarial pressure need formal guarantees not offered by existing approaches....
1 weeks ago cs.AI cs.CR eess.SY
PDF
Tool MEDIUM
Mingming Zha, Xiaofeng Wang
Autonomous LLM agents operate as long-running processes with persistent workspaces, memory files, scheduled task state, and messaging integrations....
Tool MEDIUM
Neha Nagaraja, Hayretdin Bahsi, Carlo R. da Cunha
As large language models are integrated into autonomous robotic systems for task planning and control, compromised inputs or unsafe model outputs can...
1 weeks ago cs.CR cs.AI cs.RO
PDF
Tool MEDIUM
Kato Mivule
This paper extends the Classification Error Gauge (x-CEG) framework, originally developed for measuring the privacy-utility trade-off in tabular...
Tool MEDIUM
Mikko Lempinen, Joni Kemppainen, Niklas Raesalmi
As artificial intelligence (AI) systems are increasingly deployed across critical domains, their security vulnerabilities pose growing risks of...
2 weeks ago cs.CR cs.AI cs.CL
PDF
Tool MEDIUM
Yuan Fang, Yiming Luo, Aimin Zhou +1 more
Ensuring the safety of large language models (LLMs) requires robust red teaming, yet the systematic synthesis of high-quality toxic data remains...
3 weeks ago cs.CL cs.AI
PDF
Tool MEDIUM
Shangkun Che, Silin Du, Ge Gao
The widespread use of Large Language Models (LLMs) in text generation has raised increasing concerns about intellectual property disputes....
4 weeks ago cs.CR cs.CL
PDF
Tool MEDIUM
Hengkai Ye, Zhechang Zhang, Jinyuan Jia +1 more
Large language models (LLMs) increasingly rely on external tools to perform time-sensitive tasks and real-world actions. While tool integration...
Tool MEDIUM
Yen-Shan Chen, Sian-Yao Huang, Cheng-Lin Yang +1 more
As large language models (LLMs) evolve from static chatbots into autonomous agents, the primary vulnerability surface shifts from final outputs to...
1 months ago cs.CR cs.AI cs.CL
PDF
Tool MEDIUM
Yinghan Hou, Zongyou Yang
OpenClaw's ClawHub marketplace hosts over 13,000 community-contributed agent skills, and between 13% and 26% of them contain security vulnerabilities...
1 months ago cs.CR cs.AI
PDF
Tool MEDIUM
Shaofei Huang, Christopher M. Poskitt, Lwin Khin Shar
Cyber-physical systems often contend with incomplete architectural documentation or outdated information resulting from legacy technologies,...
1 months ago cs.CR cs.AI
PDF
Tool MEDIUM
Anes Abdennebi, Nadjia Kara, Laaziz Lahlou +1 more
Modern Security Operations Centers struggle with alert fatigue, fragmented tooling, and limited cross-source event correlation. Challenges that...
1 months ago cs.CR cs.AI
PDF
Tool MEDIUM
Wuyang Zhang, Shichao Pei
Tool-use large language model (LLM) agents are increasingly deployed to support sensitive workflows, relying on tool calls for retrieval, external...
1 months ago cs.CR cs.AI
PDF
Tool MEDIUM
Jiling Zhou, Aisvarya Adeseye, Seppo Virtanen +2 more
Chain-of-Thought (CoT) prompting has been used to enhance the reasoning capability of LLMs. However, its reliability in security-sensitive analytical...
1 months ago cs.CR cs.AI
PDF
Tool MEDIUM
Fariha Tanjim Shifat, Hariswar Baburaj, Ce Zhou +2 more
Large language models (LLMs) are increasingly embedded in open-source software (OSS) ecosystems, creating complex interactions among natural language...
1 months ago cs.CR cs.SE
PDF
Tool MEDIUM
Jihoon Jeong
AI models of equivalent capability can exhibit fundamentally different behavioral patterns, yet no standardized instrument exists to measure these...
1 months ago cs.AI cs.CL
PDF
Tool MEDIUM
Aymen Bouferroum, Valeria Loscri, Abderrahim Benslimane
The Industrial Internet of Things (IIoT) introduces significant security challenges as resource-constrained devices become increasingly integrated...
1 months ago cs.CR cs.LG
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial