Tool HIGH
Xiaoyi Pang, Xuanyi Hao, Pengyu Liu +3 more
Recent intelligent systems integrate powerful Large Language Models (LLMs) through APIs, but their trustworthiness may be critically undermined by...
3 weeks ago cs.CR cs.AI
Tool MEDIUM
Qingxiao Xu, Ze Sheng, Zhicheng Chen +1 more
Large language models (LLMs) have shown promise for automated patching, but their effectiveness depends strongly on how they are integrated into...
3 weeks ago cs.CR cs.SE
Tool MEDIUM
Yijun Yu
Agentic AI systems exhibit numerous crosscutting concerns -- security, observability, cost management, fault tolerance -- that are poorly modularized...
3 weeks ago cs.AI cs.SE
Tool MEDIUM
Reva Schwartz, Carina Westling, Morgan Briggs +12 more
This paper proposes CIRCLE, a six-stage, lifecycle-based framework to bridge the reality gap between model-centric performance metrics and AI's...
3 weeks ago cs.AI cs.SE
Tool MEDIUM
Chuanming Tang, Ling Qing, Shifeng Chen
The rapid evolution of sophisticated cyberattacks has strained modern Security Operations Centers (SOC), which traditionally rely on rule-based or...
3 weeks ago cs.CR cs.AI
Tool MEDIUM
Quanjun Zhang, Chengyu Gao, Yu Han +4 more
The rapid advancement of Large Language Models (LLMs) has led to the emergence of intelligent agents capable of autonomously interacting with...
Tool MEDIUM
Kimberly T. Mai, Anna Gausen, Magda Dubois +5 more
AI is increasingly being used to assist fraud and cybercrime. However, it is unclear the extent to which current large language models can provide...
Tool LOW
Yongchang Zhang, Oliver Ma, Tianyi Liu +2 more
Recent large vision-language models (LVLMs) have demonstrated impressive reasoning ability by generating long chain-of-thought (CoT) responses....
Tool HIGH
Xinfeng Li, Shenyu Dai, Kelong Zheng +4 more
Large language model (LLM) agents are rapidly becoming trusted copilots in high-stakes domains like software development and healthcare. However,...
4 weeks ago cs.HC cs.AI cs.CR
Tool HIGH
Che Wang, Jiaming Zhang, Ziqi Zhang +6 more
The integration of external data services (e.g., Model Context Protocol, MCP) has made large language model-based agents increasingly powerful for...
4 weeks ago cs.CR cs.AI
Tool HIGH
Ian Steenstra, Paola Pedrelli, Weiyan Shi +2 more
Large Language Models (LLMs) are increasingly utilized for mental health support; however, current safety benchmarks often fail to detect the...
1 month ago cs.CL cs.AI cs.CY
Tool MEDIUM
Yedi Zhang, Haoyu Wang, Xianglin Yang +2 more
LLM-enabled applications are rapidly reshaping the software ecosystem by using large language models as core reasoning components for complex task...
1 month ago cs.CR cs.AI cs.SE
Tool LOW
Jongwon Jeong, Jungtaek Kim, Kangwook Lee
Language Model (LM) agents have demonstrated remarkable capabilities in solving tasks that require multiple interactions with the environment....
Tool HIGH
Xingyu Shen, Tommy Duong, Xiaodong An +6 more
Age estimation systems are increasingly deployed as gatekeepers for age-restricted online content, yet their robustness to cosmetic modifications has...
1 month ago cs.CV cs.CR cs.LG
Tool MEDIUM
Florin Adrian Chitan
The proliferation of autonomous AI agents capable of executing real-world actions - filesystem operations, API calls, database modifications,...
1 month ago cs.AI cs.CR
Tool MEDIUM
Emmanuel Bamidele
Long-running LLM agents require persistent memory to preserve state across interactions, yet most deployed systems manage memory with age-based...
1 month ago cs.DC cs.AI cs.LG
Tool HIGH
Phan The Duy, Nghi Hoang Khoa, Nguyen Tran Anh Quan +3 more
The increasing deployment of Federated Learning (FL) in Intrusion Detection Systems (IDS) introduces new challenges related to data privacy,...
1 month ago cs.CR cs.AI
Tool LOW
Leon Staufer, Kevin Feng, Kevin Wei +6 more
Agentic AI systems are increasingly capable of performing professional and personal tasks with limited human involvement. However, tracking these...
1 month ago cs.CY cs.AI
Tool MEDIUM
Arnold Cartagena, Ariane Teixeira
Large language models deployed as agents increasingly interact with external systems through tool calls--actions with real-world consequences that...
1 month ago cs.AI cs.SE
Tool HIGH
Doron Shavit
Jailbreak prompts are a practical and evolving threat to large language models (LLMs), particularly in agentic systems that execute tools over...
1 month ago cs.CR cs.AI