Benchmark MEDIUM
Ricardo Bessa, Rui Claro, João Trindade +1 more
The application of Machine Learning techniques in code generation is now a common practice for most developers. Tools such as ChatGPT from OpenAI...
Benchmark LOW
Dzenan Hamzic, Florian Skopik, Max Landauer +2 more
Cyber threat intelligence (CTI) analysts must answer complex questions over large collections of narrative security reports. Retrieval-augmented...
1 months ago cs.AI cs.CR
PDF
Other MEDIUM
Yiran Ling, Wenxuan Li, Siying Dong +5 more
Robot grasping of desktop object is widely used in intelligent manufacturing, logistics, and agriculture.Although vision-language models (VLMs) show...
Tool HIGH
Yihao Zhang, Kai Wang, Jiangrong Wu +7 more
Large Language Models (LLMs) face prominent security risks from jailbreaking, a practice that manipulates models to bypass built-in security...
1 months ago cs.CR cs.AI cs.CL
PDF
Attack LOW
Zhixiang Lu, Jionglong Su
Multimodal Large Language Models (MLLMs) in healthcare suffer from severe confirmation bias, often hallucinating visual details to support initial,...
Attack HIGH
Navid Azimi, Aditya Prakash, Yao Wang +1 more
Deep neural networks remain highly vulnerable to adversarial perturbations, limiting their reliability in security- and safety-critical applications....
1 months ago cs.CR cs.AI cs.CV
PDF
Attack MEDIUM
Shuhao Zhang, Yuli Chen, Jiale Han +2 more
Watermarking provides a critical safeguard for large language model (LLM) services by facilitating the detection of LLM-generated text....
1 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Xiaomeng Hu, Yinger Zhang, Fei Huang +7 more
AI agents are expected to perform professional work across hundreds of occupational domains (from emergency department triage to nuclear reactor...
Other LOW
Maria Camporese, Fabio Massacci, Yuanjun Gong
[Background:] Thematic analysis of free-text justifications in human experiments provides significant qualitative insights. Yet, it is costly because...
1 months ago cs.SE cs.AI
PDF
Attack HIGH
Yuanbo Xie, Yingjie Zhang, Yulin Li +5 more
Retrieval-Augmented Generation (RAG) systems augment large language models with external knowledge, yet introduce a critical security vulnerability:...
1 months ago cs.CR cs.AI cs.CL
PDF
Tool HIGH
Vu Tuan Truong, Long Bao Le
Large Language Models (LLMs), despite their impressive capabilities across domains, have been shown to be vulnerable to backdoor attacks. Prior...
1 months ago cs.CR cs.AI
PDF
Other LOW
Jordi Cabot
There is a pressing need for better development methods and tools to keep up with the growing demand and increasing complexity of new software...
1 months ago cs.SE cs.AI
PDF
Benchmark MEDIUM
Yuchen Chen, Yuan Xiao, Chunrong Fang +2 more
The proliferation of large language models for code (CodeLMs) and open-source contributions has heightened concerns over unauthorized use of source...
Defense MEDIUM
Xuwei Ding, Skylar Zhai, Linxin Song +6 more
Computer-use agents (CUAs) can now autonomously complete complex tasks in real digital environments, but when misled, they can also be used to...
1 months ago cs.CR cs.AI
PDF
Benchmark LOW
Wenbo Hu, Xin Chen, Yan Gao-Tian +3 more
Group Relative Policy Optimization (GRPO) has emerged as the de facto Reinforcement Learning (RL) objective driving recent advancements in Multimodal...
1 months ago cs.CV cs.AI cs.CL
PDF
Benchmark HIGH
Runpeng Geng, Chenlong Yin, Yanting Wang +2 more
Prompt injection attacks pose serious security risks across a wide range of real-world applications. While receiving increasing attention, the...
1 months ago cs.CR cs.AI cs.CL
PDF
Defense HIGH
Kevin Lira, Baldoino Fonseca, Davy Baía +2 more
Large Language Models (LLMs) have been a promising way for automated vulnerability detection. However, most prior studies have explored the use of...
1 months ago cs.SE cs.CR
PDF
Attack HIGH
Hanzhi Liu, Chaofan Shou, Hongbo Wen +3 more
Large language model (LLM) agents increasingly rely on third-party API routers to dispatch tool-calling requests across multiple upstream providers....
Benchmark MEDIUM
Wenhao Yuan, Chenchen Lin, Jian Chen +3 more
In large language model (LLM) agents, reasoning trajectories are treated as reliable internal beliefs for guiding actions and updating memory....
1 months ago cs.AI cs.CL
PDF
Attack MEDIUM
Nam Duong Tran, Phi Le Nguyen
Recent advances in Vision-Language Models (VLMs) have greatly enhanced the integration of visual perception and linguistic reasoning, driving rapid...
1 months ago cs.CV cs.AI
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial