Attack HIGH
Evangelos Lamprou, Julian Dai, Grigoris Ntousakis +2 more
Software supply-chain attacks are an important and ongoing concern in the open source software ecosystem. These attacks maintain the standard...
Attack HIGH
Xiaoyu Xue, Yuni Lai, Chenxi Huang +4 more
The emergence of graph foundation models (GFMs), particularly those incorporating language models (LMs), has revolutionized graph learning and...
5 months ago cs.CR cs.AI
PDF
Attack MEDIUM
Andrew Zhao, Reshmi Ghosh, Vitor Carvalho +4 more
Large language model (LLM) systems increasingly power everyday AI applications such as chatbots, computer-use assistants, and autonomous robots,...
5 months ago cs.LG cs.AI cs.CL
PDF
Defense MEDIUM
Mason Nakamura, Abhinav Kumar, Saaduddin Mahmud +3 more
A multi-agent system (MAS) powered by large language models (LLMs) can automate tedious user tasks such as meeting scheduling that requires...
5 months ago cs.AI cs.CL cs.CR
PDF
Attack HIGH
Yingguang Yang, Xianghua Zeng, Qi Wu +5 more
Social networks have become a crucial source of real-time information for individuals. The influence of social bots within these platforms has...
5 months ago cs.LG cs.AI cs.CR
PDF
Attack MEDIUM
Fanchao Meng, Jiaping Gui, Yunbo Li +1 more
Modern Network Intrusion Detection Systems generate vast volumes of low-level alerts, yet these outputs remain semantically fragmented, requiring...
Benchmark HIGH
Trilok Padhi, Pinxian Lu, Abdulkadir Erol +5 more
Large Language Model (LLM) agents are powering a growing share of interactive web applications, yet remain vulnerable to misuse and harm. Prior...
Tool MEDIUM
Edoardo Allegrini, Ananth Shreekumar, Z. Berkay Celik
Agentic AI systems, which leverage multiple autonomous agents and Large Language Models (LLMs), are increasingly used to address complex, multi-step...
5 months ago cs.AI cs.CR cs.MA
PDF
Benchmark LOW
Matan Levi, Daniel Ohayon, Ariel Blobstein +3 more
Large language models (LLMs) are transforming everyday applications, yet deployment in cybersecurity lags due to a lack of high-quality,...
5 months ago cs.CL cs.AI cs.CR
PDF
Attack MEDIUM
Jianzhu Yao, Hongxu Su, Taobo Liao +4 more
Neural networks increasingly run on hardware outside the user's control (cloud GPUs, inference marketplaces). Yet ML-as-a-Service reveals little...
5 months ago cs.CR cs.AI cs.LG
PDF
Benchmark MEDIUM
Qiushi Wu, Yue Xiao, Dhilung Kirat +3 more
Fixing bugs in large programs is a challenging task that demands substantial time and effort. Once a bug is found, it is reported to the project...
5 months ago cs.SE cs.AI
PDF
Attack HIGH
Abdulrahman Alhaidari, Balaji Palanisamy, Prashant Krishnamurthy
Billions of dollars are lost every year in DeFi platforms by transactions exploiting business logic or accounting vulnerabilities. Existing defenses...
5 months ago cs.CR cs.AI cs.DC
PDF
Attack HIGH
Wei Zou, Yupei Liu, Yanting Wang +3 more
LLM-integrated applications are vulnerable to prompt injection attacks, where an attacker contaminates the input to inject malicious instructions,...
5 months ago cs.CR cs.LG
PDF
Benchmark MEDIUM
Yibo Peng, James Song, Lei Li +6 more
Code agents are increasingly trusted to autonomously fix bugs on platforms such as GitHub, yet their security evaluation focuses almost exclusively...
5 months ago cs.CR cs.SE
PDF
Benchmark LOW
Xiuyuan Chen, Tao Sun, Dexin Su +38 more
Current benchmarks for AI clinician systems, often based on multiple-choice exams or manual rubrics, fail to capture the depth, robustness, and...
Benchmark MEDIUM
Jonghyun Park, Minhyuk Seo, Jonghyun Choi
One of the key challenges of modern AI models is ensuring that they provide helpful responses to benign queries while refusing malicious ones. But...
Benchmark HIGH
Ivan Dubrovsky, Anastasia Orlova, Illarion Iov +3 more
Benchmarking outcomes increasingly govern trust, selection, and deployment of LLMs, yet these evaluations remain vulnerable to semantically...
Attack HIGH
Avihay Cohen
Large Language Model (LLM) based agents integrated into web browsers (often called agentic AI browsers) offer powerful automation of web tasks....
5 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Xin Zhao, Xiaojun Chen, Bingshan Liu +3 more
Large language models (LLMs) with Mixture-of-Experts (MoE) architectures achieve impressive performance and efficiency by dynamically routing inputs...
Tool MEDIUM
Yisen Wang, Yichuan Mo, Hongjun Wang +2 more
Despite the rapid progress of neural networks, they remain highly vulnerable to adversarial examples, for which adversarial training (AT) is...
5 months ago cs.LG cs.AI cs.CR
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial