Benchmark LOW
Gloria Felicia, Michael Eniolade, Jinfeng He +4 more
Existing agent safety benchmarks report binary accuracy, conflating early intervention with post-mortem analysis. A detector that flags a violation...
3 months ago cs.LG cs.AI cs.CR
PDF
Attack HIGH
Xiaogeng Liu, Xinyan Wang, Yechao Zhang +5 more
Large reasoning models (LRMs) extend large language models with explicit multi-step reasoning traces, but this capability introduces a new class of...
3 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Xiaoyu Xu, Minxin Du, Kun Fang +6 more
Large language models (LLMs) demonstrate impressive capabilities across diverse tasks but raise concerns about privacy, copyright, and harmful...
3 months ago cs.CL cs.AI cs.CR
PDF
Attack MEDIUM
Mingyang Liao, Yichen Wan, shuchen wu +6 more
LLM-based role-playing has rapidly improved in fidelity, yet stronger adherence to persona constraints commonly increases vulnerability to jailbreak...
Attack HIGH
Ningyuan He, Ronghong Huang, Qianqian Tang +3 more
In-context learning (ICL) has become a powerful, data-efficient paradigm for text classification using large language models. However, its robustness...
Attack MEDIUM
Wenhui Zhang, Huiyu Xu, Zhibo Wang +4 more
Recent advancements in multi-model AI systems have leveraged LLM routers to reduce computational cost while maintaining response quality by assigning...
Benchmark MEDIUM
Devanshu Sahoo, Manish Prasad, Vasudev Majhi +5 more
The rapid integration of Large Language Models (LLMs) into educational assessment rests on the unverified assumption that instruction following...
3 months ago cs.CL cs.AI cs.ET
PDF
Tool MEDIUM
Xiang Zheng, Yutao Wu, Hanxun Huang +5 more
Autonomous code agents built on large language models are reshaping software and AI development through tool use, long-horizon reasoning, and...
Attack MEDIUM
Alvi Md Ishmam, Najibul Haque Sarker, Zaber Ibn Abdul Hakim +1 more
Multimodal Large Language Models (MLLMs) have achieved remarkable performance across vision-language tasks. Recent advancements allow these models to...
Attack MEDIUM
Arther Tian, Alex Ding, Frank Chen +2 more
Decentralized large language model inference networks require lightweight mechanisms to reward high quality outputs under heterogeneous latency and...
3 months ago cs.CR cs.AI
PDF
Attack MEDIUM
Jarrod Barnes
As large language models (LLMs) improve, so do their offensive applications: frontier agents now generate working exploits for under $50 in compute...
Attack MEDIUM
Onkar Shelar, Travis Desell
Evolutionary prompt search is a practical black-box approach for red teaming large language models (LLMs), but existing methods often collapse onto a...
3 months ago cs.NE q-bio.PE
PDF
Benchmark LOW
Mingqiao Mo, Yunlong Tan, Hao Zhang +2 more
Large language models (LLMs) have achieved remarkable progress in code generation, yet their potential for software protection remains largely...
Attack HIGH
Xingwei Lin, Wenhao Lin, Sicong Cao +4 more
Multi-turn jailbreak attacks have emerged as a critical threat to Large Language Models (LLMs), bypassing safety mechanisms by progressively...
3 months ago cs.CR cs.AI
PDF
Attack MEDIUM
Yizhong Ding
Webshells remain a primary foothold for attackers to compromise servers, particularly within PHP ecosystems. However, existing detection mechanisms...
3 months ago cs.CR cs.AI
PDF
Defense MEDIUM
Holly Trikilis, Pasindu Marasinghe, Fariza Rashid +1 more
Phishing continues to be one of the most prevalent attack vectors, making accurate classification of phishing URLs essential. Recently, large...
3 months ago cs.CR cs.AI
PDF
Survey MEDIUM
Mohsen Hatami, Van Tuan Pham, Hozefa Lakadawala +1 more
The increasing integration of AI agents into cyber-physical systems (CPS) introduces new security risks that extend beyond traditional cyber or...
3 months ago cs.CR cs.DC
PDF
Attack HIGH
Yuetian Chen, Kaiyuan Zhang, Yuntao Du +5 more
Diffusion Language Models (DLMs) represent a promising alternative to autoregressive language models, using bidirectional masked token prediction....
3 months ago cs.LG cs.AI
PDF
Benchmark LOW
Faezeh Hosseini, Mohammadali Yousefzadeh, Yadollah Yaghoobzadeh
Figurative language, particularly fixed figurative expressions (FFEs) such as idioms and proverbs, poses persistent challenges for large language...
Attack HIGH
Md Tasnim Jawad, Mingyan Xiao, Yanzhao Wu
With the widespread adoption of Large Language Models (LLMs) and increasingly stringent privacy regulations, protecting data privacy in LLMs has...
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial