Benchmark MEDIUM
Ziyao Tang, Pengkun Jiao, Bin Zhu +3 more
Video Large Language Models (Vid-LLMs) have demonstrated remarkable performance in video understanding tasks, yet their robustness under...
Defense MEDIUM
Ting Zhang, Yikun Li, Chengran Yang +15 more
Software vulnerabilities remain one of the most persistent threats to modern digital infrastructure. While static application security testing (SAST)...
Benchmark MEDIUM
Shozo Saeki, Minoru Kawahara, Hirohisa Aman
A nearest-neighbor framework is a fundamental tool for various applications involving Large Language Models (LLMs) and Visual Language Models (VLMs)....
Tool MEDIUM
Yuan Fang, Yiming Luo, Aimin Zhou +1 more
Ensuring the safety of large language models (LLMs) requires robust red teaming, yet the systematic synthesis of high-quality toxic data remains...
3 weeks ago cs.CL cs.AI
PDF
Benchmark MEDIUM
Yihao Zou, Tianming Zheng, Futai Zou +1 more
Fuzzing has become a widely adopted technique for vulnerability discovery, yet it remains ineffective for structured-input programs due to strict...
3 weeks ago cs.CR cs.PL
PDF
Benchmark LOW
Khang Tran, Khoa Nguyen, Cristian Borcea +1 more
Recent advances in large language models for test case generation have improved branch coverage via prompt-engineered mutations. However, they still...
3 weeks ago cs.SE cs.LG
PDF
Benchmark HIGH
Ivan Bercovich, Ivgeni Segal, Kexun Zhang +3 more
We release Terminal Wrench, a subset of 331 terminal-agent benchmark environments, copied from the popular open benchmarks that are demonstrably...
3 weeks ago cs.CR cs.AI
PDF
Defense MEDIUM
Hailin Liu, Eugene Ilyushin, Jie Ni +1 more
Large language model (LLM) agents are vulnerable to prompt-injection attacks that propagate through multi-step workflows, tool interactions, and...
3 weeks ago cs.AI cs.MA
PDF
Attack MEDIUM
Jianming Tong, Hanshen Xiao, Krishna Kumar Nair +5 more
Multi-user virtual reality enables immersive interaction. However, rendering avatars for numerous participants on each headset incurs prohibitive...
3 weeks ago cs.CR cs.AR cs.CV
PDF
Benchmark MEDIUM
Dongwook Lee, Eunwoo Song, Che Hyun Lee +2 more
While recent Spoken Language Models (SLMs) have been actively deployed in real-world scenarios, they lack the capability to discern Third-Party...
3 weeks ago cs.CL cs.AI cs.SD
PDF
Benchmark MEDIUM
Rina Mishra, Gaurav Varshney, Doddipatla Sesha Sahithi
The rapid adoption of open-source Large Language Models (LLMs) in offline and enterprise environments has introduced a largely unexamined security...
Benchmark LOW
Madhav Agarwal, Sotirios A. Tsaftaris, Laura Sevilla-Lara +1 more
Understanding emotions is a fundamental ability for intelligent systems to be able to interact with humans. Vision-language models (VLMs) have made...
3 weeks ago cs.CV cs.AI
PDF
Other MEDIUM
XiangRui Zhang, Qiang Li, Haining Wang
Binary analysis increasingly relies on large language models (LLMs) to perform semantic reasoning over complex program behaviors. However, existing...
Attack HIGH
Haochun Tang, Yuliang Yan, Jiahua Lu +2 more
Cost-aware routing dynamically dispatches user queries to models of varying capability to balance performance and inference cost. However, the...
3 weeks ago cs.CR cs.AI cs.CL
PDF
Attack MEDIUM
Xuanli He, Bilgehan Sel, Faizan Ali +3 more
Large Language Models (LLMs) are increasingly exposed to adaptive jailbreaking, particularly in high-stakes Chemical, Biological, Radiological, and...
3 weeks ago cs.CL cs.CR
PDF
Other LOW
Peifeng Zhang, Zice Qiu, Donghua Yu +4 more
In continual visual question answering (VQA), existing Continual Learning (CL) methods are mostly built for symmetric, unimodal architectures....
3 weeks ago cs.CV cs.CL
PDF
Attack HIGH
Meng Chen, Kun Wang, Li Lu +2 more
Modern Large audio-language models (LALMs) power intelligent voice interactions by tightly integrating audio and text. This integration, however,...
3 weeks ago cs.CR cs.AI cs.SD
PDF
Attack MEDIUM
Firas Ben Hmida, Philemon Hailemariam, Kashif Ali Khan +1 more
Deep neural networks (DNNs) remain largely opaque at inference time, limiting our ability to detect and diagnose malicious input manipulations such...
Attack HIGH
Fortunatus Aabangbio Wulnye, Justice Owusu Agyemang, Kwame Opuni-Boachie Obour Agyekum +3 more
Ensuring the reliability of machine learning-based intrusion detection systems remains a critical challenge in Internet of Things (IoT) environments,...
3 weeks ago cs.CR cs.AI
PDF
Attack MEDIUM
Pavel Chizhov, Egor Bogomolov, Ivan P. Yamshchikov
Efficiency and safety of Large Language Models (LLMs), among other factors, rely on the quality of tokenization. A good tokenizer not only improves...
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial