Tool HIGH
Yuchuan Zhao, Tong Chen, Junliang Yu +3 more
Large language model-powered sequential recommender systems (LLM-SRSs) have recently demonstrated remarkable performance, enabling recommendations...
Survey MEDIUM
Jialiang Wang, Yuchen Liu, Hang Xu +7 more
The volume of scientific submissions continues to climb, outpacing the capacity of qualified human referees and stretching editorial timelines. At...
Benchmark LOW
Pegah Khayatan, Jayneel Parekh, Arnaud Dapogny +3 more
Despite impressive progress in capabilities of large vision-language models (LVLMs), these systems remain vulnerable to hallucinations, i.e., outputs...
2 weeks ago cs.CV cs.AI cs.CL
PDF
Attack HIGH
Naheed Rayhan, Sohely Jahan
Large language models (LLMs) are increasingly integrated into sensitive workflows, raising the stakes for adversarial robustness and safety. This...
2 weeks ago cs.CR cs.AI
PDF
Attack HIGH
Zihan Wang, Rui Zhang, Yu Liu +4 more
LLM agents increasingly rely on skills to encapsulate reusable capabilities via progressively disclosed instructions. High-quality skills inject...
Attack HIGH
Jiali Wei, Ming Fan, Guoheng Sun +3 more
The growing application of large language models (LLMs) in safety-critical domains has raised urgent concerns about their security. Many recent...
2 weeks ago cs.CR cs.AI cs.CL
PDF
Tool HIGH
Run Hao, Zhuoran Tan
Model Context Protocol (MCP) is increasingly adopted for tool-integrated LLM agents, but its multi-layer design and third-party server ecosystem...
Other LOW
Arthur Douillard, Keith Rush, Yani Donchev +14 more
Modern large-scale language model pre-training relies heavily on the single program multiple data (SPMD) paradigm, which requires tight coupling...
Benchmark MEDIUM
Yuchen Shi, Xin Guo, Huajie Chen +3 more
Poisoning-based backdoor attacks pose significant threats to deep neural networks by embedding triggers in training data, causing models to...
2 weeks ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Vishal Rajput
We prove that empirical risk minimisation (ERM) imposes a necessary geometric constraint on learned representations: any encoder that minimises...
2 weeks ago cs.LG cs.AI cs.CV
PDF
Benchmark LOW
Yongcan Yu, Lingxiao He, Jian Liang +5 more
Test-time reinforcement learning (TTRL) always adapts models at inference time via pseudo-labeling, leaving it vulnerable to spurious optimization...
2 weeks ago cs.LG cs.AI cs.CL
PDF
Defense HIGH
Zhaohui Geoffrey Wang
Automated code vulnerability detection is critical for software security, yet existing approaches face a fundamental trade-off between detection...
2 weeks ago cs.CR cs.LG cs.SE
PDF
Attack HIGH
Guilin Deng, Silong Chen, Yuchuan Luo +6 more
Federated Large Language Models (FedLLMs) enable multiple parties to collaboratively fine-tune LLMs without sharing raw data, addressing challenges...
Attack HIGH
Jesse Zymet, Andy Luo, Swapnil Shinde +2 more
Many approaches to LLM red-teaming leverage an attacker LLM to discover jailbreaks against a target. Several of them task the attacker with...
2 weeks ago cs.CR cs.AI cs.CL
PDF
Attack MEDIUM
Irti Haq, Belén Saldías
As state-of-the-art Large Language Models (LLMs) have become ubiquitous, ensuring equitable performance across diverse demographics is critical....
2 weeks ago cs.CY cs.AI cs.CL
PDF
Benchmark MEDIUM
Ari Azarafrooz
AI-agent guardrails are memoryless: each message is judged in isolation, so an adversary who spreads a single attack across dozens of sessions slips...
2 weeks ago cs.CR cs.AI cs.CL
PDF
Benchmark MEDIUM
Mohammad Farhad, Shuvalaxmi Dass
Software security relies on effective vulnerability detection and patching, yet determining whether a patch fully eliminates risk remains an...
2 weeks ago cs.SE cs.CR
PDF
Attack HIGH
Yannis Belkhiter, Giulio Zizzo, Sergio Maffeis +2 more
The growth of agentic AI has drawn significant attention to function calling Large Language Models (LLMs), which are designed to extend the...
2 weeks ago cs.CR cs.AI cs.CL
PDF
Tool MEDIUM
Mikko Lempinen, Joni Kemppainen, Niklas Raesalmi
As artificial intelligence (AI) systems are increasingly deployed across critical domains, their security vulnerabilities pose growing risks of...
2 weeks ago cs.CR cs.AI cs.CL
PDF
Benchmark HIGH
Hanzhi Liu, Chaofan Shou, Xiaonan Liu +4 more
LLM agents have begun to find real security vulnerabilities that human auditors and automated fuzzers missed for decades, in source-available targets...
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial