Attack HIGH
Gamze Kirman Tokgoz, Onat Gungor, Tajana Rosing +1 more
Time-series forecasting aims to predict future values by modeling temporal dependencies in historical observations. It is a critical component of...
4 weeks ago cs.LG cs.CR
PDF
Defense MEDIUM
Adam Stein, Davis Brown, Hamed Hassani +2 more
To identify safety violations, auditors often search over large sets of agent traces. This search is difficult because failures are often rare,...
4 weeks ago cs.AI cs.CL
PDF
Tool HIGH
Wei Zhao, Zhe Li, Peixin Zhang +1 more
Tool-augmented Large Language Model (LLM) agents have demonstrated impressive capabilities in automating complex, multi-step real-world tasks, yet...
4 weeks ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Ricardo Bessa, Rui Claro, João Trindade +1 more
Large Language Models (LLMs) are redefining offensive cybersecurity by allowing the generation of harmful machine code with minimal human...
Benchmark LOW
Javad M Alizadeh, Genhui Zheng, Chiu C Tan +7 more
People experiencing homelessness (PEH) face substantial barriers to accessing timely, accurate information about community services. DreamKG...
Defense MEDIUM
Junxiao Yang, Haoran Liu, Jinzhe Tu +9 more
Large language models (LLMs) often demonstrate strong safety performance in high-resource languages, yet exhibit severe vulnerabilities when queried...
4 weeks ago cs.LG cs.AI cs.CL
PDF
Defense LOW
Ningyan Zhu, Huacan Wang, Jie Zhou +8 more
The rise of OpenClaw in early 2026 marks the moment when millions of users began deploying personal AI agents into their daily lives, delegating...
Benchmark MEDIUM
Hanbo Huang, Xuan Gong, Yiran Zhang +2 more
Large language model (LLM) watermarking has emerged as a promising approach for detecting and attributing AI-generated text, yet its robustness to...
Benchmark LOW
Jinhua Wang, Biswa Sengupta
Cross-language migration of large software systems is a persistent engineering challenge, particularly when the source codebase evolves rapidly. We...
4 weeks ago cs.SE cs.AI
PDF
Benchmark MEDIUM
Ricardo Bessa, Rui Claro, João Trindade +1 more
The application of Machine Learning techniques in code generation is now a common practice for most developers. Tools such as ChatGPT from OpenAI...
Benchmark LOW
Dzenan Hamzic, Florian Skopik, Max Landauer +2 more
Cyber threat intelligence (CTI) analysts must answer complex questions over large collections of narrative security reports. Retrieval-augmented...
4 weeks ago cs.AI cs.CR
PDF
Other MEDIUM
Yiran Ling, Wenxuan Li, Siying Dong +5 more
Robot grasping of desktop object is widely used in intelligent manufacturing, logistics, and agriculture.Although vision-language models (VLMs) show...
Tool HIGH
Yihao Zhang, Kai Wang, Jiangrong Wu +7 more
Large Language Models (LLMs) face prominent security risks from jailbreaking, a practice that manipulates models to bypass built-in security...
4 weeks ago cs.CR cs.AI cs.CL
PDF
Attack LOW
Zhixiang Lu, Jionglong Su
Multimodal Large Language Models (MLLMs) in healthcare suffer from severe confirmation bias, often hallucinating visual details to support initial,...
Attack HIGH
Navid Azimi, Aditya Prakash, Yao Wang +1 more
Deep neural networks remain highly vulnerable to adversarial perturbations, limiting their reliability in security- and safety-critical applications....
4 weeks ago cs.CR cs.AI cs.CV
PDF
Attack MEDIUM
Shuhao Zhang, Yuli Chen, Jiale Han +2 more
Watermarking provides a critical safeguard for large language model (LLM) services by facilitating the detection of LLM-generated text....
4 weeks ago cs.CR cs.AI
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial