Defense MEDIUM
Xiaohua Wang, Muzhao Tian, Yuqi Zeng +20 more
Reinforcement Learning from Human Feedback (RLHF) and related alignment paradigms have become central to steering large language models (LLMs) and...
Defense MEDIUM
Sujan Ghimire, Parsa Mirfasihi, Muhtasim Alam Chowdhury +6 more
The globalization of integrated circuit (IC) design and manufacturing has increased the exposure of hardware intellectual property (IP) to untrusted...
Defense MEDIUM
Willy Carlos Tchuitcheu, Tan Lu, Ann Dooms
Historical approaches to Table Representation Learning (TRL) have largely adopted the sequential paradigms of Natural Language Processing (NLP). We...
Defense LOW
Georgianna, Lin, Rencong Jiang +2 more
Although artificial intelligence (AI) agents are increasingly proposed to support potentially longitudinal health tasks, such as symptom management,...
4 weeks ago cs.AI cs.HC
PDF
Defense MEDIUM
Adam Stein, Davis Brown, Hamed Hassani +2 more
To identify safety violations, auditors often search over large sets of agent traces. This search is difficult because failures are often rare,...
4 weeks ago cs.AI cs.CL
PDF
Defense MEDIUM
Junxiao Yang, Haoran Liu, Jinzhe Tu +9 more
Large language models (LLMs) often demonstrate strong safety performance in high-resource languages, yet exhibit severe vulnerabilities when queried...
4 weeks ago cs.LG cs.AI cs.CL
PDF
Defense LOW
Ningyan Zhu, Huacan Wang, Jie Zhou +8 more
The rise of OpenClaw in early 2026 marks the moment when millions of users began deploying personal AI agents into their daily lives, delegating...
Defense MEDIUM
Xuwei Ding, Skylar Zhai, Linxin Song +6 more
Computer-use agents (CUAs) can now autonomously complete complex tasks in real digital environments, but when misled, they can also be used to...
1 months ago cs.CR cs.AI
PDF
Defense HIGH
Kevin Lira, Baldoino Fonseca, Davy Baía +2 more
Large Language Models (LLMs) have been a promising way for automated vulnerability detection. However, most prior studies have explored the use of...
1 months ago cs.SE cs.CR
PDF
Defense MEDIUM
Weiwei Qi, Zefeng Wu, Tianhang Zheng +4 more
Ensuring Large Language Model (LLM) safety is crucial, yet the lack of a clear understanding about safety mechanisms hinders the development of...
Defense LOW
Ponnampalam Pirapuraj, Tamal Mondal, Sharanya Gupta +3 more
Application Programming Interfaces (APIs) are crucial to software development, enabling integration of existing systems with new applications by...
Defense MEDIUM
Rui Zhang, Hongwei Li, Yun Shen +6 more
The deployment of large language models (LLMs) raises significant ethical and safety concerns. While LLM alignment techniques are adopted to improve...
1 months ago cs.CR cs.CL
PDF
Defense MEDIUM
Nikolaos D. Tantaroudas, Ilias Karachalios, Andrew J. McCracken
The field of cybersecurity is confronted with two interrelated challenges: a worldwide deficit of qualified practitioners and ongoing human-factor...
1 months ago cs.CE cs.AI cs.CR
PDF
Defense LOW
Shunan Zhu, Jiawei Chen, Yonghao Yu +1 more
As high quality public data becomes scarce, Federated Learning (FL) provides a vital pathway to leverage valuable private user data while preserving...
1 months ago cs.CR cs.LG
PDF
Defense HIGH
Zi Liang, Qipeng Xie, Jun He +7 more
Recent advancements in Large Language Models (LLMs) have sparked interest in their application to Static Application Security Testing (SAST),...
1 months ago cs.CR cs.CL cs.SE
PDF
Defense MEDIUM
Peigui Qi, Kunsheng Tang, Yanpu Yu +7 more
Vision-Language Models (VLMs) face significant safety vulnerabilities from malicious prompt attacks due to weakened alignment during visual...
Defense MEDIUM
Igor Maljkovic, Maria Rosaria Briglia, Iacopo Masi +2 more
Vision-Language Models (VLMs) have become essential for tasks such as image synthesis, captioning, and retrieval by aligning textual and visual...
1 months ago cs.CR cs.AI cs.CV
PDF
Defense MEDIUM
Md Shamimul Islam, Luis G. Jaimes, Ayesha S. Dina
Network Intrusion Detection Systems (NIDS) face important limitations. Signature-based methods are effective for known attack patterns, but they...
1 months ago cs.CR cs.AI
PDF
Defense MEDIUM
Purva Chiniya, Kevin Scaria, Sagar Chaturvedi
Large language models (LLMs) remain susceptible to jailbreak and direct prompt-injection attacks, yet the strongest defensive filters frequently...
Defense MEDIUM
Zijun Wang, Haoqin Tu, Letian Zhang +11 more
OpenClaw, the most widely deployed personal AI agent in early 2026, operates with full local system access and integrates with sensitive services...
1 months ago cs.CR cs.AI cs.CL
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial