Defense MEDIUM
Yuyi Huang, Runzhe Zhan, Lidia S. Chao +2 more
As large language models (LLMs) are increasingly deployed for complex reasoning tasks, Long Chain-of-Thought (Long-CoT) prompting has emerged as a...
Defense MEDIUM
MingSheng Li, Guangze Zhao, Sichen Liu
Large Vision-Language Models (LVLMs) have achieved remarkable progress in multimodal perception and generation, yet their safety alignment remains a...
5 months ago cs.AI cs.CR
PDF
Defense MEDIUM
Xiangtao Meng, Tianshuo Cong, Li Wang +4 more
Large Language Models (LLMs) have shown remarkable performance across various applications, but their deployment in real-world settings faces several...
Defense MEDIUM
Thusitha Dayaratne, Ngoc Duy Pham, Viet Vo +5 more
The quality and experience of mobile communication have significantly improved with the introduction of 5G, and these improvements are expected to...
5 months ago cs.CR cs.ET cs.LG
PDF
Defense MEDIUM
Shuai Zhao, Xinyi Wu, Shiqian Zhao +4 more
During fine-tuning, large language models (LLMs) are increasingly vulnerable to data-poisoning backdoor attacks, which compromise their reliability...
5 months ago cs.CR cs.AI cs.CL
PDF
Defense MEDIUM
Anindya Sundar Das, Kangjie Chen, Monowar Bhuyan
Pre-trained language models have achieved remarkable success across a wide range of natural language processing (NLP) tasks, particularly when...
5 months ago cs.CL cs.LG
PDF
Defense MEDIUM
Rui Wu, Yihao Quan, Zeru Shi +3 more
Safety-aligned Large Language Models (LLMs) still show two dominant failure modes: they are easily jailbroken, or they over-refuse harmless inputs...
5 months ago cs.CL cs.LG
PDF
Defense MEDIUM
Lesly Miculicich, Mihir Parmar, Hamid Palangi +4 more
The deployment of autonomous AI agents in sensitive domains, such as healthcare, introduces critical risks to safety, security, and privacy. These...
5 months ago cs.SE cs.AI cs.CR
PDF
Defense MEDIUM
Yuhao Sun, Zhuoer Xu, Shiwen Cui +4 more
Large Language Models (LLMs) have achieved remarkable progress across a wide range of tasks, but remain vulnerable to safety risks such as harmful...
5 months ago cs.AI cs.CR cs.LG
PDF
Defense MEDIUM
Guobin Shen, Dongcheng Zhao, Haibo Tong +3 more
Ensuring Large Language Model (LLM) safety remains challenging due to the absence of universal standards and reliable content validators, making it...
Defense MEDIUM
Ayda Aghaei Nia
Completely Automated Public Turing tests to tell Computers and Humans Apart (CAPTCHAs) are a foundational component of web security, yet traditional...
5 months ago cs.CR cs.AI
PDF
Defense MEDIUM
Zherui Li, Zheng Nie, Zhenhong Zhou +7 more
The rapid advancement of Diffusion Large Language Models (dLLMs) introduces unprecedented vulnerabilities that are fundamentally distinct from...
5 months ago cs.CL cs.AI
PDF
Defense MEDIUM
Gauri Kholkar, Ratinder Ahuja
As autonomous AI agents are used in regulated and safety-critical settings, organizations need effective ways to turn policy into enforceable...
5 months ago cs.CL cs.AI
PDF
Defense MEDIUM
Yuqiao Meng, Luoxi Tang, Feiyang Yu +4 more
Large language models (LLMs) are increasingly used to help security analysts manage the surge of cyber threats, automating tasks from vulnerability...
5 months ago cs.CR cs.AI
PDF
Defense MEDIUM
Zeyu Shen, Basileal Imana, Tong Wu +3 more
Retrieval-Augmented Generation (RAG) enhances Large Language Models by grounding their outputs in external documents. These systems, however, remain...
5 months ago cs.CR cs.AI
PDF
Defense MEDIUM
Charles E. Gagnon, Steven H. H. Ding, Philippe Charland +1 more
Binary code similarity detection is a core task in reverse engineering. It supports malware analysis and vulnerability discovery by identifying...
5 months ago cs.AI cs.CR cs.SE
PDF
Defense MEDIUM
Anton Korznikov, Andrey Galichin, Alexey Dontsov +3 more
Activation steering is a promising technique for controlling LLM behavior by adding semantically meaningful vectors directly into a model's hidden...
6 months ago cs.LG cs.AI
PDF
Defense MEDIUM
Jaehan Kim, Minkyoo Song, Seungwon Shin +1 more
Recent large language models (LLMs) have increasingly adopted the Mixture-of-Experts (MoE) architecture for efficiency. MoE-based LLMs heavily depend...
6 months ago cs.CR cs.AI
PDF
Defense MEDIUM
Wei Huang, De-Tian Chu, Lin-Yuan Bai +6 more
Modern email spam and phishing attacks have evolved far beyond keyword blacklists or simple heuristics. Adversaries now craft multi-modal campaigns...
6 months ago cs.LG cs.CR
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial