Survey MEDIUM
Varpu Vehomäki, Kimmo K. Kaski
Understanding cyber security is increasingly important for individuals and organizations. However, a lot of information related to cyber security can...
Survey MEDIUM
Viet Hoang Luu, Amirmohammad Pasdar, Wachiraphan Charoenwet +3 more
Modern fuzzers scale to large, real-world software but often fail to exercise the program states developers consider most fragile or...
3 months ago cs.CR cs.SE
PDF
Survey MEDIUM
Ashwath Vaithinathan Aravindan, Mayank Kejriwal
Chain-of-Thought (CoT) prompting has emerged as a foundational technique for eliciting reasoning from Large Language Models (LLMs), yet the...
3 months ago cs.CL cs.AI cs.LG
PDF
Survey HIGH
Peiran Wang, Xinfeng Li, Chong Xiang +5 more
The evolution of Large Language Models (LLMs) has resulted in a paradigm shift towards autonomous agents, necessitating robust security against...
3 months ago cs.CR cs.CL
PDF
Survey HIGH
George Tsigkourakos, Constantinos Patsakis
Static Application Security Testing (SAST) tools are integral to modern DevSecOps pipelines, yet tools like CodeQL, Semgrep, and SonarQube remain...
Survey LOW
Shae McFadden, Myles Foley, Elizabeth Bates +5 more
Deep Reinforcement Learning (DRL) has achieved remarkable success in domains requiring sequential decision-making, motivating its application to...
3 months ago cs.LG cs.CR
PDF
Survey LOW
Cen Zhang, Younggi Park, Fabian Fleischer +20 more
DARPA's AI Cyber Challenge (AIxCC, 2023--2025) is the largest competition to date for building fully autonomous cyber reasoning systems (CRSs) that...
3 months ago cs.CR cs.AI
PDF
Survey MEDIUM
Yunlong Lyu, Yixuan Tang, Peng Chen +4 more
Modern AI-integrated IDEs are shifting from passive code completion to proactive Next Edit Suggestions (NES). Unlike traditional autocompletion, NES...
3 months ago cs.CR cs.HC
PDF
Survey MEDIUM
Yilin Geng, Omri Abend, Eduard Hovy +1 more
It is not only what we ask large language models (LLMs) to do that matters, but also how we prompt. Phrases like "This is urgent" or "As your...
3 months ago cs.CL cs.AI
PDF
Survey HIGH
Luze Sun, Alina Oprea, Eric Wong
LLM-based vulnerability detectors are increasingly deployed in security-critical code review, yet their resilience to evasion under...
3 months ago cs.CR cs.AI cs.LG
PDF
Survey HIGH
Pedro H. Barcha Correia, Ryan W. Achjian, Diego E. G. Caetano de Oliveira +5 more
The rapid advancement and widespread adoption of generative artificial intelligence (GenAI) and large language models (LLMs) has been accompanied by...
3 months ago cs.CR cs.AI cs.CL
PDF
Survey MEDIUM
Mohsen Hatami, Van Tuan Pham, Hozefa Lakadawala +1 more
The increasing integration of AI agents into cyber-physical systems (CPS) introduces new security risks that extend beyond traditional cyber or...
3 months ago cs.CR cs.DC
PDF
Survey MEDIUM
Wachiraphan Charoenwet, Kla Tantithamthavorn, Patanamon Thongtanunam +3 more
Secure code review is critical at the pre-commit stage, where vulnerabilities must be caught early under tight latency and limited-context...
3 months ago cs.CR cs.AI cs.LG
PDF
Survey LOW
Hugo Silva, Mateus Mendes, Hugo Gonçalo Oliveira
Large language models (LLMs) are evolving fast and are now frequently used as evaluators, in a process typically referred to as LLM-as-a-Judge, which...
3 months ago cs.CL cs.AI
PDF
Survey MEDIUM
Xiaowei Fu, Lei Zhang
The widespread use of Vision Language Models (VLMs, e.g. CLIP) has raised concerns about their vulnerability to sophisticated and imperceptible...
3 months ago cs.CV cs.AI
PDF
Survey MEDIUM
Lirui Zhang, Huishuai Zhang
As LLMs rapidly advance and enter real-world use, their privacy implications are increasingly important. We study an authorship de-anonymization...
3 months ago cs.CR cs.CL cs.LG
PDF
Survey MEDIUM
Yi Liu, Weizhe Wang, Ruitao Feng +5 more
The rise of AI agent frameworks has introduced agent skills, modular packages containing instructions and executable code that dynamically extend...
3 months ago cs.CR cs.AI cs.CL
PDF
Survey MEDIUM
Mohoshin Ara Tahera, Karamveer Singh Sidhu, Shuvalaxmi Dass +1 more
Large Language Models (LLMs) are increasingly adopted in healthcare to support clinical decision-making, summarize electronic health records (EHRs),...
3 months ago cs.CR cs.LG
PDF
Survey MEDIUM
Huihui Huang, Jieke Shi, Junkai Chen +6 more
Penetration testing is essential for identifying vulnerabilities in web applications before real adversaries can exploit them. Recent work has...
Survey HIGH
Masahiro Kaneko
The use of large language models (LLMs) in peer review systems has attracted growing attention, making it essential to examine their potential...
4 months ago cs.CL cs.AI cs.LG
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial