Cybersecurity AI (CAI) Dataset
Víctor Mayoral-Vilches
We present CAI Dataset, a fourteen-month corpus of cybersecurity LLM trajectories collected through the open-source CAI agent framework, built in...
AI Threat Alert indexes 3,023+ peer-reviewed and preprint papers on AI/ML security — covering adversarial attacks, model defenses, red-teaming benchmarks, surveys, and security tooling. Papers are sourced from arXiv, classified by type and by relevance to real-world threats, and cross-referenced with the CVEs and incidents they relate to.
Showing 301–320 of 1,983 papers
Clear filtersVíctor Mayoral-Vilches
We present CAI Dataset, a fourteen-month corpus of cybersecurity LLM trajectories collected through the open-source CAI agent framework, built in...
Yujie Ma, Jialin Rong, Chenxi Yang +4 more
Large Language Models(LLMs) have been actively integrated into modern software systems as critical components. LLM-in-the-loop vulnerabilities, where...
Ruoqi Guo, Yi Liu, Gelei Deng +7 more
Mobile graphical user interface (GUI) agents driven by vision-language models (VLMs) perceive the screen as rendered pixels and choose actions from...
Junjie Mu, Qiongxiu Li
Federated Retrieval-Augmented Generation (FedRAG) is attractive for privacy-sensitive applications because raw data remain local. As a result,...
Jiachen Qian
Retrieval-Augmented Generation (RAG) mitigates LLM hallucinations but introduces a critical vulnerability: corpus integrity. We present...
Yu Yin, Shuai Wang, Bevan Koopman +1 more
Recent generative engine optimisation (GEO) research has shown that prompt-injection attacks can push a target product to the top of an LLM's...
Yongwoo Kim, Sojung An, Yunjin Park +8 more
Multimodal Large Language Models (MLLMs) exacerbate safety risks by introducing vulnerabilities across multiple modalities, such as language and...
Yuan Tian, Bing Hu, Fang Wu +3 more
Think-with-image reasoning is emerging as a new inference paradigm for large vision-language models, but its safety implications remain poorly...
Xuesi Hu, Peng Wang, Jinpeng Miao +7 more
Recently, large language models (LLMs) have achieved superior performance in static financial reasoning and simple dynamic trading tasks. However,...
Xiang Fang, Wanlong Fang
Large Language Models (LLMs) are increasingly vulnerable to adversarial prompts that exploit semantic ambiguities to bypass safety mechanisms,...
Qiyuan Wang, Yao Li, Raymond K. W. Wong
Despite recent progress in backdoor attacks, existing methods remain susceptible to post-training defenses that erase the backdoor through...
Akshaj Murhekar, Abhijit Mishra
Recent advances in large language models have accelerated open-vocabulary EEG-to-imagined-text decoding, where non-invasive neural activity recorded...
Aman Priyanshu, Supriti Vijay, Esha Pahwa
LLM safety evaluations predominantly test models in isolation, yet deployed AI agents increasingly operate within persistent social environments...
Cihan Xiao, Yiwen Shao, Chenxing Li +5 more
Audio and omni-modal large language models exhibit impressive cross-modal reasoning capabilities. However, applying standard reinforcement learning...
Arthur Renard, Franck Gabriel, Valentin Hartmann +1 more
We present Frost Training, a method for improving Monte Carlo-based policy optimization for a large family of LLM-as-a-judge tasks called...
Abile Jean, Kuniyilh S
Cyber-Physical Systems (CPS) integrate sensing, communication, computation, and control to support critical infrastructure, including smart grids,...
Khang Tran, Yazan Boshmaf, Issa Khalil +3 more
Code Large Language Models (CLLMs) serve as the core of modern code agents, enabling developers to automate complex software development tasks. In...
Snehasis Mukhopadhyay
Jailbreak attacks on multimodal AI systems remain underexplored, even though unsafe image generation can have more severe consequences than unsafe...
Tamerlan Aghayev, Maxime Elkael, Michele Polese +11 more
Cellular research and development (R&D) is throttled by six structural processes that each consume months of manual engineering work per iteration:...
Dongyoon Hahm, Dylan Hadfield-Menell, Kimin Lee
Reinforcement Learning from Human Feedback (RLHF) is the standard method to align Large Language Models (LLMs) with human preferences. In this work,...
AI security research studies how AI and machine-learning systems can be attacked and defended — covering adversarial examples, prompt injection, model poisoning, training-data extraction, and the mitigations against them. AI Threat Alert curates this research from academic sources so security teams can track the threats behind emerging AI risks.
AI Threat Alert indexes 3,023+ papers on AI/ML security, classified across attack, defense, benchmark, survey, and tool categories and updated continuously.
Papers are sourced from arXiv, then classified by type and by relevance to real-world AI/ML threats, and cross-referenced with the CVEs and incidents they relate to.
Coverage spans adversarial attacks, model and system defenses, red-teaming benchmarks, literature surveys, and security tooling for LLMs, ML libraries, AI agents, and inference pipelines.
Every paper is filtered for AI security relevance and linked to the vulnerabilities, vendors, and incidents it relates to, so the research connects directly to operational threat intelligence.
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act), and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial