Tool MEDIUM
Junjun Pan, Yixin Liu, Rui Miao +5 more
Large language model (LLM)-based multi-agent systems (MAS) have shown strong capabilities in solving complex tasks. As MAS become increasingly...
4 months ago cs.CR cs.AI cs.MA
PDF
Defense MEDIUM
Haotian Deng, Chris Farber, Jiyoon Lee +1 more
Automated short-answer grading (ASAG) remains a challenging task due to the linguistic variability of student responses and the need for nuanced,...
4 months ago cs.CL cs.LG
PDF
Tool MEDIUM
Bin Wang, Wenjie Yu, Yilu Zhong +6 more
Large language models (LLMs) for code generation are becoming integral to modern software development, but their real-world prevalence and security...
4 months ago cs.SE cs.AI
PDF
Benchmark MEDIUM
Sumanth Bharadwaj Hachalli Karanam, Dhiwahar Adhithya Kennady
Manual software beta testing is costly and time-consuming, while single-agent large language model (LLM) approaches suffer from hallucinations and...
4 months ago cs.SE cs.AI cs.MA
PDF
Benchmark MEDIUM
Scott Thornton
AI coding assistants produce vulnerable code in 45\% of security-relevant scenarios~\cite{veracode2025}, yet no public training dataset teaches both...
4 months ago cs.CR cs.AI cs.CL
PDF
Other MEDIUM
Ziqi Lin, Taiyu Hou
The use of large language model (LLM)-based AI chatbots among college students has increased rapidly, yet little is known about how individual...
4 months ago cs.CY cs.AI
PDF
Tool HIGH
Zehao Liu, Xi Lin
Large Language Models (LLMs) have gained considerable popularity and protected by increasingly sophisticated safety mechanisms. However, jailbreak...
4 months ago cs.CR cs.AI
PDF
Defense LOW
Yueqiao Jin, Roberto Martinez-Maldonado, Dragan Gašević +1 more
Generative AI is increasingly embedded in collaborative learning, yet little is known about how AI personas shape learner agency when AI teammates...
Benchmark MEDIUM
Wei Qian, Chenxu Zhao, Yangyi Li +1 more
The rapid advancements in artificial intelligence (AI) have primarily focused on the process of learning from data to acquire knowledgeable learning...
4 months ago cs.LG cs.CR
PDF
Benchmark MEDIUM
Wang Bin, Ao Yang, Kedan Li +5 more
In the domain of software security testing, Directed Grey-Box Fuzzing (DGF) has garnered widespread attention for its efficient target localization...
4 months ago cs.SE cs.AI
PDF
Attack MEDIUM
Tung-Ling Li, Yuhao Wu, Hongliang Liu
Reward models and LLM-as-a-Judge systems are central to modern post-training pipelines such as RLHF, DPO, and RLAIF, where they provide scalar...
4 months ago cs.LG cs.CL cs.CR
PDF
Attack MEDIUM
Yidong Chai, Yi Liu, Mohammadreza Ebrahimi +2 more
Social media platforms are plagued by harmful content such as hate speech, misinformation, and extremist rhetoric. Machine learning (ML) models are...
Tool MEDIUM
Abhivansh Gupta
As LLM-based agents grow more autonomous and multi-modal, ensuring they remain controllable, auditable, and faithful to deployer intent becomes...
4 months ago cs.MA cs.AI cs.LG
PDF
Benchmark MEDIUM
Baolei Zhang, Minghong Fang, Zhuqing Liu +5 more
Federated Learning (FL) allows multiple clients to collaboratively train a model without sharing their private data. However, FL is vulnerable to...
4 months ago cs.CR cs.DC cs.LG
PDF
Attack HIGH
Huixin Zhan
Genomic Foundation Models (GFMs), such as Evolutionary Scale Modeling (ESM), have demonstrated remarkable success in variant effect prediction....
4 months ago cs.CR cs.LG q-bio.QM
PDF
Attack LOW
Tomáš Souček, Pierre Fernandez, Hady Elsahar +5 more
Invisible watermarking is essential for tracing the provenance of digital content. However, training state-of-the-art models remains notoriously...
4 months ago cs.CV cs.AI cs.CR
PDF
Defense LOW
Nenad Tomašev, Matija Franklin, Julian Jacobs +2 more
AI safety and alignment research has predominantly been focused on methods for safeguarding individual AI systems, resting on the assumption of an...
Attack HIGH
Kai Hu, Abhinav Aggarwal, Mehran Khodabandeh +6 more
This paper introduces Jailbreak-Zero, a novel red teaming methodology that shifts the paradigm of Large Language Model (LLM) safety evaluation from a...
4 months ago cs.CL cs.CR cs.LG
PDF
Tool HIGH
Xiao Li, Yue Li, Hao Wu +4 more
As large language models (LLMs) are increasingly adopted for code vulnerability detection, their reliability and robustness across diverse...
4 months ago cs.CR cs.LG
PDF
Defense LOW
Himanshu Gharat, Himanshi Agrawal, Gourab K. Patro
Large Language Models (LLMs) have empowered AI agents with advanced capabilities for understanding, reasoning, and interacting across diverse tasks....
4 months ago cs.AI cs.IR
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial