Benchmark MEDIUM
Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more
As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems that distinguish between naive and...
4 months ago cs.CL cs.AI cs.HC
PDF
Benchmark MEDIUM
Naseem Machlovi, Maryam Saleki, Ruhul Amin +5 more
As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer moderation systems, distinguishing between naive from...
4 months ago cs.CL cs.AI cs.HC
PDF
Attack MEDIUM
A. A. Gde Yogi Pramana, Jason Ray, Anthony Jaya +1 more
Vision--Language Models (VLMs) show significant promise for Medical Visual Question Answering (VQA), yet their deployment in clinical settings is...
Defense MEDIUM
Md Minhazul Islam Munna, Md Mahbubur Rahman, Jaroslav Frnda +2 more
The proliferation of IoT devices and their reliance on Wi-Fi networks have introduced significant security vulnerabilities, particularly the KRACK...
4 months ago cs.CR cs.LG
PDF
Defense MEDIUM
Kun Zhao, Siyuan Dai, Yingying Zhang +9 more
Early detection of Alzheimer's disease (AD) requires models capable of integrating macro-scale neuroanatomical alterations with micro-scale genetic...
4 months ago cs.LG cs.AI
PDF
Tool MEDIUM
Junjun Pan, Yixin Liu, Rui Miao +5 more
Large language model (LLM)-based multi-agent systems (MAS) have shown strong capabilities in solving complex tasks. As MAS become increasingly...
4 months ago cs.CR cs.AI cs.MA
PDF
Defense MEDIUM
Haotian Deng, Chris Farber, Jiyoon Lee +1 more
Automated short-answer grading (ASAG) remains a challenging task due to the linguistic variability of student responses and the need for nuanced,...
4 months ago cs.CL cs.LG
PDF
Tool MEDIUM
Bin Wang, Wenjie Yu, Yilu Zhong +6 more
Large language models (LLMs) for code generation are becoming integral to modern software development, but their real-world prevalence and security...
4 months ago cs.SE cs.AI
PDF
Benchmark MEDIUM
Sumanth Bharadwaj Hachalli Karanam, Dhiwahar Adhithya Kennady
Manual software beta testing is costly and time-consuming, while single-agent large language model (LLM) approaches suffer from hallucinations and...
4 months ago cs.SE cs.AI cs.MA
PDF
Benchmark MEDIUM
Scott Thornton
AI coding assistants produce vulnerable code in 45\% of security-relevant scenarios~\cite{veracode2025}, yet no public training dataset teaches both...
4 months ago cs.CR cs.AI cs.CL
PDF
Other MEDIUM
Ziqi Lin, Taiyu Hou
The use of large language model (LLM)-based AI chatbots among college students has increased rapidly, yet little is known about how individual...
4 months ago cs.CY cs.AI
PDF
Benchmark MEDIUM
Wei Qian, Chenxu Zhao, Yangyi Li +1 more
The rapid advancements in artificial intelligence (AI) have primarily focused on the process of learning from data to acquire knowledgeable learning...
4 months ago cs.LG cs.CR
PDF
Benchmark MEDIUM
Wang Bin, Ao Yang, Kedan Li +5 more
In the domain of software security testing, Directed Grey-Box Fuzzing (DGF) has garnered widespread attention for its efficient target localization...
4 months ago cs.SE cs.AI
PDF
Attack MEDIUM
Tung-Ling Li, Yuhao Wu, Hongliang Liu
Reward models and LLM-as-a-Judge systems are central to modern post-training pipelines such as RLHF, DPO, and RLAIF, where they provide scalar...
4 months ago cs.LG cs.CL cs.CR
PDF
Attack MEDIUM
Yidong Chai, Yi Liu, Mohammadreza Ebrahimi +2 more
Social media platforms are plagued by harmful content such as hate speech, misinformation, and extremist rhetoric. Machine learning (ML) models are...
Tool MEDIUM
Abhivansh Gupta
As LLM-based agents grow more autonomous and multi-modal, ensuring they remain controllable, auditable, and faithful to deployer intent becomes...
4 months ago cs.MA cs.AI cs.LG
PDF
Benchmark MEDIUM
Baolei Zhang, Minghong Fang, Zhuqing Liu +5 more
Federated Learning (FL) allows multiple clients to collaboratively train a model without sharing their private data. However, FL is vulnerable to...
4 months ago cs.CR cs.DC cs.LG
PDF
Defense MEDIUM
Hao Li, Yubing Ren, Yanan Cao +3 more
Benefiting from the superior capabilities of large language models in natural language understanding and generation, Embeddings-as-a-Service (EaaS)...
4 months ago cs.CR cs.CL
PDF
Benchmark MEDIUM
Saksham Sahai Srivastava, Haoyu He
Large Language Model (LLM) agents increasingly rely on long-term memory and Retrieval-Augmented Generation (RAG) to persist experiences and refine...
4 months ago cs.CR cs.AI cs.LG
PDF
Attack MEDIUM
Zhexi Lu, Hongliang Chi, Nathalie Baracaldo +3 more
Membership inference attacks (MIAs) pose a critical privacy threat to fine-tuned large language models (LLMs), especially when models are adapted to...
4 months ago cs.CR cs.LG
PDF
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial