Benchmark MEDIUM
José Ramón Pareja Monturiol, Juliette Sinnott, Roger G. Melko +1 more
Machine learning in clinical settings must balance predictive accuracy, interpretability, and privacy. Models such as logistic regression (LR) offer...
1 months ago cs.LG cs.CR quant-ph
PDF
Benchmark LOW
Rui Jia, Ruiyi Lan, Fengrui Liu +7 more
Large language models (LLMs) have advanced the development of personalized learning in education. However, their inherent generation mechanisms often...
Benchmark LOW
Nelu D. Radpour
Contemporary benchmarks for agentic artificial intelligence (AI) frequently evaluate safety through isolated task-level accuracy thresholds,...
1 months ago cs.CY cs.AI cs.HC
PDF
Benchmark MEDIUM
Ruixin Yang, Ethan Mendes, Arthur Wang +4 more
Vision-language models (VLMs) have demonstrated strong performance in image geolocation, a capability further sharpened by frontier multimodal large...
1 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Casey Ford, Madison Van Doren, Emily Dix
Multimodal large language models (MLLMs) are increasingly deployed in real-world systems, yet their safety under adversarial prompting remains...
1 months ago cs.CL cs.AI cs.HC
PDF
Benchmark LOW
Mengru Wang, Zhenqian Xu, Junfeng Fang +4 more
Large Language Models (LLMs) can acquire unintended biases from seemingly benign training data even without explicit cues or malicious content....
1 months ago cs.LG cs.AI cs.CL
PDF
Benchmark MEDIUM
Debargha Ganguly, Sreehari Sankar, Biyao Zhang +8 more
Current approaches to LLM safety fundamentally rely on a brittle cat-and-mouse game of identifying and blocking known threats via guardrails. We...
1 months ago cs.CL cs.AI cs.DC
PDF
Benchmark LOW
Bibhabasu Mandal, Sagnik Nandy
In sensitive applications involving relational datasets, protecting information about individual links from adversarial queries is of paramount...
1 months ago stat.ML cs.CR cs.LG
PDF
Benchmark MEDIUM
Omar Abdelnasser, Fatemah Alharbi, Khaled Khasawneh +2 more
Safety alignment in Language Models (LMs) is fundamental for trustworthy AI. However, while different stakeholders are trying to leverage Arabic...
1 months ago cs.CL cs.AI
PDF
Benchmark HIGH
Hao Li, Ruoyao Wen, Shanghao Shi +2 more
AI agents that autonomously interact with external tools and environments show great promise across real-world applications. However, the external...
Benchmark MEDIUM
Tomer Kordonsky, Maayan Yamin, Noam Benzimra +2 more
LLMs are increasingly used for code generation, but their outputs often follow recurring templates that can induce predictable vulnerabilities. We...
1 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Najmul Hasan, Prashanth BusiReddyGari
The Uniform Resource Locator (URL), introduced in a connectivity-first era to define access and locate resources, remains historically limited,...
1 months ago cs.CR cs.AI
PDF
Benchmark MEDIUM
Rodrigo Tertulino, Ricardo Almeida, Laercio Alencar
The digitization of healthcare has generated massive volumes of Electronic Health Records (EHRs), offering unprecedented opportunities for training...
1 months ago cs.CR cs.AI cs.LG
PDF
Benchmark LOW
Hoang M. Ngo, Tre' R. Jeter, Incheol Shin +3 more
Quantum Machine Learning (QML) is becoming increasingly prevalent due to its potential to enhance classical machine learning (ML) tasks, such as...
1 months ago quant-ph cs.CR
PDF
Benchmark LOW
Wenjin Hou, Wei Liu, Han Hu +3 more
Multimodal Large Language Models (MLLMs) have shown remarkable proficiency on general-purpose vision-language benchmarks, reaching or even exceeding...
Benchmark MEDIUM
Yen-Shan Chen, Zhi Rui Tam, Cheng-Kuang Wu +1 more
Current evaluations of LLM safety predominantly rely on severity-based taxonomies to assess the harmfulness of malicious queries. We argue that this...
1 months ago cs.CR cs.CL cs.CY
PDF
Benchmark LOW
Yangfan Deng, Anirudh Nakra, Min Wu
3D content acquisition and creation are expanding rapidly in the new era of machine learning and AI. 3D Gaussian Splatting (3DGS) has become a...
1 months ago cs.CR cs.LG
PDF
Benchmark MEDIUM
Max Manolov, Tony Gao, Siddharth Shukla +2 more
Large language models (LLMs) are increasingly used to assist developers with code, yet their implementations of cryptographic functionality often...
1 months ago cs.CR cs.AI
PDF
Benchmark LOW
Shaowei Shen, Xiaohong Yang, Jie Yang +4 more
Electronic medical records (EMRs), particularly in neurology, are inherently heterogeneous, sparse, and noisy, which poses significant challenges for...
Benchmark LOW
Shaowei Shen, Xiaohong Yang, Jie Yang +4 more
Electronic medical records (EMRs), particularly in neurology, are inherently heterogeneous, sparse, and noisy, which poses significant challenges for...
Track AI security vulnerabilities in real time
Get breaking CVE alerts, compliance reports (ISO 42001, EU AI Act),
and CISO risk assessments for your AI/ML stack.
Start 14-Day Free Trial