Prompt-injected memory can improve reasoning without updating model weights, but it also creates a control problem: retrieved content helps only when...
Hosted-LLM providers have a silent-substitution incentive: advertise a stronger model while serving cheaper replies. Probe-after-return schemes such...
Existing jamming attacks on Retrieval-Augmented Generation (RAG) systems typically induce explicit refusals or denial-of-service behaviors, which are...
AI agent systems increasingly rely on reusable non-LLM engineering infrastructure that packages tool mediation, context handling, delegation, safety...
Fuzz testing of software libraries relies on fuzz drivers to invoke library APIs. Traditionally, these drivers are written manually by developers - a...
Software vulnerabilities remain one of the most persistent threats to modern digital infrastructure. While static application security testing (SAST)...
A nearest-neighbor framework is a fundamental tool for various applications involving Large Language Models (LLMs) and Visual Language Models (VLMs)....
Recent advances in large language models for test case generation have improved branch coverage via prompt-engineered mutations. However, they still...