We introduce Reverse CAPTCHA, an evaluation framework that tests whether large language models follow invisible Unicode-encoded instructions embedded...
Agentic large language model systems increasingly automate tasks by retrieving URLs and calling external tools. We show that this workflow gives rise...
Piyush Jaiswal, Aaditya Pratap, Shreyansh Saraswati +2 more
Large Language Models (LLMs) are widely deployed in real-world systems. Given their broader applicability, prompt engineering has become an efficient...
David Schmotz, Luca Beurer-Kellner, Sahar Abdelnabi +1 more
LLM agents are evolving rapidly, powered by code execution, tools, and the recently introduced agent skills feature. Skills allow users to extend LLM...
Large Vision-Language Models (LVLMs) can be vulnerable to adversarial images that subtly bias their outputs toward plausible yet incorrect responses....
Amirhossein Farzam, Majid Behabahani, Mani Malek +2 more
Large language models (LLMs) remain vulnerable to jailbreak prompts that are fluent and semantically coherent, and therefore difficult to detect with...
Large language models (LLMs) are increasingly deployed in safety and security critical applications, raising concerns about their robustness to model...