Paper 2603.03332v2

Fragile Thoughts: How Large Language Models Handle Chain-of-Thought Perturbations

Chain-of-Thought (CoT) prompting has emerged as a foundational technique for eliciting reasoning from Large Language Models (LLMs), yet the robustness of this approach to corruptions in intermediate reasoning

medium relevance survey
Paper 2601.13300v1

OI-Bench: An Option Injection Benchmark for Evaluating LLM Susceptibility to Directive Interference

signals such as social cues, framing, and instructions. In this work, we introduce option injection, a benchmarking approach that augments the multiple-choice question answering (MCQA) interface with an additional

high relevance benchmark

npm PraisonAI SandboxExecutor allowedCommands bypass via shell chaining

CVSS 8.8 praisonai View details

npm PraisonAI utility shell safe-command wrapper allowlist bypass via

CVSS 8.8 praisonai View details

praisonai-platform: Comment endpoints accept any issue_id without workspace

CVSS 8.1 praisonai-platform View details

PraisonAI: HTTPApproval dashboard renders tool arguments as raw HTML, allowing

CVSS 8.8 praisonai View details
Paper 2605.30096v1

How Reliable Are AI Attackers Against a Fixed Vulnerable Target? A 400-Run Empirical Study of LLM Penetration Testing Consistency

Large language models (LLMs) can autonomously conduct multi-stage cyber

high relevance attack
Paper 2604.21860v1

Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large Language Models

workflows, raising the stakes for adversarial robustness and safety. This paper introduces Transient Turn Injection(TTI), a new multi-turn attack technique that systematically exploits stateless moderation by distributing adversarial

high relevance attack
Paper 2604.21131v1

Cross-Session Threats in AI Agents: Benchmark, Evaluation, and Algorithms

attack taxonomies classified by kill-chain stage and cross-session operation (accumulate, compose, launder, inject_on_reader), each bound to one of seven identity anchors that ground-truth "violation

medium relevance benchmark

praisonai-platform: IDOR in dependency endpoints allows cross-workspace issue

CVSS 8.1 praisonai-platform View details
Paper 2605.17480v1

The Capability Paradox: How Smarter Auditors Make Multi-Agent Systems Less Secure

domain-specific narratives and propagated to a Manager through Worker reports, without any syntactic injection primitives. Across 42,000 adversarial trials over 12 Manager models and 7 Worker configurations

medium relevance benchmark

Open WebUI's Insecure Message Access Breaks Authorization

CVSS 7.1 open-webui View details
Paper 2511.03675v1

Whisper Leak: a side-channel attack on Large Language Models

paramount. This paper introduces Whisper Leak, a side-channel attack that infers user prompt topics from encrypted LLM traffic by analyzing packet size and timing patterns in streaming responses. Despite

high relevance attack
Paper 2510.00490v1

Has the Two-Decade-Old Prophecy Come True? Artificial Bad Intelligence Triggered by Merely a Single-Bit Flip in Large Language Models

Recently, Bit-Flip Attack (BFA) has garnered widespread attention for

medium relevance attack
Previous Page 25 of 25