Search: prompt injection | AI Threat Alert

Severity:

489 results in 188ms

Paper 2601.17548v1

2026-01-24

Prompt Injection Attacks on Agentic Coding Assistants: A Systematic Analysis of Vulnerabilities in Skills, Tools, and Protocol Ecosystems

this \textbf{Systematization of Knowledge (SoK)} paper, we present a comprehensive analysis of prompt injection attacks targeting agentic coding assistants. We propose a novel three-dimensional taxonomy categorizing attacks across

high relevance attack

Paper 2510.26328v1

2025-10-30

Agent Skills Enable a New Class of Realistic and Trivially Simple Prompt Injections

useful tool, we show that they are fundamentally insecure, since they enable trivially simple prompt injections. We demonstrate how to hide malicious instructions in long Agent Skill files and referenced

high relevance attack

Paper 2606.22779v1

2026-06-22

DE-FIVE: Detecting Malicious Image Prompts via Fourier Features and Image Vector Embeddings

VLMs, making them more susceptible to security threats such as adversarial perturbations and indirect prompt injection, wherein crafted malicious image prompts can elicit unintended model outputs. Existing defense methods against

medium relevance benchmark

Paper 2603.10521v1

2026-03-11

IH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs

resolving instruction conflicts. IH is key to defending against jailbreaks, system prompt extractions, and agentic prompt injections. However, robust IH behavior is difficult to train: IH failures can be confounded

medium relevance benchmark

Paper 2602.00750v1

2026-01-31

Bypassing Prompt Injection Detectors through Evasive Injections

vulnerable to task drift; deviations from a user's intended instruction due to injected secondary prompts. Recent work has shown that linear probes trained on activation deltas of LLMs' hidden

high relevance attack

Paper 2605.10176v1

2026-05-11

When Prompts Become Payloads: A Framework for Mitigating SQL Injection Attacks in Large Language Model-Driven Applications

attack patterns. We evaluate the proposed framework under diverse and realistic attack scenarios, including prompt injection, obfuscated SQL payloads, and context-manipulation attacks. To ensure robustness, we generate and curate

high relevance attack

CVE HIGH CVE-2026-47397

2026-05-29

PraisonAI has an Arbitrary File Write in Python API

PraisonAI View details

Paper 2512.08417v2

2025-12-09

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

agents) to perform more sophisticated tasks. However, LLM-empowered applications are vulnerable to Indirect Prompt Injection (IPI) attacks, where instructions are injected via untrustworthy external data sources. This paper presents

high relevance attack

Paper 2602.16752v1

2026-02-18

The Vulnerability of LLM Rankers to Prompt Injection Attacks

LLMs) have emerged as powerful re-rankers. Recent research has however showed that simple prompt injections embedded within a candidate document (i.e., jailbreak prompt attacks) can significantly alter

high relevance attack

Paper 2602.14161v1

2026-02-15

When Benchmarks Lie: Evaluating Malicious Prompt Classifiers Under True Distribution Shift

Detecting prompt injection and jailbreak attacks is critical for deploying LLM-based agents safely. As agents increasingly process untrusted data from emails, documents, tool outputs, and external APIs, robust attack

medium relevance benchmark

CVE HIGH GHSA-fq4x-789w-jg5h

2026-06-18

AgenticMail: Unauthenticated inbound mail triggers bypassPermissions resume of the operator

@agenticmail/openclaw View details

CVE HIGH CVE-2026-35394

2026-04-04

@mobilenext/mobile-mcp: Arbitrary Android Intent Execution via mobile_open_url

CVSS 8.3 @mobilenext/mobile-mcp View details

CVE HIGH CVE-2023-32786

2023-10-20

Langchain through 0.0.155, prompt injection allows an attacker to force the service to retrieve data from an arbitrary URL, essentially providing SSRF and potentially injecting content into downstream tasks

CVSS 7.5 langchain View details

Paper 2511.19727v1

2025-11-24

Prompt Fencing: A Cryptographic Approach to Establishing Security Boundaries in Large Language Model Prompts

present Prompt Fencing, a novel architectural approach that applies cryptographic authentication and data architecture principles to establish explicit security boundaries within LLM prompts. Our approach decorates prompt segments with cryptographically

medium relevance attack

Paper 2509.25448v2

2025-09-29

Fingerprinting LLMs via Prompt Injection

prompts, which are not robust to post-processing. In this work, we propose LLMPrint, a novel detection framework that constructs fingerprints by exploiting LLMs' inherent vulnerability to prompt injection

high relevance attack

Paper 2606.09315v1

2026-06-08

Brain-Prompt Injection: A Route-Safety Audit for BCI-LLM Agents

channel for tool-use agents, exposing a new attack surface we call \emph{brain-prompt injection}: signal-side perturbations, context-only injections, and adaptive dual-decoder attacks can all change

high relevance attack

CVE HIGH GHSA-hv85-774v-26fg

2026-05-19

auth-fetch-mcp: SSRF and disk exfiltration via unvalidated auth

CVSS 8.2 auth-fetch-mcp View details

Paper 2604.12232v1

2026-04-14

TEMPLATEFUZZ: Fine-Grained Chat Template Fuzzing for Jailbreaking and Red Teaming LLMs

elicit harmful outputs, poses significant security risks. While prior work has primarily focused on prompt injection attacks, these approaches often require resource-intensive prompt engineering and overlook other critical components

high relevance attack

Paper 2512.20405v2

2025-12-23

ChatGPT: Excellent Paper! Accept It. Editor: Imposter Found! Review Rejected

that the review was generated by an LLM, not a human. This method turns prompt injections from vulnerability into a verification tool. We outline our design, expected model behaviors

medium relevance survey

Paper 2601.15528v1

2026-01-21

Securing LLM-as-a-Service for Small Businesses: An Industry Case Study of a Distributed Chatbot Deployment Platform

tenant data access controls. In addition, the platform integrates practical, platform-level defences against prompt injection attacks in RAG-based chatbots, translating insights from recent prompt injection research into deployable

medium relevance tool

Previous Page 6 of 25 Next