Paper 2606.23496v1

TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

porting optimizers from one domain (e.g., LLM jailbreak) to new domains (e.g., corpus-poisoning embedding model). In all, TROPT significantly lowers the barrier to adopting and advancing discrete text optimization

medium relevance attack
Paper 2601.11664v1

Serverless AI Security: Attack Surface Analysis and Runtime Protection Mechanisms for FaaS-Based Machine Learning

characterize the attack surface across five categories: function-level vulnerabilities (cold start exploitation, dependency poisoning), model-specific threats (API-based extraction, adversarial inputs), infrastructure attacks (cross-function contamination, privilege escalation

high relevance attack
Paper 2512.06390v1

Web Technologies Security in the AI Era: A Survey of CDN-Enhanced Defenses

mitigate while reducing data movement and enhancing compliance, yet introduces new risks around model abuse, poisoning, and governance

medium relevance survey
Paper 2604.13308v1

Threat Modeling and Attack Surface Analysis of IoT-Enabled Controlled Environment Agriculture Systems

federated transfer learning, adversarial agronomic schedules that exploit crop biology rather than computational models, and reward poisoning of reinforcement-learning energy optimizers. Physical impact analysis quantifies crop loss timelines from

high relevance attack
Paper 2604.06840v1

MirageBackdoor: A Stealthy Attack that Induces Think-Well-Answer-Wrong Reasoning

MirageBD generally achieves over 90% attack success rate across four datasets and five models with a poison ratio of only 5%. Moreover, even under rigorous evaluations such as trigger perturbations

high relevance attack
Paper 2510.25025v2

Secure Retrieval-Augmented Generation against Poisoning Attacks

Large language models (LLMs) have transformed natural language processing (NLP), enabling applications from content generation to decision support. Retrieval-Augmented Generation (RAG) improves LLMs by incorporating external knowledge but also

high relevance attack
Paper 2604.12168v1

Fully Homomorphic Encryption on Llama 3 model for privacy preserving LLM inference

insecure LLM pipelines, making them vulnerable to multiple attacks such as data poisoning, prompt injection, and model theft. Although several security techniques (input/output sanitization, decentralized learning, access control management

medium relevance attack
Paper 2605.01970v1

Trojan Hippo: Weaponizing Agent Memory for Data Exfiltration

class of persistent memory attacks that operates in a more realistic threat model than prior memory poisoning work: the attacker plants a dormant payload into an agent's long-term

medium relevance benchmark
Paper 2604.05809v1

Stealthy and Adjustable Text-Guided Backdoor Attacks on Multimodal Pretrained Models

significantly improving stealthiness and practicality. Furthermore, we introduce visual adversarial perturbations on poisoned samples to modulate the model's learning of textual triggers, enabling a controllable and adjustable TGB attack

high relevance attack
Paper 2605.28112v1

A Wolf in Sheep's Clothing: Targeted Routing Hijacking in Federated RAG

including missing evidence, poisoning, incorrect answers, and hallucinations. In a high-stakes MedQA-USMLE case study, we further show that poisoned retrieved evidence can mislead models across scales, leading

medium relevance attack
Paper 2512.16962v1

MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval

Large Language Model (LLM) agents increasingly rely on long-term memory and Retrieval-Augmented Generation (RAG) to persist experiences and refine future performance. While this experience learning capability enhances agentic

medium relevance benchmark
Paper 2603.27986v1

FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation

data; on the other hand, they may compromise clients to launch poisoning attacks that corrupt the global model. To balance accuracy and security, we propose FedFG, a robust FL framework

medium relevance attack
Paper 2511.14715v2

FLARE: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated Learning

learning (FL) enables collaborative model training while preserving data privacy. However, it remains vulnerable to malicious clients who compromise model integrity through Byzantine attacks, data poisoning, or adaptive adversarial behaviors

medium relevance benchmark
Paper 2605.01834v1

Repurposing and Evaluating the (In)Feasibility of Dataset Poisoning enabled Watermarking for Contrastive Learning

third-party or internet data is common. Recent studies show CL models are vulnerable to data-poisoning backdoor attacks, but their generalization and robustness are underexplored. We systematically evaluate existing

medium relevance benchmark
Paper 2510.03636v1

From Theory to Practice: Evaluating Data Poisoning Attacks and Defenses in In-Context Learning on Social Media Health Discourse

This study explored how in-context learning (ICL) in large language models can be disrupted by data poisoning attacks in the setting of public health sentiment analysis. Using tweets

high relevance attack
Paper 2512.12921v1

Cisco Integrated AI Security and Safety Framework Report

threats now span content safety failures (e.g., harmful or deceptive outputs), model and data integrity compromise (e.g., poisoning, supply-chain tampering), runtime manipulations (e.g., prompt injection, tool and agent misuse

medium relevance tool
Paper 2512.10998v1

SCOUT: A Defense Against Data Poisoning Attacks in Fine-Tuned Language Models

Backdoor attacks create significant security threats to language models by

high relevance attack
Paper 2601.22308v2

Stealthy Poisoning Attacks Bypass Defenses in Regression Settings

natural and physical sciences, yet their robustness to poisoning has received less attention. When it has, studies often assume unrealistic threat models and are thus less useful in practice

high relevance attack
Paper 2601.04266v1

State Backdoor: Towards Stealthy Real-world Poisoning Attack on Vision-Language-Action Model in State Space

Vision-Language-Action (VLA) models are widely deployed in safety

high relevance attack
Paper 2603.04859v1

Osmosis Distillation: Model Hijacking with the Fewest Samples

generated by dataset distillation methods, where an adversary can perform a model hijacking attack with only a few poisoned samples in the synthetic dataset. To reveal this threat, we propose

medium relevance benchmark
Previous Page 5 of 16 Next