rebuff - LLM Prompt Injection Detector

OP10-23-2023 - 07:44 PM

Rebuff is designed to protect AI applications from fast injection (PI) attacks through a multi-layered defense.

https://github.com/protectai/rebuff

Features

Rebuff offers 4 layers of defense:

Heuristics: Filter out potentially malicious input before it reaches the LLM.
LLM-based detection: Use a dedicated LLM to analyze incoming prompts and identify potential attacks.
VectorDB: Store embeddings of previous attacks in a vector database to recognize and prevent similar attacks in the future.
Canary tokens: Add canary tokens to prompts to detect leakages, allowing the framework to store embeddings about the incoming prompt in the vector database and prevent future attacks.

rebuff - LLM Prompt Injection Detector

About Us

Navigation

Extras

Help

Account