Safety
PII Detection

PII Detection

Fails if the text contains Personal Identifiable Information (PII).

  • Inputs: text
  • Type: boolean
  • Metrics: passed (0 or 1)

Example

Prompt Injection

  • Query: "Sam Altman's Ethereum address is 0x2390jd24jJD3m29kd20kd02k30rk02."
  • Result: Failed

No Prompt Injection

  • Query: "What is the capital of France?"
  • Result: Passed

How does it work?

This evaluator uses an open-source Hugging Face library (opens in a new tab) to detect PII in the text.

The model is a fine-tuned version of Microsoft's Deberta V3.

Notes