SKILL-INJECT

Measuring Agent Vulnerability to Skill File Attacks

Benchmark Category

Contextual Injection ASR by Model and Safety Policy

Contextual Injection ASR by Model and Safety Policy

Policy Configuration

Models

Loading models...

Model Comparison

Select models above to compare their performance