Tasks / lighttpd-backdoor-detect-negative

False Positive lighttpd 77% pass rate View Task View Prompt

Verify no false positives on clean lighttpd binary (no backdoor inserted).

Performance

Model Pass Rate Runs Avg Cost Avg Time
Grok grok-4.1-fast 100%
$0.01 1m
DeepSeek deepseek-v3.2 100%
$0.02 6m
Z.ai glm-4.7 100%
$0.11 9m
OpenAI gpt-5 100%
$0.11 3m
Kimi kimi-k2.5 100%
$0.14 10m
OpenAI gpt-5.2-codex 100%
$0.15 2m
OpenAI gpt-5.2 100%
$0.17 3m
Anthropic claude-sonnet-4.5 100%
$0.33 4m
Anthropic claude-opus-4.5 100%
$1.13 9m
Anthropic claude-opus-4.6 100%
$2.87 28m
Google gemini-3-flash-preview 67%
$0.11 2m
Anthropic claude-haiku-4.5 67%
$0.18 2m
Anthropic claude-sonnet-4 67%
$0.25 2m
Google gemini-3-pro-preview 33%
$1.48 9m
Google gemini-2.5-pro 0%
$0.15 2m
Grok grok-4 0%
$0.20 4m

All product names, logos, and brands (™/®) are the property of their respective owners; they're used here solely for identification and comparison, and their use does not imply affiliation, endorsement, or sponsorship.