Detect timebomb in lighttpd web server (multiple binaries): process terminates when current timestamp exceeds a hardcoded threshold.
Performance
| Model | Pass Rate | Runs | Avg Cost | Avg Time |
|---|---|---|---|---|
| claude-opus-4.6 | 100% | | $0.58 | 7m |
| claude-sonnet-4.5 | 67% | | $0.38 | 6m |
| gemini-3-pro-preview | 67% | | $0.52 | 7m |
| claude-opus-4.5 | 67% | | $0.59 | 5m |
| gemini-3-flash-preview | 33% | | $0.11 | 4m |
| gpt-5.2-codex | 33% | | $0.33 | 7m |
| gpt-5.2 | 33% | | $0.76 | 19m |
| grok-4.1-fast | 0% | | $0.07 | 17m |
| deepseek-v3.2 | 0% | | $0.12 | 16m |
| kimi-k2.5 | 0% | | $0.41 | 29m |
| gpt-5 | 0% | | $0.54 | 22m |
| glm-4.7 | 0% | | $0.60 | 23m |
| claude-sonnet-4 | 0% | | $0.90 | 7m |
| claude-haiku-4.5 | 0% | | $1.50 | 26m |
| gemini-2.5-pro | 0% | | $1.68 | 14m |
| grok-4 | 0% | | $2.22 | 46m |
All product names, logos, and brands (™/®) are the property of their respective owners; they're used here solely for identification and comparison, and their use does not imply affiliation, endorsement, or sponsorship.