Instrument a simple C++ application with 3 methods (`work`, `fastOp`, `slowOp`) to generate OpenTelemetry-compatible trace output. Create OTEL implementation that outputs JSON traces with parent-child span relationships.
What makes it easy
- Simple, well-defined scope (3 spans only) - No external dependencies or network calls - Clear output format requirements
Common failure modes
- Missing parent-child relationships
Performance
| Model | Pass Rate | Runs | Avg Cost | Avg Time |
|---|---|---|---|---|
| deepseek-v3.2 | 100% | | $0.03 | 6m |
| gpt-5.1 | 100% | | $0.07 | 2m |
| gpt-5.2-codex | 100% | | $0.10 | 5m |
| gpt-5.2 | 100% | | $0.11 | 2m |
| claude-sonnet-4.5 | 100% | | $0.17 | 2m |
| claude-opus-4.5 | 100% | | $0.28 | 2m |
| gemini-3-flash-preview | 67% | | $0.01 | 1m |
| glm-4.7 | 67% | | $0.04 | 3m |
| gpt-5.1-codex-max | 67% | | $0.04 | 4m |
| claude-haiku-4.5 | 67% | | $0.05 | 1m |
| gemini-3-pro-preview | 67% | | $0.13 | 2m |
| grok-4 | 67% | | $0.30 | 7m |
| kimi-k2-thinking | 33% | | $0.02 | 7m |
| grok-4.1-fast | 33% | | $0.03 | 7m |
All product names, logos, and brands (™/®) are the property of their respective owners; they're used here solely for identification and comparison, and their use does not imply affiliation, endorsement, or sponsorship.