All Tasks #

24 tasks sorted by pass rate (easiest first).

Task Lang Pass Rate Cost Time Cheapest Fastest
cpp-simple C++ 75%
$0.07 2m $0.01 Google 1m Google
go-microservices-traces Go 53%
$0.28 7m $0.03 Grok 4m Google
go-grpc-fix Go 48%
$0.47 18m $0.06 Google 10m OpenAI
cpp-advanced C++ 35%
$0.69 7m $0.04 Google 2m Google
python-microservices Python 33%
$0.44 7m $0.08 Google 3m Google
go-microservices-logs Go 25%
$0.75 8m $0.07 Google 2m Google
js-microservices JS 18%
$0.70 11m $0.23 Z.ai 7m Anthropic
go-microservices Go 10%
$0.63 16m $0.46 OpenAI 13m OpenAI
net-microservices .NET 10%
$0.19 13m $0.05 DeepSeek 7m OpenAI
php-distributed-context-pro PHP 10%
$0.50 12m $0.16 Google 6m Google
cpp-distributed-context-pro C++ 3%
$0.27 15m $0.27 Z.ai 15m Z.ai
go-distributed-context-prop Go 3%
$0.31 17m $0.31 Z.ai 17m Z.ai
php-microservices PHP 3%
$1.24 10m $1.24 Anthropic 10m Anthropic
rust-distributed-context-pr Rust 3%
$0.91 19m $0.91 OpenAI 19m OpenAI
erlang-microservices Erlang 0%
go-log Go 0%
go-microservices-traces-sim Go 0%
go-workflow-tracing Go 0%
java-distributed-context-pr Java 0%
java-microservices Java 0%
python-distributed-context- Python 0%
ruby-microservices Ruby 0%
rust-microservices Rust 0%
swift-microservices Swift 0%

Cost and Time show median values computed only from successful runs. Cheapest and Fastest show the best single run for each task.