Gian.cool Gianfranco's blog

  • Opus 4.6 vs ChatGPT Codex 5.3

    A comparison of benchmark metrics between Opus 4.6 and Codex 5.3 models.

    Anthropic and OpenAI both recently published Terminal-Bench 2.0 results, but in separate charts and a table. I wanted the full picture, so I combined them.

    Benchmark Comparison

    Agentic Coding

    Terminal-Bench 2.0
    Anthropic
    OpenAI
    Google

    Note: All OpenAI models shown at xhigh compute setting. GPT-5.2-Codex appears twice — 64.7% as reported by Anthropic, 64.0% as reported by OpenAI.

    Read more →
  • I Built a Desktop Audio Converter With Claude Code

    I’ve been meaning to build this app, and I actually started around this time last year. But after learning how to “code” or build with AI coding agents like claude code, I just gave it this prompt:

    plan how to complete this app. it should allow one or multiple files to be selected or dragged (audio only) and then it should show a box to select which format to convert to e.g mp3, wav, aac,ogg, flac,m4a, mp4) plus certain options that come from ffmpeg to compress the file

    It wrote this comprehensive plan. And the it went for it. I asked a small question to fix a small UI color issue. And voila!

    A(I) built Audioslim, a native macOS app that converts audio between MP3, WAV, AAC, OGG, FLAC, M4A, and MP4. I built it with claude code, Anthropic’s AI coding assistant for the terminal.

    screenshot

    Read more →