Category Archives: AI

How to Use GPT-5.5 Today at the CLI (Via Your Existing Codex Subscription)

Update 2026-04-26 Small 4/26 update before you run the commands below: Codex CLI 0.125.0 added GPT-5.5 to codex exec -m gpt-5.5 and the Codex MCP server. (For MCP, restart Claude Code after upgrading so the running server picks up the new binary.) The relay is no longer the only path to 5.5 from your terminal. The OpenAI public API still hasn’t shipped 5.5 as of today. I’m still keeping the relay for fast one-off pipes and as a backup connection

How to Fix Codex Sandbox Errors on Ubuntu 24.04

Terminal before and after: bwrap RTM_NEWADDR error before the AppArmor profile, ok after

TL;DR: Codex sandbox errors on Ubuntu 24.04 almost always trace back to one thing on freshly installed boxes: AppArmor blocking bwrap. A five-line /etc/apparmor.d/bwrap profile fixed it on my system. If you’re hitting the same wall, paste your error into Claude Code and let it walk you through. 💪 The Symptom: Codex Sandbox Hangs on Ubuntu 24.04 My Codex setup on a fresh Ubuntu 24.04 VM was unusable. Every codex exec call burned 35K to 54K tokens over 2 or

Claude Opus 4.6 vs 4.7 Max: Which is better? Graded on Real World Planning

Letter-grade scorecard comparing Claude Opus 4.6 Max and Claude Opus 4.7 Max on a WordPress modernization planning task, with Opus 4.7 winning overall A- to C

TL;DR: This is one real WordPress website maintenance task, run once against each model. On that task, Claude Opus 4.7 produced the stronger first-pass plan because it started from a more accurate baseline. Opus 4.6 was faster and caught one thing 4.7 missed, but it got the active theme wrong and claimed an impossible PHP version. I would not execute either plan as written. Grades below. Note: Opus 4.7 is getting a rough reception on Reddit and around the web

Switch to Claude Code Opus 4.7 [instructions]

Opus 4.7 launched on April 16, 2026. Here’s the fast path to switch Claude Code over, including the gotcha I hit in the VS Code extension. Quick upgrade (terminal) Three commands, in order: claude update claude –version claude –model claude-opus-4-7 If that last command starts a session confirming claude-opus-4-7, you’re done. If not, keep reading. Step 1: Update Claude Code Most people run Claude Code in a terminal. One command does it: claude update Anthropic lists v2.1.111+ as the minimum

Meta Muse Spark: The Honest Scorecard (3 Wins Out of 20)

Meta just released Muse Spark, the first model from their new Superintelligence Labs. Along with it came a 20-benchmark comparison chart. The chart highlights every Muse Spark score in blue, which makes it look like Muse is leading across the board. The original Meta release table/graphic just highlighted their column and it made it seem almost as if they won every row… They did not win every test (far from it) so maybe the the table could use some actual

Can MacBook Neo Run Claude Code?

So in updating my MacBook Neo review with some additional stress testing and thermal testing, I came across some interesting results that begged the question: “Can it run Claude?” If you’re a gamer, that probably triggers the old “Can it run Crysis?” reflex. For those who don’t remember (or weren’t there), Crysis was the 2007 PC game that became the de facto hardware torture test. If your rig could run Crysis, it could run anything. It was a rite of

Claude vs ChatGPT vs Gemini for Coding: Testing Results

TL;DR: I ran the same 5 coding tasks through Claude Opus 4.6, OpenAI Codex CLI (gpt-5.3-codex), Google Gemini 2.5 Flash (sorry I did not have easy access to the newer models, but Gemma 4 was tested!), and two open-source models I ran locally: Gemma 4 31B and Qwen 3.5 35B. Claude’s code was the most production ready. Codex and Qwen tied for best code reviewer. Gemini was the cheapest. The open-source models scored A-, closing in on the paid tier.

Claude Code /buddy How to Preview, Hatch, and Reroll Your Terminal Pet

TL;DR: Claude Code v2.1.89 added /buddy, a virtual pet companion in your terminal. Your buddy’s body is deterministically generated from your account, and the personality generates permanently the first time you hatch it. Preview yours first with npx any-buddy current, but install Bun before you do (if you don’t already have it) Is This an April Fools Joke? I first noticed /buddy the least trustworthy way possible, by seeing it appear in Claude Code’s slash-command autocomplete on April 1. 😜

« Older Entries Recent Entries »