Linkblog

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

As Simon observes:

This fits a trend: OpenAI’s GPT-5.5 was 2x the price of GPT-5.4, and Claude Opus 4.7 is around 1.46x the price of 4.6 when you take the new tokenizer into account…. It feels like all three of the major AI labs are starting to probe the price tolerance of their API customers.

Given we’re also seeing pricing pressure on the prosumer subscription costs, it seems like we may be about to see API prices across the board increasing, increasing the appeal of “good enough” SLMs that can be self-hosted

Back to linkblog