
Are you saying that some models will take 100x more tokens than other models (in the same ballpark) for the same task? Is the 100 a real measured metric, or just a number picked to illustrate the point?


With thinking models, yes, 100x is not just possible but probable. You get charged for the intermediate thinking tokens even if you don't see them (which is the case for Grok, for example). And even if you do see them, they won't necessarily add value.
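Roughly how that shows up in a response, as a minimal sketch assuming an OpenAI-style Chat Completions client that reports hidden thinking tokens under usage.completion_tokens_details.reasoning_tokens (field names vary by provider):

    # Sketch: hidden thinking tokens are billed as output tokens even
    # though they never appear in the reply text. Assumes an
    # OpenAI-style API; field names differ between providers.
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    resp = client.chat.completions.create(
        model="o1",  # any reasoning model
        messages=[{"role": "user", "content": "Summarize the attached report."}],
    )

    usage = resp.usage
    reasoning = usage.completion_tokens_details.reasoning_tokens
    visible = usage.completion_tokens - reasoning
    print(f"visible output tokens:  {visible}")
    print(f"hidden thinking tokens: {reasoning}")  # billed, not shown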


> With thinking models, yes, 100x is not just possible but probable

So the answer is no then, because I don't put reasoning and non-reasoning models in the same ballpark when it comes to token usage. You can just turn off reasoning.


The GPT-5 models use ~10x more tokens depending on the reasoning settings.
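You can see the spread yourself by re-running the same prompt at different settings; a rough sketch, assuming a GPT-5-style model that accepts a reasoning_effort parameter (the parameter name and accepted values are assumptions and vary by provider):

    # Sketch: compare output token counts across reasoning settings
    # for the same prompt. Assumes an OpenAI-style API with a
    # reasoning_effort parameter; adjust for your provider.
    from openai import OpenAI

    client = OpenAI()
    prompt = [{"role": "user", "content": "Plan a 3-step refactor of a 5k-line module."}]

    for effort in ("minimal", "medium", "high"):
        resp = client.chat.completions.create(
            model="gpt-5",
            reasoning_effort=effort,
            messages=prompt,
        )
        print(effort, resp.usage.completion_tokens, "output tokens")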



