so apparently they have custom hardware that is basically absolutely gigantic ch...

bigyabai · 2025-11-08T03:57:36 1762574256

For $50/month, it's a non-starter. I hope they can find a way to use all this excess bandwidth to put out a $10 equivalent to Claude Code instead of a 1000 tok/s party trick I can't use properly.

typpilol · 2025-11-08T06:07:19 1762582039

I feel the same and it's also why I can't understand all these people using small local models.

Every local model I've used, and even most open-source ones, are just not good.

behnamoh · 2025-11-08T06:26:50 1762583210

the only good-enough models I still use are gpt-oss-120b-mxfp4 (not 20b) and glm-4.6 at q8 (not q4).

quantization ruins models and some models aren't that smart to begin with.

csomar · 2025-11-08T08:07:59 1762589279

GLM-4.6 is on par with Sonnet 4.5. Sometimes it is better, sometimes it is worse. Give it a shot. It's the only model that made me (almost) ditch Claude. The only problem is, Claude Code is still the best agentic program in town and search doesn't function without a proper subscription.

DeathArrow · 2025-11-08T19:05:44 1762628744

Have you tried Claude Code Router with GLM 4.6?

https://github.com/musistudio/claude-code-router

mcpeepants · 2025-11-08T14:05:39 1762610739

z.ai-hosted GLM 4.6 works great with Claude Code, drops right in

esafak · 2025-11-08T17:26:25 1762622785

Have you tried opencode?

wyre · 2025-11-09T01:28:14 1762651694

Cerebras offers pay-per-token. What are you asking for? Claude Code starts at $100, or $15/mtok. Cerebras is already much cheaper, but you want it to be even cheaper at $10?

xadhominemx · 2025-11-08T18:58:52 1762628332

$600 per year is a trivial cost for a professional tool

bigyabai · 2025-11-08T23:43:00 1762645380

$600 per anything is Herman Miller territory, pal. I'm not paying that for a SaaS.