Many API endpoints (and local services for that matter) does caching at this poi...

		embedding-shape 29 days ago \| parent \| context \| favorite \| on: Cerebras Code now supports GLM 4.6 at 1000 tokens/... Many API endpoints (and local services for that matter) does caching at this point though, with much cheaper prices for input/outputs that were found in the caching. I know Anthrophic does this, and DeepSeek I think too, at the very least.