> Output tokens are just an artefact for our convenience
That's nonsense. The hidden layers are specifically constructed to increase the probability that the model picks the right next word. Without the output/token-generation stage, the hidden layers are meaningless: just empty noise.
It is fundamentally an algorithm for generating text. If you take the text away, it's just a bunch of fmadds (fused multiply-adds). A mute person can still think; an LLM without output tokens can do nothing.
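To make that concrete, here is a toy sketch of the output stage (random weights, made-up dimensions; `W_unembed` is just an illustrative name): the final hidden state only becomes a word once it is projected into vocabulary space and pushed through a softmax. Drop that projection and `h` is just an uninterpreted vector of floats.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, vocab_size = 8, 50          # toy sizes; real models use thousands

h = rng.standard_normal(d_model)                        # final hidden state for the last position
W_unembed = rng.standard_normal((d_model, vocab_size))  # output projection ("unembedding")

logits = h @ W_unembed               # one score per vocabulary token
probs = np.exp(logits - logits.max())
probs /= probs.sum()                 # softmax: a distribution over next tokens

next_token = int(probs.argmax())     # greedy decoding picks the argmax
```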
I think that's almost completely backwards. The input and output layers just convert between natural language and embeddings, i.e. they shift the format of the language. But operating on the embeddings is where meanings (locations in vector space) get transformed.
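A toy sketch of that framing (random weights, with tied input/output embeddings assumed purely for brevity; `E`, `W1`, `W2` are illustrative names): the input side is a table lookup, the output side is a matrix product back into vocabulary space, and everything in between is what actually moves the point around in vector space.

```python
import numpy as np

rng = np.random.default_rng(1)
vocab_size, d_model = 50, 8

E = rng.standard_normal((vocab_size, d_model))  # embedding table: format conversion

token_id = 7
x = E[token_id]                   # "input layer": literally a table lookup

# Hidden layers: the part that actually transforms the point in vector space.
W1 = rng.standard_normal((d_model, d_model))
W2 = rng.standard_normal((d_model, d_model))
h = W2 @ np.maximum(W1 @ x, 0)    # linear map, nonlinearity, another linear map

logits = h @ E.T                  # "output layer": convert the point back to token scores
```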