Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Hmmm, I might be rounding off wrong? Or reading it wrong?

IIUC the data we have:

2K tokens / 12 seconds = 166 tokens/s prefill

120K tokens / (10 minutes == 600 seconds) = 200 token/s prefill



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: