Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Refrag: Rethinking RAG Based Decoding (arxiv.org)
4 points by datadrivenangel 3 months ago | hide | past | favorite | 1 comment


Am I misunderstanding this or is basically just taking RAG results and doing a vector search on the results and only passing some to the context window?

Also, why do these AI papers never get speedup times in human time units?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: