Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A very worthwhile mention is also Stable-TS: https://github.com/jianfch/stable-ts

Out of the box it can transcribe with Whisper or Faster-Whisper, but it can also align audio with an existing human-written transcript, providing time information without losing accuracy. This last feature was something I really needed, and my attempt at building it myself ended up much worse, so I'm glad I found this

I self-host it using Modal.com, as do some other commenters



how much do you spend for modal.com?


I don't even go past their monthly allowance :) I don't have that much audio to process




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: