Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It is very basic. I have built my own version of this based on Chromium that integrates both Claude and ChatGPT in the browser. It can do a lot of tasks like translate or shorten the text I selected and so on. It took me like a couple of hours to build. The problem is the cost of using the LLMs, especially since they are still pretty stupid and requires huge prompts.

EDIT: I think I misunderstood your Q. Sorry. You can take a screenshot and post it to ChatGPT and get back what it is seeing, in theory. I mean, I use ChatGPT to post screenshots of my sites to get feedback on my layout and designs...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: