Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Interesting. When i asked Gemini 3 Pro to generate a Infographic from my personal accounting sheet, it first failed to generate anything except a black background, then it generated something where it mixed different languages in a non-sensical way, with obvious typos and irrelevant information grouping. It's certainly a leap forward in OCR, rendering classic OCR useless.




That's more of an issue with Nano Banana Pro than with Gemini 3 Pro.

What's the difference? I thought the vision ai component of gemini 3 is called nano banana?

That’s about generating images, the other side is about understanding images.

i assumed nano banana was just a tool that gemini 3 used though i don't know

Gemini 3 Pro's text encoder powers Nano Banana Pro, but it has its own image decoding model that decodes the generated image tokens into an actual image, which appears to be the more pertinent issue in this case.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: