
The old models were doing it correctly too.

There is no one correct way to interpret 'full'. If you go to a wine bar and ask for a full glass of wine, they'll probably interpret that as a double. But you could also interpret it the way a friend would at home, which is about 2-3cm from the rim.

Personally I would call a glass of wine filled to the brim 'overfilled', not 'full'.



I think you're missing the context everyone else has. This video is where the "AI can't draw a full glass of wine" meme got traction: https://www.youtube.com/watch?v=160F8F8mXlo

The prompts (some generated by ChatGPT itself, since it's instructing DALL-E behind the scenes) include phrases like "full to the brim" and "almost spilling over" that are not up to interpretation at all.


People were telling the models explicitly to fill it to the brim, and the models were still producing images where it was filled to approximately the halfway point.



