
It seems that the AI treats supposedly hateful comments towards sociological minorities as more problematic than similar comments towards non-minorities and privileged groups. Couple that with the “liberal bias” and you have something with the same inclinations that you would expect a bunch of AI technologists to have. Nothing specific to the quirks of the AI.


Also seems exactly like the quirk of an AI that's been trained to judge the hatefulness of a sentence on a corpus of data which is full of material about anti-gay hate and entirely empty of material about the essentially nonexistent phenomenon of anti-"straight people" hate, and which has inferred that having the word "gay" in a sentence makes it more likely to be a serious statement of hate than the words "straight people". (Maybe less a quirk and more a reflection of the reality of human discourse...)

You could force it to treat all sentence subjects equally, of course, although the logical consequence of that is ChatGPT giving you a lecture when you ask it to write a joke about hating your shoes.



