It seems that the AI treats supposedly hateful comments towards sociological min...

notahacker · on Feb 2, 2023

Also seems exactly like the quirk of an AI that's been trained to consider associate the hatefulness of a sentence on a corpus of data which is full of material about anti gay hate and entirely empty of material about the essentially nonexistent phenomena of anti-"straight people" hate and inferred that having the word "gay" in a sentence makes it more likely to be a serious statement of hate than the word "straight people". (Maybe less a quirk and more a reflection of the reality of human discourse...)

You could force it to treat all sentence subjects equally, of course, although the logical consequence of that is ChatGPT giving you a lecture when you ask it to write a joke about hating your shoes