It trains off social media, and even white kids use AAVE online. And kids make the most social media comments.
A lot of times when someone posts a text screenshot and everyone talks about how kids talk crazy, it’s just a patois of AAEV mixed in with “regular” English.
It should be able to “read” it fine.
The bias part (as clearly stated in the article…) is when you ask a LLM to describe the person who would phrase something in AAVE, and the LLM replies back with stereotypes about Black people.
So it can read and interpret it fine, it just has a bias against people who talk like that
They can’t possibly encounter much of it in training material… Of course they’re not going to like it.
What?
It trains off social media, and even white kids use AAVE online. And kids make the most social media comments.
A lot of times when someone posts a text screenshot and everyone talks about how kids talk crazy, it’s just a patois of AAEV mixed in with “regular” English.
It should be able to “read” it fine.
The bias part (as clearly stated in the article…) is when you ask a LLM to describe the person who would phrase something in AAVE, and the LLM replies back with stereotypes about Black people.
So it can read and interpret it fine, it just has a bias against people who talk like that