Why a YouTube Chat About Chess Got Flagged for Hate Speech
Blog: Decision Management Community
WIRED fed some of the comments gathered by the CMU researchers into two hate-speech classifiers—one from Jigsaw, an Alphabet subsidiary focused on tackling misinformation and toxic content, and another from Facebook. Some statements, such as “At 1:43, if white king simply moves to G1, it’s the end of black’s attack and white is only down a knight, right?” were judged 90 percent likely not hate speech. But the statement “White’s attack on black is brutal. White is stomping all over black’s defenses. The black king is gonna fall… ” was judged more than 60 percent likely to be hate speech.
“Fundamentally, language is still a very subtle thing,” says Tom Mitchell, a CMU professor. “These kinds of trained classifiers are not soon going to be 100 percent accurate.” Link