The Dire Defect of ‘Multilingual’ AI Content Moderation
Three parts Bosnian text. Thirteen parts Kurdish. Fifty-five parts Swahili. Eleven thousand parts English. This is part of the data recipe for Facebook’s new large language model, which the…