This ongoing project collates and organises resources for research and policymaking on online hate. These resources aim to cover all aspects of research, policymaking, the law and civil society activism to monitor, understand and counter online hate. Resources are focused on the UK, but include international work as well.
On hatespeechdata.com we have catalogued a large number of readily available datasets annotated for hate speech, online abuse, and offensive language. They may be useful for e.g. training a natural language processing system to detect this language. The page currently consists of 50+ datasets in 15 languages – including Arabic, Danish, English, French, German, Hindu-English, Indonesian and Turkish.