This ongoing project collates and organises resources for research and policymaking on online hate. The resources aim to cover all aspects of the research, policymaking, law and civil society activism that monitor, understand and counter online hate. They focus on the UK but include international work as well.
On hatespeechdata.com we have catalogued a large number of readily available datasets annotated for hate speech, online abuse and offensive language. They may be useful, for example, for training a natural language processing system to detect such language. The page currently lists 50+ datasets in 15 languages, including Arabic, Danish, English, French, German, Hindi-English, Indonesian and Turkish.
The Online Harms Observatory is a new platform that will provide real-time insight into the scope, prevalence and dynamics of harmful online content. It will be powered by a mix of large-scale data analysis, AI and survey data. The Observatory will draw on our research to help policymakers, regulators, security services and other stakeholders better understand the landscape of online harms.