Developing High-Quality Linguistic Corpora for Analysing Highly Subjective Phenomena

This tutorial covers methods for creating high-quality datasets, in particular from natural language data. It surveys agreement metrics and annotation techniques such as crowdsourcing, and discusses how to leverage controversiality and polarization of opinions. A hands-on exercise is included to give participants practical experience with annotation techniques.
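As a small illustration of the kind of agreement metric the tutorial surveys, the sketch below computes Cohen's kappa for two annotators. It is only an assumed example, not taken from the tutorial materials: the function name `cohen_kappa` and the toy labels are hypothetical, and the tutorial itself may focus on other metrics.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items.

    Hypothetical helper for illustration; not part of the tutorial files.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty item list")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected chance agreement, from each annotator's label distribution.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    # Degenerate case: both annotators always use the same single label.
    if expected == 1.0:
        return 1.0

    return (observed - expected) / (1.0 - expected)


# Toy data (invented): two annotators judging six messages.
annotator_1 = ["off", "off", "not", "not", "off", "not"]
annotator_2 = ["off", "not", "not", "not", "off", "off"]
kappa = cohen_kappa(annotator_1, annotator_2)
print(round(kappa, 3))
```

With highly subjective phenomena, a low kappa need not indicate poor annotation; it may reflect genuine disagreement among annotators, which is the kind of signal the tutorial proposes to leverage.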

For more information about the tutorial and, in due time, the necessary files, visit the tutorial website of the organizers, Valerio Basile and Komal Florio.