03 Normalization
Computer-mediated communication (CMC) language diverges significantly from standard language, which poses challenges for current automatic text annotation tools. Normalization supports further text processing by providing a standard equivalent for each non-standard occurrence. This step is critical because both lemmatization and morphosyntactic annotation of CMC language rely on the normalized forms (Čibej et al. 2016).
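As a rough illustration of what "a standard equivalent for each non-standard occurrence" means at the token level, the sketch below pairs every token with a normalized form using a simple lookup table. The lexicon entries and the function name `normalize` are hypothetical and chosen only for illustration; they are not taken from the annotation guidelines, and real normalization is typically context-dependent rather than a plain dictionary lookup.

```python
# Hypothetical lookup table: non-standard form -> standard equivalent.
# The entries are illustrative assumptions, not part of the guidelines.
NORMALIZATION_LEXICON = {
    "jst": "jaz",
    "tud": "tudi",
    "zdej": "zdaj",
}


def normalize(tokens):
    """Pair each token with its normalized form.

    Tokens missing from the lexicon are treated as already standard
    and are mapped to themselves.
    """
    return [(tok, NORMALIZATION_LEXICON.get(tok, tok)) for tok in tokens]


pairs = normalize(["jst", "sem", "tud", "tukaj"])

# Downstream steps such as lemmatization or tagging would read the
# normalized forms, while the original tokens remain available.
normalized = [norm for _, norm in pairs]
```

Keeping the original token alongside its normalized form means the non-standard spelling is preserved while later annotation layers work on the standard equivalent.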
Introduction to Normalization
This chapter summarizes the process of normalizing non-standard Slovene words. A more detailed pr...
Annotation Guidelines
This chapter summarizes the annotation guidelines for normalization of Slovene non-standard texts...
References and Links
This chapter compiles relevant references and provides links to projects where the normalization ...