04 MULTEXT-East Morphosyntax
The MULTEXT-East framework for morphosyntactic annotation of text corpora defines compact character codes, referred to as MSD-tags ('MSD' stands for morphosyntactic description), in which the first character gives the word category and each following character gives the value of one attribute. For example, the "Ncmsn" tag represents the set of grammatical features "Noun Type=common Gender=masculine Number=singular Case=nominative". This annotation system has been established for 20 languages or dialects, including all Slavic languages.
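The mapping from a tag such as "Ncmsn" to its feature set can be sketched in Python as a positional lookup. This is a minimal illustration, not the official specification: the function name `decode_msd` and the tables `NOUN_ATTRIBUTES` and `CATEGORIES` are hypothetical, and the table covers only a small, assumed subset of noun attributes and values. Only the "Ncmsn" expansion is taken from the text above; consult the language-specific MULTEXT-East specifications for the authoritative tables.

```python
# Illustrative, partial table (an assumption, not the full specification):
# each entry is (attribute name, {character code: value}) for one tag position.
NOUN_ATTRIBUTES = [
    ("Type", {"c": "common", "p": "proper"}),
    ("Gender", {"m": "masculine", "f": "feminine", "n": "neuter"}),
    ("Number", {"s": "singular", "d": "dual", "p": "plural"}),
    ("Case", {
        "n": "nominative", "g": "genitive", "d": "dative",
        "a": "accusative", "l": "locative", "i": "instrumental",
    }),
]

# The first character of a tag selects the category and its attribute table.
CATEGORIES = {"N": ("Noun", NOUN_ATTRIBUTES)}


def decode_msd(tag):
    """Expand an MSD-tag into a 'Category Attr=value ...' string."""
    if not tag:
        raise ValueError("empty tag")
    category_code, rest = tag[0], tag[1:]
    if category_code not in CATEGORIES:
        raise ValueError(f"category {category_code!r} not in this sketch")
    category, attributes = CATEGORIES[category_code]
    if len(rest) > len(attributes):
        raise ValueError(f"tag {tag!r} is longer than this sketch supports")

    features = [category]
    for (attribute, values), code in zip(attributes, rest):
        if code == "-":
            # Assumed convention: a hyphen marks an attribute that does not apply.
            continue
        if code not in values:
            raise ValueError(f"unknown {attribute} code {code!r} in {tag!r}")
        features.append(f"{attribute}={values[code]}")
    return " ".join(features)


print(decode_msd("Ncmsn"))
```

Keeping one table per category mirrors the positional design of the tags: supporting another category in this sketch means adding another entry to `CATEGORIES`, without changing the decoding logic.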
The use of MULTEXT-East tags for Slovene began in 1996 and continued in all subsequent open corpora of Slovene, whether manually or automatically annotated, until the Universal Dependencies morphosyntactic annotation framework emerged. Universal Dependencies is now gradually taking over the role that MULTEXT-East played for decades.
Introduction to Tags
In this chapter, we outline the design of the MULTEXT-East specifications. The multilingual MULTE...
Annotation Guidelines
This chapter summarizes the annotation guidelines for the MULTEXT-East morphosyntax as applied to...
References and Links
This chapter compiles relevant references and provides links to projects where the MULTEXT-East m...