# References and Links

This chapter compiles relevant references and provides links to projects where the lemmatization process has been developed and applied to Slovene texts.<br />

**Projects, in which the system has been developed or applied**<br />
[Universal Dependencies](https://universaldependencies.org/)<br />
[MULTEXT-East - Multilingual corpora and text tools for Central and East European langauges](https://nl.ijs.si/ME/)<br />
[JOS - Linguistic Annotation of Slovene: Methods and Resources](http://nl.ijs.si/jos/index-en.html)<br />
[Communication in Slovene](http://eng.slovenscina.eu/)<br />
[Janes - Resources, Tools and Methods for the Research of Nonstandard Internet Slovene](https://nl.ijs.si/janes/english)<br />
[Development of Slovene in a Digital Environment](https://rsdo.slovenscina.eu/en)<br />

**The Obeliks tool for tokenization and sentence segmentation**<br />
[https://github.com/clarinsi/obeliks](https://github.com/clarinsi/obeliks)<br />

**References**<br />
Krek, Simon; Erjavec, Tomaž; Dobrovoljc, Kaja; Gantar, Polona; Arhar Holdt, Špela; Čibej, Jaka in Brank, Janez. The ssj500k training corpus for Slovene language processing. V: Fišer, D. in Erjavec, T. Jezikovne tehnologije in digitalna humanistika: zbornik konference: 24.-25. september 2020, Ljubljana, Slovenija. Ljubljana: Inštitut za novejšo zgodovino, 2020. Str. 24–33.	 
[http://nl.ijs.si/jtdh20/pdf/JT-DH_2020_Krek-et-al_The-ssj500k-Training-Corpus-for-Slovene-Language-Processing.pdf](http://nl.ijs.si/jtdh20/pdf/JT-DH_2020_Krek-et-al_The-ssj500k-Training-Corpus-for-Slovene-Language-Processing.pdf) [[PDF]](https://wiki.cjvt.si/attachments/48)<br />