# 11 Developmental corpus Šolar

# Introduction to Tags

This chapter summarises the Šolar tags. A more detailed presentation can be found in the Annotation Guidelines chapter.

| Tag | Linguistic level | Category of correction | Specific language problem |
| --- | --- | --- | --- |
| Č/VOK/odveč | Spelling | Vowels | Superfluous vowel |
| Č/VOK/izpust | Spelling | Vowels | Omitted vowel |
| Č/VOK/menjava-ao | Spelling | Vowels | AO substitution |
| Č/VOK/menjava-ei | Spelling | Vowels | EI substitution |
| Č/VOK/menjava-uo | Spelling | Vowels | UO substitution |
| Č/VOK/menjava-drugo | Spelling | Vowels | Other vowel substitutions |
| Č/KONZ/odveč | Spelling | Consonants | Superfluous consonant |
| Č/KONZ/izpust | Spelling | Consonants | Omitted consonant |
| Č/KONZ/menjava-sz | Spelling | Consonants | Substitution SZ |
| Č/KONZ/menjava-td | Spelling | Consonants | Substitution TD |
| Č/KONZ/menjava-kgh | Spelling | Consonants | Substitution KGH |
| Č/KONZ/menjava-mn | Spelling | Consonants | Substitution MN |
| Č/KONZ/menjava-šž | Spelling | Consonants | Substitution ŠŽ |
| Č/KONZ/menjava-strešice | Spelling | Consonants | Substitution of DIACRITIC |
| Č/KONZ/menjava-drugo | Spelling | Consonants | Other consonant substitution |
| Č/W/začetek | Spelling | Labio-velar approximant w | Word-initial |
| Č/W/sredina | Spelling | Labio-velar approximant w | Word-medial |
| Č/W/konec | Spelling | Labio-velar approximant w | Word-final |
| Č/W/v | Spelling | Labio-velar approximant w | Prepositional V |
| Č/SKLOP/zlog | Spelling | Letter clusters | Missing or superfluous syllable |
| Č/SKLOP/lj | Spelling | Letter clusters | Cluster LJ |
| Č/SKLOP/nj | Spelling | Letter clusters | Cluster NJ |
| Č/SKLOP/ij | Spelling | Letter clusters | Cluster IJ |
| Č/SKLOP/podvojene | Spelling | Letter clusters | Doubled letters |
| Č/SKLOP/premet | Spelling | Letter clusters | Metathesis |
| Č/PRED/sz | Spelling | Variable (allophonic) prepositions | Preposition s/z |
| Č/PRED/kh | Spelling | Variable (allophonic) prepositions | Preposition k/h |
| O/KAT/sklon-rt | Morphology | Categorical corrections | Case: genitive-accusative |
| O/KAT/sklon-dm | Morphology | Categorical corrections | Case: dative-locative |
| O/KAT/sklon-mo | Morphology | Categorical corrections | Case: locative-instrumental |
| O/KAT/sklon-drugo | Morphology | Categorical corrections | Other case substitutions |
| O/KAT/število-em | Morphology | Categorical corrections | Number: singular-plural |
| O/KAT/število-dm | Morphology | Categorical corrections | Number: dual-plural |
| O/KAT/število-ed | Morphology | Categorical corrections | Number: singular-dual |
| O/KAT/spol | Morphology | Categorical corrections | Gender |
| O/KAT/vid | Morphology | Categorical corrections | Aspect |
| O/KAT/čas | Morphology | Categorical corrections | Tense |
| O/KAT/oseba | Morphology | Categorical corrections | Person |
| O/KAT/nedoločnik-kratki | Morphology | Categorical corrections | Reduced infinitive |
| O/KAT/nedoločnik-namenilnik | Morphology | Categorical corrections | Infinitive and supine |
| O/KAT/nedoločnik-osebna | Morphology | Categorical corrections | Infinitive and finite verb |
| O/KAT/povratnost | Morphology | Categorical corrections | Reflexivity |
| O/KAT/naklon | Morphology | Categorical corrections | Mood |
| O/KAT/način | Morphology | Categorical corrections | Voice |
| O/KAT/oblika-zaimka | Morphology | Categorical corrections | Pronominal form |
| O/KAT/določnost | Morphology | Categorical corrections | Definiteness |
| O/KAT/stopnjevanje | Morphology | Categorical corrections | Comparison |
| O/PAR/glagolska-osnova | Morphology | Paradigmatic Corrections | Verbal root |
| O/PAR/glagolska-končnica | Morphology | Paradigmatic Corrections | Verbal ending |
| O/PAR/neglagolska-osnova | Morphology | Paradigmatic Corrections | Non-verbal root |
| O/PAR/neglagolska-končnica | Morphology | Paradigmatic Corrections | Non-verbal ending |
| O/PAR/neobstojni-vokal | Morphology | Paradigmatic Corrections | Epenthetic vowel |
| O/PAR/preglas-in-cč | Morphology | Paradigmatic Corrections | Umlaut and cč |
| O/DOD/variante | Morphology | Additional Annotation | Morphological variants |
| O/DOD/besede-mati-hči | Morphology | Additional Annotation | Mati-hči |
| O/DOD/besede-otrok | Morphology | Additional Annotation | Otrok |
| B/SAM/napačno-lastno | Vocabulary | Nouns | Erroneous proper noun |
| B/SAM/lastno-občno | Vocabulary | Nouns | Proper and common name |
| B/SAM/občno-besedišče | Vocabulary | Nouns | Common vocabulary |
| B/GLAG/predpona | Vocabulary | Verbs | Verbal prefixes |
| B/GLAG/moči-morati | Vocabulary | Verbs | Substitution moči-morati |
| B/GLAG/naklonski | Vocabulary | Verbs | Other substitutions of modal verbs |
| B/GLAG/drugo | Vocabulary | Verbs | Other substitutions of verbs |
| B/ZAIM/povratna-svojilnost | Vocabulary | Pronoun | Reflexive possessive |
| B/ZAIM/ki-kateri | Vocabulary | Pronoun | Substitution ki -- kateri |
| B/ZAIM/oziralni | Vocabulary | Pronoun | Other problems with relative pronouns |
| B/ZAIM/noben | Vocabulary | Pronoun | Substitution of negative pronouns |
| B/ZAIM/drugo | Vocabulary | Pronoun | Other pronominal substitutions |
| B/PRED/glagolske-zveze | Vocabulary | Preposition | Prepositions in verbal phrases |
| B/PRED/neglagolske-zveze | Vocabulary | Preposition | Prepositions in non-verbal phrases |
| B/PRED/lokacijske-dvojnice | Vocabulary | Preposition | Locative doublets |
| B/PRED/drugo | Vocabulary | Preposition | Other substitutions of prepositions |
| B/VEZ/in-pa-ter | Vocabulary | Conjunction | Substitutions of in-pa-ter |
| B/VEZ/protivni | Vocabulary | Conjunction | Coordinating adversative conjunction |
| B/VEZ/sprememba-odnosa | Vocabulary | Conjunction | Change to subordination |
| B/VEZ/drugo | Vocabulary | Conjunction | Other substitutions of conjunctions |
| B/PRID/drugo | Vocabulary | Adjective | All problems related to adjectives |
| B/PRISL/drugo | Vocabulary | Adverb | All problems related to adverbs |
| B/OST/drugo | Vocabulary | Other parts of speech | All problems related to other parts of speech |
| B/MEN/polnopomenska-v-zaimek | Vocabulary | Substitutions beyond the confines of part of speech | Lexical word or phrase changed into pronoun |
| B/MEN/zaimek-v-polnopomensko | Vocabulary | Substitutions beyond the confines of part of speech | Pronoun to a lexical word or phrase |
| B/MEN/veznik-zaimek | Vocabulary | Substitutions beyond the confines of part of speech | Substitution of conjunction and pronoun |
| B/MEN/besedna-družina | Vocabulary | Substitutions beyond the confines of part of speech | Word family |
| B/MEN/samostalnik-bz | Vocabulary | Substitutions beyond the confines of part of speech | Noun and phrase |
| B/MEN/glagol-bz | Vocabulary | Substitutions beyond the confines of part of speech | Verb and phrase |
| B/MEN/prislov-pridevnik-bz | Vocabulary | Substitutions beyond the confines of part of speech | Adverb/adjective and phrase |
| B/MEN/drugo | Vocabulary | Substitutions beyond the confines of part of speech | Other types of substitutions |
| B/DOD/zaznamovano | Vocabulary | Substitutions beyond the confines of part of speech | Stylistically marked vocabulary |
| S/BR/povedek-osebek | Syntax | Word order | Constituent order: predicate-subject |
| S/BR/povedek-predmet | Syntax | Word order | Constituent order: predicate-object |
| S/BR/povedek-prislovno-določilo | Syntax | Word order | Order: sentence-adverbial determiner |
| S/BR/členek | Syntax | Word order | Order: particle |
| S/BR/znotraj-stavčnega-člena | Syntax | Word order | Order within clausal constituents |
| S/BR/naslonski-niz-znotraj | Syntax | Word order | Clitic string: order of clitics |
| S/BR/naslonski-niz-prirednost-podrednost | Syntax | Word order | Clitic string: independent-subordinate |
| S/BR/drugo | Syntax | Word order | Other changes to word order |
| S/IZPUST/samostalnik-občno-ime | Syntax | Omitted constituents | Noun: common noun |
| S/IZPUST/samostalnik-lastno-ime | Syntax | Omitted constituents | Noun: proper noun |
| S/IZPUST/glagol-biti | Syntax | Omitted constituents | The verb biti |
| S/IZPUST/glagol-drugo | Syntax | Omitted constituents | Other omitted verbs |
| S/IZPUST/veznik-pa | Syntax | Omitted constituents | The word pa |
| S/IZPUST/veznik-drugo | Syntax | Omitted constituents | Other omitted conjunctions |
| S/IZPUST/predlog-ponovljen | Syntax | Omitted constituents | Repeated prepositions |
| S/IZPUST/predlog-drugo | Syntax | Omitted constituents | Other omitted prepositions |
| S/IZPUST/zaimek-osebni | Syntax | Omitted constituents | Personal pronoun |
| S/IZPUST/zaimek-drugo | Syntax | Omitted constituents | Other omitted pronouns |
| S/IZPUST/pridevnik | Syntax | Omitted constituents | Adjective |
| S/IZPUST/prislov | Syntax | Omitted constituents | Adverb |
| S/IZPUST/členek | Syntax | Omitted constituents | Particle |
| S/IZPUST/stavek | Syntax | Omitted constituents | Sentence |
| S/ODVEČ/ponavljanje | Syntax | Superfluous constituents | Literal repetition |
| S/ODVEČ/samostalnik-občno-ime | Syntax | Superfluous constituents | Noun: common noun |
| S/ODVEČ/samostalnik-lastno-ime | Syntax | Superfluous constituents | Noun: proper noun |
| S/ODVEČ/glagol-biti | Syntax | Superfluous constituents | The verb biti |
| S/ODVEČ/glagol-drugo | Syntax | Superfluous constituents | Other superfluous verb |
| S/ODVEČ/veznik-pa-vezniki | Syntax | Superfluous constituents | The word pa with another conjunction |
| S/ODVEČ/veznik-pa-drugo | Syntax | Superfluous constituents | Other examples including the word pa |
| S/ODVEČ/veznik-začetek | Syntax | Superfluous constituents | Conjunction at the beginning of a sentence |
| S/ODVEČ/veznik-dvojni | Syntax | Superfluous constituents | Doubled conjunction |
| S/ODVEČ/veznik-drugo | Syntax | Superfluous constituents | Other superfluous conjunction |
| S/ODVEČ/predlog | Syntax | Superfluous constituents | Preposition |
| S/ODVEČ/zaimek-osebni | Syntax | Superfluous constituents | Personal pronoun |
| S/ODVEČ/zaimek-kazalni | Syntax | Superfluous constituents | Demonstrative pronoun |
| S/ODVEČ/zaimek-svojilni | Syntax | Superfluous constituents | Possessive pronoun |
| S/ODVEČ/zaimek-drugo | Syntax | Superfluous constituents | Other superfluous pronouns |
| S/ODVEČ/pridevnik | Syntax | Superfluous constituents | Adjective |
| S/ODVEČ/prislov-mera | Syntax | Superfluous constituents | Adverb of degree |
| S/ODVEČ/prislov-drugo | Syntax | Superfluous constituents | Other superfluous adverbs |
| S/ODVEČ/členek | Syntax | Superfluous constituents | Particle |
| S/ODVEČ/stavek | Syntax | Superfluous constituents | Clause |
| S/ODVEČ/poved | Syntax | Superfluous constituents | Sentence |
| S/STR/svojina-od | Syntax | Structure | Possessives with od |
| S/STR/svojina-rodilnik | Syntax | Structure | Possessives with the genitive |
| S/STR/ločilo-veznik | Syntax | Structure | Substitution punctuation-conjunction |
| S/STR/združevanje-stavkov | Syntax | Structure | Merged clauses |
| S/STR/deljenje-stavkov | Syntax | Structure | Separation of clauses/sentences |
| S/STR/besedna-zveza-stavek | Syntax | Structure | Word/phrase instead of clause and vice versa |
| S/STR/preoblikovanje-stavka | Syntax | Structure | Reworked clause |
| S/DOD/pleonazem | Syntax | Additional Annotation | Pleonasm |
| S/DOD/vsebina-drugo | Syntax | Additional Annotation | Superfluous content |
| S/DOD/vsebina-napake | Syntax | Additional Annotation | Erroneous content |
| S/DOD/pomensko-prazni | Syntax | Additional Annotation | Semantically null |
| Z/MV/pridevnik-ski | Orthography | Capital/lowercase letters | Adjectives ending in -ski |
| Z/MV/pridevnik-drugo | Orthography | Capital/lowercase letters | Other adjectives |
| Z/MV/občna-imena | Orthography | Capital/lowercase letters | Common noun |
| Z/MV/osebna-imena | Orthography | Capital/lowercase letters | Personal name with lowercase letter |
| Z/MV/narodnost | Orthography | Capital/lowercase letters | Nationality with lowercase letter |
| Z/MV/zemljepisna-imena | Orthography | Capital/lowercase letters | Geographical name with lowercase letter |
| Z/MV/stvarna-imena | Orthography | Capital/lowercase letters | Proper nouns with lowercase letter |
| Z/MV/premi-govor | Orthography | Capital/lowercase letters | Direct speech |
| Z/MV/začetek-povedi | Orthography | Capital/lowercase letters | Sentence initial |
| Z/MV/hiperkorekcija-ločila | Orthography | Capital/lowercase letters | Hypercorrection following a period |
| Z/MV/drugo | Orthography | Capital/lowercase letters | Other problems with initial letters |
| Z/SN/skupaj-glagol | Orthography | Together/separate | Verb together |
| Z/SN/skupaj-predlog | Orthography | Together/separate | Preposition together |
| Z/SN/narazen-predlog | Orthography | Together/separate | Preposition separate |
| Z/SN/skupaj-prislov | Orthography | Together/separate | Adverb together |
| Z/SN/narazen-prislov | Orthography | Together/separate | Adverb separate |
| Z/SN/narazen-pridevnik | Orthography | Together/separate | Adjective separate |
| Z/SN/narazen-drugo | Orthography | Together/separate | Other separate |
| Z/SN/skupaj-drugo | Orthography | Together/separate | Other together |
| Z/KR/drugo | Orthography | Abbreviations | All problems related to abbreviations |
| Z/ŠTEV/drugo | Orthography | Numbers | All problems related to numbers |
| Z/LOČ/nerazvrščeno | Orthography | Punctuation | Unclassified punctuation corrections |
| Z/LOČ/vzorec-vejica-stavki | Orthography | Punctuation | Comma before subordinate clauses |
| Z/LOČ/vzorec-vejica-stavčni-členi | Orthography | Punctuation | Comma between parts-of-speech |
| Z/LOČ/vzorec-vejica-vezniki | Orthography | Punctuation | Comma and multi-word conjunctions |
| Z/LOČ/vzorec-vejica-kot | Orthography | Punctuation | Comma and comparative structures |
| Z/LOČ/vzorec-vejica-pristavki | Orthography | Punctuation | Comma and appositions etc. |
| Z/LOČ/vzorec-vejica-vrinjen-odvisnik | Orthography | Punctuation | Comma and inserted subordinate clauses |
| Z/LOČ/vzorec-vejica-priredja-zvez | Orthography | Punctuation | Comma and coordinate phrases |
| Z/LOČ/vzorec-vejica-priredja-odvisnikov | Orthography | Punctuation | Comma and coordinate clauses |
| Z/LOČ/vzorec-vejica-pridevniški-niz | Orthography | Punctuation | Comma in adjective strings |
| Z/LOČ/vzorec-vejica-elipsa-povedka | Orthography | Punctuation | Comma and predicate ellipsis |
| Z/LOČ/vzorec-vejica-kopičenje-ločil | Orthography | Punctuation | Comma and punctuation accumulation |
| Z/LOČ/vzorec-vejica-kopičenje-veznikov | Orthography | Punctuation | Comma and conjunction accumulation |
| Z/LOČ/vzorec-vejica-navajanje | Orthography | Punctuation | Comma and quotation |
| P/OBL/drugo | Related corrections | Related morphology corrections | All corrections related to morphology |
| P/SKLA/osebek | Related corrections | Related syntax corrections | Corrections of subject |
| P/SKLA/drugo | Related corrections | Related syntax corrections | Other corrections related to syntax |
| P/ZAP/mala-velika | Related corrections | Related orthography corrections | Corrections of initial letter |
| N//nečitljivo | Illegible and dubious examples | | Illegible examples |
| N//preveri | Illegible and dubious examples | | Dubious examples |

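
The tags listed above appear to share a three-part, slash-separated shape: a level code, a category code, and a label for the specific problem (the `N` tags leave the middle part empty). The following is a minimal sketch, not part of the Šolar tooling, of how such a tag string could be split for analysis. The function name `parse_tag` and the returned dictionary keys are hypothetical; the level names are taken from the table.

```python
# Level codes and their names, as they appear in the tag table.
LEVELS = {
    "Č": "Spelling",
    "O": "Morphology",
    "B": "Vocabulary",
    "S": "Syntax",
    "Z": "Orthography",
    "P": "Related corrections",
    "N": "Illegible and dubious examples",
}


def parse_tag(tag: str) -> dict:
    """Split a tag such as 'Č/VOK/odveč' into its three components.

    Hypothetical helper: assumes the LEVEL/CATEGORY/PROBLEM shape
    observed in the table; the category may be empty (e.g. 'N//preveri').
    """
    parts = tag.split("/")
    if len(parts) != 3:
        raise ValueError(f"expected LEVEL/CATEGORY/PROBLEM, got {tag!r}")
    level_code, category_code, problem = parts
    if level_code not in LEVELS:
        raise ValueError(f"unknown level code {level_code!r} in {tag!r}")
    return {
        "level_code": level_code,
        "level": LEVELS[level_code],
        # An empty middle part is reported as None rather than "".
        "category_code": category_code or None,
        "problem": problem,
    }


if __name__ == "__main__":
    for example in ("Č/VOK/odveč", "Z/LOČ/vzorec-vejica-kot", "N//preveri"):
        print(parse_tag(example))
```

Keeping the level names in a single dictionary makes it easy to group corrections by linguistic level, for example when counting how many tags in a corpus sample fall under Syntax versus Orthography.
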
# Annotation Guidelines

This chapter lists the annotation guidelines for the Šolar system of categorizing teachers' corrections in Slovene texts. The versions are ordered from the most recent to the oldest.

**Version 1.2 (22/11/2023) Project [Empirical foundations for digitally-supported development of writing skills](https://www.cjvt.si/prop/en/)**

ARHAR HOLDT, Špela, LAVRIČ, Polona, ROBLEK, Rebeka, GOLI, Teja, BON, Mija, 2023: *Categorizing Teachers’ Corrections: Guidelines for Annotating the Šolar Corpus.* Version 1.2. Prepared in the project Empirical foundations for digitally-supported development of writing skills. [\[DOCX\]](https://wiki.cjvt.si/attachments/51) [\[PDF\]](https://wiki.cjvt.si/attachments/52)

**Version 1.1 (12/8/2022) Project [Development of Slovene in a Digital Environment](https://rsdo.slovenscina.eu/en)**

ARHAR HOLDT, Špela, LAVRIČ, Polona, ROBLEK, Rebeka, GOLI, Teja, 2022: *Categorizing Teachers’ Corrections: Guidelines for Annotating the Šolar Corpus.* Version 1.1. Prepared in the project Development of Slovene in a Digital Environment. [\[DOCX\]](https://wiki.cjvt.si/attachments/36) [\[PDF\]](https://wiki.cjvt.si/attachments/37)

**Version 1.0 (16/12/2018) Project [Upgrade of Šolar Corpus](https://solar.trojina.si/)**

ARHAR HOLDT, Špela, LAVRIČ, Polona, ROBLEK, Rebeka, GOLI, Teja, 2018: *Kategorizacija učiteljskih popravkov: Smernice za označevanje korpusa Šolar 2.0.* Version 1.0. Result of the project Upgrade of Šolar Corpus. [\[PDF\]](https://wiki.cjvt.si/attachments/13) (available only in Slovene)

# References and Links

This chapter compiles relevant references and provides links to the projects in which the Šolar system has been developed and applied to Slovene texts.

**Projects in which the system has been developed:**

[Communication in Slovene](http://ssj.slovenscina.eu/)

[Upgrade of Šolar corpus](https://solar.trojina.si/)

[Development of Slovene in a Digital Environment](https://rsdo.slovenscina.eu/en)

[Empirical foundations for digitally-supported development of writing skills](https://www.cjvt.si/prop/en/)

**Corpora containing manually revised Šolar tags:**

ARHAR HOLDT, Špela, ROZMAN, Tadeja, STRITAR KUČUK, Mojca, KREK, Simon, KRAPŠ VODOPIVEC, Irena, STABEJ, Marko, PORI, Eva, GOLI, Teja, LAVRIČ, Polona, LASKOWSKI, Cyprian Adam, KOCJANČIČ, Polonca, KLEMENC, Bojan, KRSNIK, Luka, KOSEM, Iztok, 2022: *Developmental corpus Šolar 3.0.* Slovenian language resource repository CLARIN.SI, ISSN 2820-4042. [http://hdl.handle.net/11356/1589](http://hdl.handle.net/11356/1589)

**The CJVT Svala tool for manual annotation following the Šolar system:**

ARHAR HOLDT, Špela, KOSEM, Iztok, STRITAR KUČUK, Mojca, KRSNIK, Luka, JOVAN, Leon Noe, 2022: *CJVT Svala* (indicator of the project Development of Slovene in a Digital Environment), v1.0. [https://orodja.cjvt.si/svala/](https://orodja.cjvt.si/svala/) Accessed on 2 March 2023.

**References:**

ARHAR HOLDT, Špela, KOSEM, Iztok, 2023: *Šolar, the developmental corpus of Slovene.* 24 August 2023, preprint (Version 1) available at Research Square. [https://doi.org/10.21203/rs.3.rs-3274669/v1](https://doi.org/10.21203/rs.3.rs-3274669/v1)

ARHAR HOLDT, Špela, KOSEM, Iztok, STRITAR KUČUK, Mojca, 2022: *Metode in orodja za lažjo pripravo korpusov usvajanja jezika.* Nataša Pirih Svetina, Ina Ferbežar (eds.): Na stičišču svetov: slovenščina kot drugi in tuji jezik. Ljubljana: Založba Univerze. Zbirka Obdobja, 41. pp. 23-30. [https://centerslo.si/wp-content/uploads/2022/11/Arhar-Holdt-et-al\_Obdobja-41.pdf](https://centerslo.si/wp-content/uploads/2022/11/Arhar-Holdt-et-al_Obdobja-41.pdf)

ARHAR HOLDT, Špela, KOSEM, Iztok, GANTAR, Polona, 2017: *Corpus-based resources for L1 teaching: the case of Slovene.* Ann Marcus-Quinn, Tríona Hourigan (eds.): Handbook on digital learning for K-12 schools. Cham: Springer. 91–113.

KOSEM, Iztok, ROZMAN, Tadeja, ARHAR HOLDT, Špela, KOCJANČIČ, Polonca, LASKOWSKI, Cyprian Adam, 2016: *Šolar 2.0: nadgradnja korpusa šolskih pisnih izdelkov.* Tomaž Erjavec, Darja Fišer (eds.): Zbornik konference Jezikovne tehnologije in digitalna humanistika. Ljubljana: Znanstvena založba Filozofske fakultete. 95–100. [https://www.sdjt.si/wp/wp-content/uploads/2016/09/JTDH-2016\_Kosem-et-al\_Solar-2-0-nadgradnja-korpusa-solskih-pisnih-izdelkov.pdf](https://www.sdjt.si/wp/wp-content/uploads/2016/09/JTDH-2016_Kosem-et-al_Solar-2-0-nadgradnja-korpusa-solskih-pisnih-izdelkov.pdf)

KOSEM, Iztok, STRITAR KUČUK, Mojca, MOŽE, Sara, ZWITTER VITEZ, Ana, ARHAR HOLDT, Špela, ROZMAN, Tadeja, 2012: *Analiza jezikovnih težav učencev: korpusni pristop.* Ljubljana: Znanstvena založba Filozofske fakultete. [https://e-knjige.ff.uni-lj.si/znanstvena-zalozba/catalog/view/229/329/5311-1](https://e-knjige.ff.uni-lj.si/znanstvena-zalozba/catalog/view/229/329/5311-1)