AGAT-semantics: semantic markup of the Ukrainian Corpus

Authors

  • Nataliia Darchuk Taras Shevchenko National University of Kyiv image/svg+xml
  • Oksana Zuban Taras Shevchenko National University of Kyiv image/svg+xml
  • Marharyta Lanhenbakh Taras Shevchenko National University of Kyiv image/svg+xml
  • Yaryna Khodakivska Taras Shevchenko National University of Kyiv image/svg+xml

DOI:

https://doi.org/10.17721/um/46(2016).92-102

Keywords:

linguistic corpus, semantic markup, taxonomic classification, taxon

Abstract

The article views linguistic aspects of semantic markup of the Ukrainian Corpus as the fourth stage of presenting information about Corpus units. The markup is based on taxonomic classification of the Russian Corpus but with extra modifica- tion. There was developed the software tools for online work based on materials of frequency dictionary of journalistic style with a total volume of 40,000 lexemes compiled from the sampling of 16 Million word forms of Ukrainian texts.

 

Information about the authors:

Natalia Darchuk – Dr Hab, Prof. of the Department of the Modern Ukrainian Language, Institute of Philology, Taras Shevchenko National University of Kyiv (Ukraine).

E-mail: nataliadarchuk@gmail.com

Oksana Zuban – PhD, Associate Prof. of the Department of the Modern Ukrainian Language, Institute of Philology, Taras Shevchenko National University of Kyiv (Ukraine).

E-mail: oxana.mell.zuban@gmail.com

Marharyta Lanhenbakh – PhD, Assist. of the Department of the Modern Ukrainian Language, Institute of Philology, Taras Shevchenko National University of Kyiv (Ukraine).

E-mail: labacompli@gmail.com

Yaryna Khodakivska – Philologist of the Computer Linguistics Laboratory, Institute of Philology, Taras Shevchenko National University of Kyiv (Ukraine).

E-mail: yaryna.yaryna@gmail.com

____________

References

  1. Apresjan Ju.D. (1974) Leksicheskaja semantika: sinonimicheskie sredstva jazyka [Lexical Semantics: Synonymous Language Means]. Moskva: Nauka, 367 p. (in Russian).
  2. Gerd A.S. (2005) Prikladnaja lingvistika [Applied Linguistics]. Sankt- Peterburg : Sankt-Peterburg University Press, 266, [1] p. (in Russian).
  3. Darchuk N. (2013) Kompiuterne anotuvannia ukrainskoho tekstu : rezultaty i perspektyvy [Computer Annotation of Ukrainian Texts: Results and Perspectives]. Kyiv : Osvita Ukrainy, 543 p. (in Ukrainian).
  4. Krasilshhik I.S., Rahilina E.V. (1992) Predmetnye imena v sisteme “Leksikograf” [Subject Names in the “Leksikograf” System]. Scientific and technical informatio, 2. series. Information processes and systems, no 9, pp. 24-31. (in Russian).
  5. Kustova G.I., Ljashevskaja O.N., Paducheva E.V ., Rahilina E.V . (2005) Semanticheskaja razmetka leksiki v nacionalnom korpuse russkogo jazyka: princhipy, problemy, perspektivy [Semantic Markup Vocabulary in the Russian National Corpus: Principe, Problems and Perspectives]. Russian National Corpus: 2003-2005, Moskva: Indrik, pp. 155–174. (in Russian).
  6. Kustova G.I., Paducheva E.V. (1994) Slovar kak leksicheskaja baza dannyh [Dictionary as a Lexical Database]. Problems of linguistics, no4, pp.96-106. (in Russian).
  7. Rahilina E.V., Kustova G.I., Ljashevskaja O.N., Reznikova T.I., Shemanaeva O. Ju. (2009) Zadachi i principy semanticheskoj razmetki leksiki v NKRJa [The Objectives and Principles of Semantic Markup Language in the Russian National Corpus]. Russian National Corpus. New Results and Perspectives. Sankt-Peterburg: NESTOR-ISTORIJa, pp. 215-239. (in Russian).
  8. Sokolovskaja Zh.P. (1990) Problemy sistemnogo opisanija leksicheskoj semantiki [Problems of the Systematic Description of Lexical Semantics]. Kiev: Naukova dumka, 184 p. (in Russian).
  9. Shtern I.B. (1998) Vybrani topiky ta leksykon suchasnoi linhvistyky [Selected topics and vocabulary of modern linguistics: encyclopedic Dictionary]. Kyiv: AtrEk, 1998, 335 p. (in Ukrainian).

References

Published

2016-09-09

Issue

Section

COMPUTER LINGUISTICS

How to Cite

Darchuk, N., Zuban, O., Lanhenbakh, M., & Khodakivska, Y. (2016). AGAT-semantics: semantic markup of the Ukrainian Corpus. Ukrainian Linguistics, 1(46), 92-102. https://doi.org/10.17721/um/46(2016).92-102

Most read articles by the same author(s)