Andrei Kutuzov

Doctoral Research Fellow - Research Group for Language Technology
Image of Andrei Kutuzov
Norwegian version of this page
Mobile phone +4740648218
Username
Visiting address Gaustadalléen 23B Ole-Johan Dahls hus 0373 OSLO
Postal address Postboks 1080 Blindern 0316 OSLO

Academic interests

Computational linguistics and natural language processing, distributional semantics, diachronic word embedding models, machine learning, translation studies, learner corpora.

You may want to have a look at WebVectors, the web service we created to play with neural distributional models for English and Norwegian languages.

Courses taught

Background

I received my Master's degree in Computational Linguistics at National Research University Higher School of Economics (Moscow) in 2014, with a thesis "Semantic clustering of Russian web search results: possibilities and problems".

Full CV

Github page

Below is the list of my selected recent publications. Full list and texts can be found at my Academia page.

Tags: Corpus Linguistics, Word Embeddings, Machine Learning, Computational Linguistics, Natural Language Processing

Publications

  • Kutuzov, Andrei & Kunilovskaya, Maria (2018). Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus. Lecture Notes in Computer Science.  ISSN 0302-9743.  10716 . doi: https://doi.org/10.1007/978-3-319-73013-4_5 Show summary
  • Kunilovskaya, Maria & Kutuzov, Andrei (2017). Testing target text fluency: A machine learning approach to detecting syntactic translationese in English-Russian translation, In  New perspectives on cohesion and coherence: Implications for translation.  Language Science Press.  ISBN 978-3-946234-72-2.  Chapter 5.  s 75 - 103 Show summary
  • Kutuzov, Andrei (2017). Arbitrariness of Linguistic Sign Questioned: Correlation between Word Form and Meaning in Russian. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  1(16), s 109- 120 Full text in Research Archive. Show summary
  • Kutuzov, Andrei; Fares, Murhaf; Oepen, Stephan & Velldal, Erik (2017). Word vectors, reuse, and replicability: Towards a community repository of large-text resources, In Jörg Tiedemann (ed.),  Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa).  Linköping University Electronic Press.  ISBN 978-91-7685-601-7.  chapter.  s 271 - 276 Show summary
  • Kutuzov, Andrei & Kunilovskaya, Maria (2017). Universal Dependencies-based syntactic features in detecting human translation varieties, In Jan Hajič (ed.),  Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories.  Association for Computational Linguistics.  ISBN 978-80-88132-04-2.  chapter.  s 27 - 36 Show summary
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2017). Two centuries in two thousand words: Neural embedding models in detecting diachronic lexical changes, In  Quantitative Approaches to the Russian Language.  Routledge.  ISBN 9781138097155.  chapter. Show summary
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2017). WebVectors: A toolkit for building web interfaces for vector semantic models. Communications in Computer and Information Science.  ISSN 1865-0929.  661, s 155- 161 . doi: 10.1007/978-3-319-52920-2_15 Show summary
  • Kutuzov, Andrei; Kuzmenko, Elizaveta & Pivovarova, Lidia (2017). Clustering of Russian Adjective-Noun Constructions using Word Embeddings, In Lidia Pivovarova; Jakub Piskorski & Tomaž Erjavec (ed.),  Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing.  Association for Computational Linguistics.  ISBN 978-1-945626-45-6.  chapter.  s 3 - 13 Full text in Research Archive. Show summary
  • Kutuzov, Andrei; Velldal, Erik & Øvrelid, Lilja (2017). Temporal dynamics of semantic relations in word embeddings: an application to predicting armed conflict participants, In  Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing.  Association for Computational Linguistics.  ISBN 978-1-945626-83-8.  chapter.  s 1825 - 1830 Full text in Research Archive. Show summary
  • Kutuzov, Andrei; Velldal, Erik & Øvrelid, Lilja (2017). Tracing armed conflicts with diachronic word embedding models, In Tommaso Caselli (ed.),  Proceedings of the Events and Stories in the News Workshop.  Association for Computational Linguistics.  ISBN 978-1-945626-63-0.  Chapter.  s 31 - 36 Full text in Research Archive. Show summary
  • Lison, Pierre & Kutuzov, Andrei (2017). Redefining Context Windows for Word Embedding Models: An Experimental Study, In Jörg Tiedemann (ed.),  Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa).  Linköping University Electronic Press.  ISBN 978-91-7685-601-7.  chapter.  s 284 - 288 Show summary
  • Koslowa, Olga & Kutuzov, Andrei (2016). Improving Distributional Semantic Models Using Anaphora Resolution during Linguistic Preprocessing. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  15, s 288- 299
  • Kutuzov, Andrei; Kopotev, Mikhail; Sviridenko, Tatyana & Ivanova, Lyubov (2016). Clustering Comparable Corpora of Russian and Ukrainian Academic Texts: Word Embeddings and Semantic Fingerprints, In  Proceedings of the Ninth Workshop on Building and Using Comparable Corpora, held at LREC-2016.  European Language Resources Association.  ISBN 978-2-9517408-9-1.  Conference paper.  s 3 - 10
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2016). Cross-lingual Trends Detection for Named Entities in News Texts with Dynamic Neural Embedding Models, In  Proceedings of the First International Workshop on Recent Trends in News Information Retrieval co-located with 38th European Conference on Information Retrieval (ECIR 2016).  Technical University of Aachen.  ISBN 978-3-319-30671-1.  Chapter.  s 27 - 32
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2016). Neural Embedding Language Models in Semantic Clustering of Web Search Results, In  Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016).  European Language Resources Association.  ISBN 978-2-9517408-9-1.  Conference paper.  s 3044 - 3048
  • Kutuzov, Andrei; Kuzmenko, Elizaveta & Marakasova, Anna (2016). Exploration of register-dependent lexical semantics using word embeddings, In  Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH).  Association for Computational Linguistics.  ISBN 978-4-87974-708-2.  Chapter.  s 26 - 34
  • Kutuzov, Andrei; Velldal, Erik & Øvrelid, Lilja (2016). Redefining part-of-speech classes with distributional semantic models, In  Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL).  Association for Computational Linguistics.  ISBN 978-1-945626-19-7.  Chapter.  s 115 - 125 Full text in Research Archive.
  • Kutuzov, Andrei (2015). Semantic Clustering of Russian Web Search Results: Possibilities and Problems, In  Information retrieval.  Springer Publishing Company.  ISBN 978-3-319-25485-2.  Chapter.  s 320 - 331
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2015). Comparing Neural Lexical Models of a Classic National Corpus and a Web Corpus: The Case for Russian, In Alexander Gelbukh (ed.),  Computational Linguistics and Intelligent Text Processing.  Springer Publishing Company.  ISBN 978-3-319-18111-0.  Chapter.  s 47 - 58
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2015). Semi-automated typical error annotation for learner English essays: integrating frameworks, In  Proceedings of the 4th workshop on NLP for Computer Assisted Language Learning at NODALIDA 2015.  Linköping University Electronic Press.  ISBN 978-91-7519-036-5.  Chapter.  s 35 - 41

View all works in Cristin

Published Oct. 14, 2015 6:11 PM - Last modified Jan. 16, 2018 10:28 PM