Andrei Kutuzov

Bilde av Andrei Kutuzov
English version of this page
Mobiltelefon +4740648218
Brukernavn
Besøksadresse Gaustadalléen 23B Ole-Johan Dahls hus 0373 OSLO
Postadresse Postboks 1080 Blindern 0316 OSLO

Faglige interesser

Datalingvistikk og naturlig språk prosessering, fordelings semantikk, word embeddings (inkludert diakron modeller), nevrale nettverk språkmodeller , oversettelsesstudier, elev korpus.

Det kan være lurt å ta en titt på Semantic Vectors, webtjenesten vi skapt til å spille med nevrale fordelingsmodeller for norsk og engelsk tekst.

Bakgrunn

Fikk min mastergrad i datalingvistikk ved National Research University Higher School of Economics (Moskva) i 2014, med en avhandling "Semantic clustering of Russian web search results: possibilities and problems".

Full CV

Nedenfor er en liste over mine utvalgte nyere publikasjoner. Full liste og tekster finner du på min Academia side.

Publikasjoner

  • Fomin, Vadim; Bakshandaeva, Daria; Rodina, Julia & Kutuzov, Andrei (2019). Tracing Cultural Diachronic Semantic Shifts in Russian Using Word Embeddings: Test Sets and Baselines. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  18, s 203- 218 Vis sammendrag
  • Kutuzov, Andrei; Dorgham, Mohammad; Oliynyk, Oleksiy; Biemann, Chris & Panchenko, Alexander (2019). Learning Graph Embeddings from WordNet-based Similarity Measures, In Rada Mihalcea; Ekaterina Shutova; Lun-Wei Ku; Kilian Evang & Soujanya Poria (ed.),  Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019).  Association for Computational Linguistics.  ISBN 978-1-948087-93-3.  conference paper.  s 125 - 135 Vis sammendrag
  • Bakarov, A; Kutuzov, Andrei & Nikishina, I (2018). Russian computational linguistics: Topical structure in 2007-2017 conference papers. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  2018-May(17) Vis sammendrag
  • Kutuzov, Andrei (2018). Russian Word Sense Induction by Clustering Averaged Word Embeddings. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  2018-May(17), s 391- 403 Fulltekst i vitenarkiv. Vis sammendrag
  • Kutuzov, Andrei & Kunilovskaya, Maria (2018). Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus. Lecture Notes in Computer Science.  ISSN 0302-9743.  10716 LNCS, s 47- 58 . doi: 10.1007/978-3-319-73013-4_5 Vis sammendrag
  • Kutuzov, Andrei; Øvrelid, Lilja; Szymanski, Terrence & Velldal, Erik (2018). Diachronic word embeddings and semantic shifts: a survey, In  Proceedings of the 27th International Conference on Computational Linguistics.  Association for Computational Linguistics.  ISBN 978-1-948087-50-6.  conference paper.  s 1384 - 1397 Fulltekst i vitenarkiv. Vis sammendrag
  • Nikishina, Irina; Bakarov, Amir & Kutuzov, Andrei (2018). RusNLP: Semantic search engine for Russian NLP conference papers. Lecture Notes in Computer Science.  ISSN 0302-9743.  11179 LNCS, s 111- 120 . doi: 10.1007/978-3-030-11027-7_11
  • Sadov, Mikhail A. & Kutuzov, Andrei (2018). Use of morphology in distributional word embedding models: Russian language case. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  2018-May(17), s 1- 12
  • Ustalov, Dmitry; Panchenko, Alexander; Kutuzov, Andrei; Biemann, Chris & Ponzetto, Simone (2018). Unsupervised Semantic Frame Induction using Triclustering, In  Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).  Association for Computational Linguistics.  ISBN 978-1-948087-34-6.  conference paper.  s 55 - 62 Fulltekst i vitenarkiv. Vis sammendrag
  • Kunilovskaya, Maria & Kutuzov, Andrei (2017). Testing target text fluency: A machine learning approach to detecting syntactic translationese in English-Russian translation, In  New perspectives on cohesion and coherence: Implications for translation.  Language Science Press.  ISBN 978-3-946234-72-2.  Chapter 5.  s 75 - 103 Vis sammendrag
  • Kunilovskaya, Maria & Kutuzov, Andrei (2017). Universal Dependencies-based syntactic features in detecting human translation varieties, In Jan Hajič (ed.),  Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories.  Association for Computational Linguistics.  ISBN 978-80-88132-04-2.  chapter.  s 27 - 36 Vis sammendrag
  • Kutuzov, Andrei (2017). Arbitrariness of Linguistic Sign Questioned: Correlation between Word Form and Meaning in Russian. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  1(16), s 109- 120 Fulltekst i vitenarkiv. Vis sammendrag
  • Kutuzov, Andrei; Fares, Murhaf; Oepen, Stephan & Velldal, Erik (2017). Word vectors, reuse, and replicability: Towards a community repository of large-text resources, In Jörg Tiedemann (ed.),  Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa).  Linköping University Electronic Press.  ISBN 978-91-7685-601-7.  chapter.  s 271 - 276 Fulltekst i vitenarkiv. Vis sammendrag
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2017). Two centuries in two thousand words: Neural embedding models in detecting diachronic lexical changes, In  Quantitative Approaches to the Russian Language.  Routledge.  ISBN 9781138097155.  chapter. Vis sammendrag
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2017). WebVectors: A toolkit for building web interfaces for vector semantic models. Communications in Computer and Information Science.  ISSN 1865-0929.  661, s 155- 161 . doi: 10.1007/978-3-319-52920-2_15 Vis sammendrag
  • Kutuzov, Andrei; Kuzmenko, Elizaveta & Pivovarova, Lidia (2017). Clustering of Russian Adjective-Noun Constructions using Word Embeddings, In Lidia Pivovarova; Jakub Piskorski & Tomaž Erjavec (ed.),  Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing.  Association for Computational Linguistics.  ISBN 978-1-945626-45-6.  chapter.  s 3 - 13 Fulltekst i vitenarkiv. Vis sammendrag
  • Kutuzov, Andrei; Velldal, Erik & Øvrelid, Lilja (2017). Temporal dynamics of semantic relations in word embeddings: an application to predicting armed conflict participants, In  Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing.  Association for Computational Linguistics.  ISBN 978-1-945626-83-8.  chapter.  s 1825 - 1830 Fulltekst i vitenarkiv. Vis sammendrag
  • Kutuzov, Andrei; Velldal, Erik & Øvrelid, Lilja (2017). Tracing armed conflicts with diachronic word embedding models, In Tommaso Caselli (ed.),  Proceedings of the Events and Stories in the News Workshop.  Association for Computational Linguistics.  ISBN 978-1-945626-63-0.  Chapter.  s 31 - 36 Fulltekst i vitenarkiv. Vis sammendrag
  • Lison, Pierre & Kutuzov, Andrei (2017). Redefining Context Windows for Word Embedding Models: An Experimental Study, In Jörg Tiedemann (ed.),  Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa).  Linköping University Electronic Press.  ISBN 978-91-7685-601-7.  chapter.  s 284 - 288 Fulltekst i vitenarkiv. Vis sammendrag
  • Smirnov, Ivan V.; Kuznetsova, Rita; Kopotev, Mikhail; Khazov, Andrey; Lyashevskaya, Olga; Ivanova, L. & Kutuzov, Andrei (2017). Evaluation tracks on plagiarism detection algorithms for the Russian language. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  1(16), s 271- 283
  • Koslowa, Olga & Kutuzov, Andrei (2016). Improving Distributional Semantic Models Using Anaphora Resolution during Linguistic Preprocessing. Komp'yuternaya Lingvistika i Intellektual'nye Tekhnologii.  ISSN 2221-7932.  15, s 288- 299
  • Kutuzov, Andrei; Kopotev, Mikhail; Sviridenko, Tatyana & Ivanova, Lyubov (2016). Clustering Comparable Corpora of Russian and Ukrainian Academic Texts: Word Embeddings and Semantic Fingerprints, In  Proceedings of the Ninth Workshop on Building and Using Comparable Corpora, held at LREC-2016.  European Language Resources Association.  ISBN 978-2-9517408-9-1.  Conference paper.  s 3 - 10
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2016). Cross-lingual Trends Detection for Named Entities in News Texts with Dynamic Neural Embedding Models, In  Proceedings of the First International Workshop on Recent Trends in News Information Retrieval co-located with 38th European Conference on Information Retrieval (ECIR 2016).  Technical University of Aachen.  ISBN 978-3-319-30671-1.  Chapter.  s 27 - 32
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2016). Neural Embedding Language Models in Semantic Clustering of Web Search Results, In  Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016).  European Language Resources Association.  ISBN 978-2-9517408-9-1.  Conference paper.  s 3044 - 3048
  • Kutuzov, Andrei; Kuzmenko, Elizaveta & Marakasova, Anna (2016). Exploration of register-dependent lexical semantics using word embeddings, In  Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH).  Association for Computational Linguistics.  ISBN 978-4-87974-708-2.  Chapter.  s 26 - 34
  • Kutuzov, Andrei; Velldal, Erik & Øvrelid, Lilja (2016). Redefining part-of-speech classes with distributional semantic models, In  Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL).  Association for Computational Linguistics.  ISBN 978-1-945626-19-7.  Chapter.  s 115 - 125 Fulltekst i vitenarkiv.
  • Kutuzov, Andrei (2015). Semantic Clustering of Russian Web Search Results: Possibilities and Problems, In  Information retrieval.  Springer Publishing Company.  ISBN 978-3-319-25485-2.  Chapter.  s 320 - 331
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2015). Comparing Neural Lexical Models of a Classic National Corpus and a Web Corpus: The Case for Russian, In Alexander Gelbukh (ed.),  Computational Linguistics and Intelligent Text Processing.  Springer Publishing Company.  ISBN 978-3-319-18111-0.  Chapter.  s 47 - 58
  • Kutuzov, Andrei & Kuzmenko, Elizaveta (2015). Semi-automated typical error annotation for learner English essays: integrating frameworks, In  Proceedings of the 4th workshop on NLP for Computer Assisted Language Learning at NODALIDA 2015.  Linköping University Electronic Press.  ISBN 978-91-7519-036-5.  Chapter.  s 35 - 41

Se alle arbeider i Cristin

Publisert 15. juni 2016 14:47 - Sist endret 21. sep. 2016 19:24