LTG research seminar
NLP researchers both from and outside LTG are presenting their findings in an informal environment, followed by questions and discussions.
Language technology group research seminar is a biweekly event. The regular time slot for the seminar is Monday, 12:15-13:15 CEST.
In the Fall of 2021, the seminar is conducted in a hybrid form: both offline at the room 4118 of Ole-Johan Dahls hus, UiO, and online in Zoom (link available by request). With questions or suggestions related to the LTG seminar, please contact Andrey Kutuzov.
October 25, 2021
LTG research seminar has a long history, but this page starts from Fall 2021 only.
October 11, 2021
"Improving Multilingual Lexical Normalization by Fine-tuning ByT5"
David Samuel (LTG)
We present the winning entry to the Multilingual Lexical Normalization (MultiLexNorm) shared task at W-NUT 2021, which evaluates lexical normalization systems on 12 social media datasets in 11 languages.
Our system is based on a pre-trained byte-level language model, ByT5, which we further pre-train on synthetic data and then fine-tune on authentic normalization data. It achieves the best performance by a wide margin in intrinsic evaluation, and also the best performance in extrinsic evaluation through dependency parsing. We release both the source code and the fine-tuned models.
September 27, 2021
Semantics, morphology and syntax are strongly interdependent. However, the majority of computational methods for semantic change detection use distributional word representations which encode mostly semantics. We investigate an alternative method, grammatical profiling, based entirely on changes in the morphosyntactic behaviour of words.
We demonstrate that it can be used for semantic change detection and even outperforms some distributional semantic methods. We present an in-depth qualitative and quantitative analysis of the predictions made by our grammatical profiling system, showing that they are plausible and interpretable.