Samia Touileb

Image of Samia Touileb
Norwegian version of this page
Username
Visiting address Gaustadalléen 23B Ole-Johan Dahls hus 0373 Oslo
Postal address Postboks 1080 Blindern 0316 Oslo
Other affiliations Department for Informatics

I am a Post Doc in the Language Technology Group at the University of Oslo. My main research interests are information extraction, sentiment analysis (collaborating with the SANT project), and applications of Natural Language Processing and machine learning methods to tasks within social science research. I also mainly work on under-resourced languages.  

Teaching

Tags: language technology, Natural Language Processing, Computational Linguistics, Machine Learning

Publications

Upcoming paper presentation at GeBNLP, titled "Using Gender- and Polarity-Informed Models to Investigate Bias" by Samia Touileb, Lilja Øvrelid, and Erik Velldal can be seen here.

  • Barnes, Jeremy; Mæhlum, Petter & Touileb, Samia (2021). NorDial: A Preliminary Corpus of Written Norwegian Dialect Use, In Simon Dobnik & Lilja Øvrelid (ed.),  Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa).  Linköping University Electronic Press.  ISBN 978-91-7929-614-8.  2021.nodalida-main.51.  s 445 - 451 Show summary
  • Adouane, Wafia; Touileb, Samia & Bernardy, Jean-Philippe (2020). Identifying Sentiments in Algerian Code-switched User-generated Comments, In Nicoletta Calzolari; Frédéric Béchet; Philippe Blache; Khalid Choukri; Christopher Cieri; Thierry Declerck; Sara Goggi; Hitoshi Isahara; Bente Maegaard; Joseph Mariani; Hélène Mazo; Asuncion Moreno; Jan Odijk & Stelios Piperidis (ed.),  Proceedings of The 12th Language Resources and Evaluation Conference.  European Language Resources Association.  ISBN 979-10-95546-34-4.  1.328.  s 2698 - 2705
  • Lison, Pierre; Barnes, Jeremy; Hubin, Aliaksandr & Touileb, Samia (2020). Named Entity Recognition without Labelled Data: A Weak Supervision Approach, In Dan Jurafsky; Joyce Chai; Natalie Schluter & Joel Tetreault (ed.),  Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.  Association for Computational Linguistics.  ISBN 978-1-952148-25-5.  139.  s 1518 - 1533 Show summary
  • Touileb, Samia (2020). LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier, In Imed Zitouni; Muhammad Abdul-Mageed; Houda Bouamor; Fethi Bougares; Mahmoud El-Haj; Nadi Tomeh & Wajdi Zaghouani (ed.),  Proceedings of the Fifth Arabic Natural Language Processing Workshop.  Association for Computational Linguistics.  ISBN 978-1-952148-38-5.  2020.wanlp-1.34.  s 313 - 319 Show summary
  • Touileb, Samia; Øvrelid, Lilja & Velldal, Erik (2020). Gender and sentiment, critics and authors: a dataset of Norwegian book reviews, In Marta R. Costa-jussà; Christian Hardmeier; Will Radford & Kellie Webster (ed.),  Proceedings of the Second Workshop on Gender Bias in Natural Language Processing.  Association for Computational Linguistics.  ISBN 978-1-952148-43-9.  2020.gebnlp-1.11.  s 125 - 138 Show summary
  • Barnes, Jeremy Claude; Touileb, Samia; Øvrelid, Lilja & Velldal, Erik (2019). Lexicon information in neural sentiment analysis: a multi-task learning approach, In Mareike Hartmann & Barbara Plank (ed.),  Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa).  Linköping University Electronic Press.  ISBN 978-91-7929-995-8.  Artikkel.  s 175 - 186 Show summary
  • Rodina, Julia; Bakshandaeva, Daria; Fomin, Vadim; Kutuzov, Andrei; Touileb, Samia & Velldal, Erik (2019). Measuring Diachronic Evolution of Evaluative Adjectives with Word Embeddings: the Case for English, Norwegian, and Russian, In Nina Tahmasebi; Lars Borin; Adam Jatowt & Yang Xu (ed.),  Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change.  Association for Computational Linguistics.  ISBN 978-1-950737-31-4.  chapter.  s 202 - 209 Show summary
  • Touileb, Samia; Pedersen, Truls Andre & Sjøvaag, Helle (2018). Automatic identification of unknown names with specific roles, In Beatrice Alex; Stefania Degaetano-Ortlieb; Anna Feldman; Anna Kazantseva; Nils Reiter & Stan Szpakowicz (ed.),  Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature.  Association for Computational Linguistics.  ISBN 978-1-948087-61-2.  150–158.  s 150 - 158
  • Velldal, Erik; Øvrelid, Lilja; Bergem, Eivind Alexander; Stadsnes, Cathrine; Touileb, Samia & Jørgensen, Fredrik (2018). NoReC: The Norwegian Review Corpus, In Nicoletta Calzolari; Khalid Choukri; Christopher Cieri; Thierry Declerck; Sara Goggi; Koiti Hasida; Hitoshi Isahara; Bente Maegaard; Joseph Mariani; Hélène Mazo; Asuncion Moreno; Jan Odijk; Stelios Piperidis & Takenobu Tokunaga (ed.),  Proceedings of the Eleventh International Conference on Language Resources and Evaluation.  European Language Resources Association.  ISBN 979-10-95546-00-9.  Article.  s 4186 - 4191 Show summary
  • Touileb, Samia & Steskal, Lubos (2016). ADIOS LDA: When Grammar Induction Meets Topic Modeling. NIKT: Norsk IKT-konferanse for forskning og utdanning.  ISSN 1892-0713.
  • Salway, Andrew & Touileb, Samia (2014). Applying grammar induction to text mining. Association for Computational Linguistics (ACL). Annual Meeting Conference Proceedings.  ISSN 0736-587X.  s 712- 717 . doi: 10.3115/v1/p14-2116
  • Salway, Andrew; Touileb, Samia & Tvinnereim, Endre (2014). Inducing Information Structures for Data-driven Text Analysis. Association for Computational Linguistics (ACL). Annual Meeting Conference Proceedings.  ISSN 0736-587X. . doi: 10.3115/v1/w14-2510 Show summary
  • Touileb, Samia & Salway, Andrew (2014). Constructions: a new unit of analysis for corpus-based discourse analysis, In Wirote Aroonmanakun; Thepchai Supnithi & Prachya Boonkwan (ed.),  Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation (PACLIC 28).  Chulalongkorn University.  ISBN 978-616-551-887-1.  -.  s 634 - 644

View all works in Cristin

  • Sjøvaag, Helle; Pedersen, Truls Andre & Touileb, Samia (2018). Operationalising Diversity for Big Data Policy Research.
  • Touileb, Samia; Pedersen, Truls Andre & Sjøvaag, Helle (2018). Automatically identifying names of unrecognized politicians.
  • Pedersen, Truls Andre; Touileb, Samia & Sjøvaag, Helle (2017). Finding Voices in the Margins: Computer-Assisted Discovery of Naturally Belonging Names.
  • Touileb, Samia; Elgesem, Dag & Salway, Andrew (2017). Automatically Inducing Information Structures. A Text Mining Approach Based on the Distributional Hypothesis.
  • Touileb, Samia & Duarte, Katherine (2016). Getting to know large newsflows: Automatically induced information structures as keyphrases for news content analysis.
  • Iversen, Magnus Hoem; Pedersen, Truls Andre; Stavelin, Eirik & Touileb, Samia (2015). Computer supported deliberation and argumentation online. Proposing a system for online argumentation..
  • Touileb, Samia & Steskal, Lubos (2015). A computational approach to organize and analyze online communication data.
  • Salway, Andrew; Hofland, Knut & Touileb, Samia (2013). Applying Corpus Techniques to Climate Change Blogs.
  • Touileb, Samia (2013). Inducing local grammars from n-grams.
  • Touileb, Samia; Elgesem, Dag & Steskal, Lubos (2012). Networks of texts and people.

View all works in Cristin

Published June 19, 2017 3:22 PM - Last modified July 10, 2021 8:07 PM