Argument mining for Norwegian
This thesis will focus on argument mining for Norwegian texts.
The goal of argument mining is the automatic extraction and identification of argumentative structures from natural language text, in order to provide structured data for example for reasoning engines. An argument can be defined as a set of statements consisting of three main elements: a set of premises, a conclusion, and an inference that goes from the premises to the conclusion. Conclusions are often seen as claims, while premises are usually defined as the evidences or reasons, whereas the inference is seen as the argument itself (Lippi and Torroni, 2016).
In this project, we will use a corpus of Norwegian reviews. The structure of review documents usually contain evidences and reasons on why a product is good or bad, as well as a conclusion identified as the rating given to the product.
Due to the lack of annotated data, a seemingly unavoidable solution to this task is to employ machine learning techniques capable of dealing with unannotated data. Deep learning techniques seems to be one of the most interesting choices in this direction. It is however, necessary for the evaluation of such a system to have a small annotated test set, which the candidate(s) should develop.
The precise details and scope of the thesis will be further decided in agreement between the supervisors and the candidate(s), and the thesis can be well suited for two students who are comfortable working together.
The project presupposes a good balance of technical and linguistic expertise. Good programming skills, experience with machine learning and a solid background in NLP are relevant qualifications. Please contact the supervisors to discuss further details.
Lippi M., and Torroni P. 2016. Argumentation Mining: State of the Art and Emerging Trends. ACM Trans. Internet Technol. 16, 2. DOI=http://dx.doi.org/10.1145/2850417 available here