Measuring the impact of collocational knowledge on sentence parsing

In this paper we focus on collocations, which have been studied in computational linguistics since they constitute a key factor when processing natural languages. For instance, they usually represent a challenge in automatic translation because the association of two terms is not easily computed. We...

Descripción completa

Detalles Bibliográficos
Autor principal:	Wehrli, Eric
Formato:	Online
Idioma:	spa
Publicado:	Universidad de Costa Rica. Campus Rodrigo Facio. Sitio web: https://www.ucr.ac.cr/ Teléfono: (506) 2511-4000. Correo de soporte: revistas@ucr.ac.cr 2017
Acceso en línea:	https://revistas.ucr.ac.cr/index.php/kanina/article/view/30225

Descripción
Sumario:	In this paper we focus on collocations, which have been studied in computational linguistics since they constitute a key factor when processing natural languages. For instance, they usually represent a challenge in automatic translation because the association of two terms is not easily computed. We proposed that the parser should be provided with a lexical database in order to make more effective the identification of collocations during the parsing process. We assessed this claim by using a corpus of 6’000 sentences retrieved from the British magazine The Economist Espresso. The corpus was parsed twice, first with the collocation detection component turned on and then with it turned off, and to make the comparison the Fips tagger was used. The results showed an improvement of the quality when the parser has access to collocation knowledge.

Measuring the impact of collocational knowledge on sentence parsing

Ejemplares similares