Measuring the impact of collocational knowledge on sentence parsing

In this paper we focus on collocations, which have been studied in computational linguistics because they are a key factor in processing natural languages. For instance, they are often a challenge in automatic translation because the association between two terms is not easily computed. We...


Bibliographic Details
Main Author: Wehrli, Eric
Format: Online
Language: spa
Published: Universidad de Costa Rica. Campus Rodrigo Facio. Sitio web: https://www.ucr.ac.cr/ Teléfono: (506) 2511-4000. Correo de soporte: revistas@ucr.ac.cr 2017
Online Access: https://revistas.ucr.ac.cr/index.php/kanina/article/view/30225
id KANINA30225
record_format ojs
timestamp 2022-05-31T02:51:58Z
keywords collocations; multiword expressions; sentence parsing; computational linguistics; natural language processing
date 2017-08-16
type info:eu-repo/semantics/article; info:eu-repo/semantics/publishedVersion
doi 10.15517/rk.v40i4.30225
source Káñina; Vol. 40 No. 4 (2016): Káñina número extraordinario; 49-58
issn 2215-2636; 0378-0473
fulltext https://revistas.ucr.ac.cr/index.php/kanina/article/view/30225/30205 (application/pdf)
rights Copyright 2017 Káñina
institution Universidad de Costa Rica
collection Káñina
description In this paper we focus on collocations, which have been studied in computational linguistics because they are a key factor in processing natural languages. For instance, they are often a challenge in automatic translation because the association between two terms is not easily computed. We propose that the parser be given access to a lexical database so that collocations can be identified more effectively during parsing. We assessed this claim using a corpus of 6,000 sentences retrieved from the British magazine The Economist Espresso. The corpus was parsed twice, first with the collocation detection component turned on and then with it turned off, and the Fips tagger was used to make the comparison. The results showed an improvement in quality when the parser had access to collocational knowledge.