AMAD

"Archivum Medii Aevi Digitale - Specialized open access repository for research in the middle ages"
 To submission
AMAD BETA logo
Title: Automated Creation of a Medieval Portuguese Partial Treebank
Contributor: The Pennsylvania State University CiteSeerX Archives
Author: Mário Amado Alves
Graça Vicente
Maria Francisca Xavier
J. Gabriel Lopes
Vitor Rocio
Description: The growing trend towards corpus-based linguistics has led researchers to manually annotate large quantities of text. The human effort involved in this task is often enormous, and requires highly specialised linguistically trained manpower. According to our point of view, another approach should be followed, using this highly trained manpower in other activities, more rewarding and creative, in a constructive dialogue among the various kinds of expertise needed for overcoming our ignorance about languages. As an experiment, we used tools and linguistic resources previously built for Contemporary Portuguese for partially automating the process of partial annotation of a Medieval Portuguese corpus. In this paper, we describe the tools used (POS tagger, lexical analyser and partial parser) and demonstrate that the similarities between a language at two different time periods is sufficient for bootstrapping and acquiring lexical knowledge from the partially parsed, automatically annotated corpus.
URI: https://www.amad.org/jspui/handle/123456789/81650
Other Identifier: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.6984
http://treebank.linguist.jussieu.fr/./pdf/12.pdf
AMAD ID: 568186
Appears in Collections:BASE (Bielefeld Academic Search Engine)
General history of Europe


Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.