AMAD

"Archivum Medii Aevi Digitale - Interdisciplinary open-access subject repository and scholarly blog for medieval studies"
Title: Automated Creation of a Medieval Portuguese Partial Treebank
Contributors: The Pennsylvania State University CiteSeerX Archives
Authors: Mário Amado Alves
Graça Vicente
Maria Francisca Xavier
J. Gabriel Lopes
Vitor Rocio
Description: The growing trend towards corpus-based linguistics has led researchers to manually annotate large quantities of text. The human effort involved in this task is often enormous and requires highly specialised, linguistically trained manpower. In our view, another approach should be followed, freeing this highly trained manpower for other, more rewarding and creative activities, in a constructive dialogue among the various kinds of expertise needed to overcome our ignorance about languages. As an experiment, we used tools and linguistic resources previously built for Contemporary Portuguese to partially automate the partial annotation of a Medieval Portuguese corpus. In this paper, we describe the tools used (POS tagger, lexical analyser and partial parser) and demonstrate that the similarities between a language at two different time periods are sufficient for bootstrapping and acquiring lexical knowledge from the partially parsed, automatically annotated corpus.
URI: https://www.amad.org/jspui/handle/123456789/81650
Source: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.6984
http://treebank.linguist.jussieu.fr/./pdf/12.pdf
AMAD ID: 568186
Appears in collections: BASE (Bielefeld Academic Search Engine)
General history of Europe


Files for this resource:
There are no files for this resource.


All resources in this repository are protected by copyright unless otherwise indicated.