AMAD
„Archivum Medii Aevi Digitale - Interdisziplinäres Open-Access-Fachrepositorium und Wissenschaftsblog für Mittelalterforschung‟Zur Einreichung

Titel: | Automated Creation of a Medieval Portuguese Partial Treebank |
Mitwirkende: | The Pennsylvania State University CiteSeerX Archives |
Autor*in: | Mário Amado Alves Graça Vicente Maria Francisca Xavier J. Gabriel Lopes Vitor Rocio |
Beschreibung: | The growing trend towards corpus-based linguistics has led researchers to manually annotate large quantities of text. The human effort involved in this task is often enormous, and requires highly specialised linguistically trained manpower. According to our point of view, another approach should be followed, using this highly trained manpower in other activities, more rewarding and creative, in a constructive dialogue among the various kinds of expertise needed for overcoming our ignorance about languages. As an experiment, we used tools and linguistic resources previously built for Contemporary Portuguese for partially automating the process of partial annotation of a Medieval Portuguese corpus. In this paper, we describe the tools used (POS tagger, lexical analyser and partial parser) and demonstrate that the similarities between a language at two different time periods is sufficient for bootstrapping and acquiring lexical knowledge from the partially parsed, automatically annotated corpus. |
URI: | https://www.amad.org/jspui/handle/123456789/81650 |
Quelle: | http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.6984 http://treebank.linguist.jussieu.fr/./pdf/12.pdf |
AMAD ID: | 568186 |
Enthalten in den Sammlungen: | BASE (Bielefeld Academic Search Engine) General history of Europe |