next up previous
Next: 2 Why this text? Up: Creating a Parallel Corpus Previous: Creating a Parallel Corpus

1 Abstract

This paper reports on a project to annotate biblical texts in order to create an aligned multilingual Bible corpus for linguistic research, particularly computational linguistics, including automatically creating and evaluating translation lexicons and semantically tagged texts. The output of this project will enable researchers to take advantage of parallel translations across a wider number of languages than previously available, providing, with relatively little effort, a corpus that contains careful translations and reliable alignment at the near-sentence level. We discuss the nature of the text, our annotation process, and intended uses for the corpus, and we point out relevant aspects and potential limitations of the current draft of the Corpus Encoding Standard with respect to this corpus.


next up previous
Next: 2 Why this text? Up: Creating a Parallel Corpus Previous: Creating a Parallel Corpus

Philip Resnik
Tue Oct 21 19:23:13 EDT 1997