OLAC Language Resource Catalog

Navigation Aids

OLAC Language Resource Catalog
Search for language resources
 

Main Content

Principles and Practicalities of Corpus Design in Language Retrieval: Issues in the Digitization of the Beynon Corpus of Early Twentieth-Century Sm’algyax Materials
Title:
Principles and Practicalities of Corpus Design in Language Retrieval: Issues in the Digitization of the Beynon Corpus of Early Twentieth-Century Sm’algyax Materials
ID:
Stebbins, Tonya N. and Birgit Hellwig. 2010. Principles and Practicalities of Corpus Design in Language Retrieval: Issues in the Digitization of the Beynon Corpus of Early Twentieth-Century Sm’algyax Materials. Language Documentation & Conservation 4. 34-59.
1934-5275
Link to the object:
Online:
Yes
Archive:
Contributor:
Stebbins, Tonya N. (author)
Hellwig, Birgit (author)
Date:
2010
Publisher:
University of Hawai'i Press
Description:
This paper describes a pilot project to develop a machine-readable corpus of early twentieth-century Sm’algyax texts from a large collection of handwritten manuscripts collected by the Tsimshian ethnographer and chief William Beynon. The project seeks to ensure that the materials produced are maximally accessible to the Tsimshian community. It relates established principles for corpus design to practical issues in language retrieval, recognizing that the corpus will likely function as an intermediate stage between the original manuscripts and any language materials developed by the community. The paper is addressed primarily to linguists working on language retrieval projects but may also be of use to communities who are working with linguists, as it provides insight into the concerns and preoccupations that linguists bring to such tasks.
National Foreign Language Resource Center
Content language:
English
DCMI type:
Text
Other format:
26 pages
Other rights:
Creative Commons Attribution Non-Commercial No Derivatives License
Attribution Non-Commercial No Derivatives
by-nc-nd-nsa
Other subject:
corpus
Sm'algyax
William Beynon
Tsimshian
Other type:
Article
Complete OLAC record:
Link for this page:

Find Related Information:

Archive: Language Documentation and Conservation
Online: Yes
DCMI type: Text
Content language: English
Date: 2000 and later
Date: 2010 - 2019
Contributor: Hellwig, Birgit
Contributor: Stebbins, Tonya N.
Publisher: University of Hawai'i Press
Title: Principles and Practicalities of Corpus Design in Language Retrieval: Issues in the Digitization of the Beynon Corpus of Early Twentieth-Century Sm’algyax Materials
Other format: 26 pages
Other rights: Attribution Non-Commercial No Derivatives
Other rights: Creative Commons Attribution Non-Commercial No Derivatives License
Other rights: by-nc-nd-nsa
Other subject: Sm'algyax
Other subject: Tsimshian
Other subject: William Beynon
Other subject: corpus
Other type: Article