OLAC Language Resource Catalog

Navigation Aids

OLAC Language Resource Catalog
Search for language resources
 

Main Content

OGI Multilanguage Corpus
Title:
OGI Multilanguage Corpus
ID:
LDC94S17
https://catalog.ldc.upenn.edu/LDC94S17
ISBN: 1-58563-035-7
ISLRN: 650-021-622-719-8
Online:
Yes
Archive:
Date:
1994
Publisher:
Linguistic Data Consortium
https://www.ldc.upenn.edu
Description:
The corpus consists of responses to prompts spoken over commercial telephone lines by speakers of English, Farsi (Persian), French, German, Hindi, Japanese, Korean, Mandarin Chinese, Spanish, Tamil and Vietnamese. It contains a total of 1,927 calls, an average of 175 calls per language. Speech was collected using an automated system that answered the telephone, played digitized prompts in the appropriate language to request the speech samples and digitized the callers' responses for a designated period of time. Log files are included that provide a set of automatic measurements made on each utterance. In addition, some utterances were automatically segmented into broad phonetic catagories. The speech data are compressed, with NIST SPHERE headers.
Content language:
Vietnamese
Tamil
Korean
Japanese
Hindi
French
English
German
Spanish
Mandarin Chinese
Persian
Dari
Iranian Persian
Linguistic type:
Primary text
DCMI type:
Sound
Other format:
Sampling Rate: 8000
Sampling Format: 1-channel pcm compressed
Distribution: Web Download
Other language:
Vietnamese
Tamil
Korean
Japanese
Hindi
French
English
German
Spanish
Mandarin Chinese
Persian
Dari
Iranian Persian
Other rights:
Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Complete OLAC record:
Link for this page:

Find Related Information:

Archive: The LDC Corpus Catalog
Online: Yes
Linguistic type: Primary text
DCMI type: Sound
Content language: Dari
Content language: English
Content language: French
Content language: German
Content language: Hindi
Date: 1950 - 1999
Date: 1990 - 1999
Contributor: Cole, Ronald Allan
Contributor: Muthusamy, Yeshwant
Publisher: Linguistic Data Consortium
Publisher: https://www.ldc.upenn.edu
Title: OGI Multilanguage Corpus
Other format: Distribution: Web Download
Other format: Sampling Format: 1-channel pcm compressed
Other format: Sampling Rate: 8000
Other language: Dari
Other language: English
Other language: French
Other language: German
Other language: Hindi
Other rights: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Other rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining