OLAC Language Resource Catalog

Navigation Aids

OLAC Language Resource Catalog
Search for language resources
 

Main Content

CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition
Title:
CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition
ID:
LDC2018S09
https://catalog.ldc.upenn.edu/LDC2018S09
ISBN: 1-58563-851-X
ISLRN: 466-791-939-707-1
Online:
Yes
Archive:
Date:
2018
Publisher:
Linguistic Data Consortium
https://www.ldc.upenn.edu
Description:
*Introduction* CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition was developed by the Linguistic Data Consortium (LDC) and consists of approximately 24 hours of unscripted telephone conversations between native speakers of the Mandarin Chinese dialect spoken in mainland China. This second edition updates the audio files to wav format, simplifies the directory structure and adds documentation and metadata. The first edition is available as CALLFRIEND Mandarin Chinese-Mainland Dialect (LDC96S55). The CALLFRIEND series is a collection of telephone conversations in several languages conducted by LDC in support of language identification technology development. Languages covered in the collection include American English, Canadian French, Egyptian Arabic, Farsi, German, Hindi, Japanese, Korean, Mandarin Chinese, Spanish, Tamil and Vietnamese. *Data* All data was collected before July 1997. Participants could speak with a person of their choice on any topic; most called family members and friends. All calls originated in North America. The recorded conversations last up to 30 minutes. The data was recorded as 8kHz u-law SPH encoded stereo files, with one end of the phone call on each channel. In this release, files were converted to WAV format, and information from the original SPH headers is described in the documentation. SPH files are not included in this second edition. The audio files were originally split into train, dev and test folders of 20 recordings each, but they are combined in this release. Completed calls passed through two human audits. The first audit was conducted to verify that the target language was spoken by the participants and to check the quality of the recordings. The second audit was conducted by a native speaker familiar with Mainland and Taiwanese Mandarin dialects to classify the conversations under one of the two categories. *Samples* Please listen to this sample. *Updates* None at this time.
Content language:
Mandarin Chinese
Linguistic type:
Primary text
DCMI type:
Sound
Other format:
Sampling Rate: 8000
Sampling Format: ulaw
Distribution: Web Download
Other language:
Mandarin Chinese
Other rights:
Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Rights holder: Portions © 1996, 1997, 2018 Trustees of the University of Pennsylvania
Complete OLAC record:
Link for this page:

Find Related Information:

Archive: The LDC Corpus Catalog
Online: Yes
Linguistic type: Primary text
DCMI type: Sound
Content language: Mandarin Chinese
Date: 2000 and later
Date: 2010 - 2019
Contributor: Bartlett, John
Contributor: Canavan, Alexandra
Contributor: Zipperlen, George
Publisher: Linguistic Data Consortium
Publisher: https://www.ldc.upenn.edu
Title: CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition
Other format: Distribution: Web Download
Other format: Sampling Format: ulaw
Other format: Sampling Rate: 8000
Other language: Mandarin Chinese
Other rights: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Other rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Other rights: Rights holder: Portions © 1996, 1997, 2018 Trustees of the University of Pennsylvania