OLAC Language Resource Catalog

Navigation Aids

OLAC Language Resource Catalog
Search for language resources
 

Main Content

Collins Multilingual database (MLD) ? WordBank with audio files
Title:
Collins Multilingual database (MLD) ? WordBank with audio files
ID:
ELRA-S0382
Link to the object:
Online:
Yes
Archive:
Date:
2016-07-13
Publisher:
ELRA (European Language Resources Association)
Description:
Desktop/Microphone
The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the corresponding audio files covering 26 languages of the 32 languages available in the Collins MLD Wordbank: Arabic, Chinese, Croatian, Czech, Danish, Dutch, American English, British English, Finnish, French, German, Greek, Italian, Japanese, Korean, Norwegian, Polish, Portuguese (Iberian), Portuguese (Brazilian), Russian, Spanish (Iberian), Spanish (Latin American), Swedish, Thai, Turkish, Vietnamese. The WordBank contains 10,000 words for each language, XML-annotated for part-of-speech, gender, irregular forms and disambiguating information for homographs. An additional dataset of 10,000 headwords is included for 12 languages (Chinese, American and British English, French, German, Italian, Japanese, Korean, Iberian and Brazilian Portuguese, Iberian and Latin American Spanish). The full database contains 10,000 audio files for each language (26 languages), and 10,000 additional audio files corresponding to the 10,000 additional headwords in 12 languages. Audio was recorded by native speakers.
This multilingual lexicon covers Real Life Daily vocabulary in 26 languages. It contains 10,000 words for each language, XML-annotated for part-of-speech, gender, irregular forms and disambiguating information for homographs, with the corresponding audio files recorded by a native speaker and 10,000 additional headwords with audio for 12 languages.
Content language:
Arabic
Chinese
Czech
Danish
Dutch
English
Finnish
French
German
Modern Greek (1453-)
Italian
Japanese
Korean
Norwegian
Polish
Portuguese
Russian
Spanish
Swedish
Thai
Turkish
Vietnamese
Linguistic type:
Primary text
DCMI type:
Sound
Other language:
Arabic
Chinese
Croatian
Czech
Danish
Dutch, Flemish
English
Finnish
French
German
Greek, Modern (1453-)
Italian
Japanese
Korean
Norwegian
Polish
Portuguese
Russian
Spanish, Castilian
Swedish
Thai
Turkish
Vietnamese
Complete OLAC record:
Link for this page:

Find Related Information:

Archive: ELRA Catalogue of Language Resources
Online: Yes
Linguistic type: Primary text
DCMI type: Sound
Content language: Arabic
Content language: Chinese
Content language: Czech
Content language: Danish
Content language: Dutch
Date: 2000 and later
Date: 2010 - 2019
Publisher: ELRA (European Language Resources Association)
Title: Collins Multilingual database (MLD) ? WordBank with audio files
Other language: Arabic
Other language: Chinese
Other language: Croatian
Other language: Czech
Other language: Danish