OLAC Language Resource Catalog

Navigation Aids

OLAC Language Resource Catalog
Search for language resources
 

Main Content

ECI/MCI (European Corpus Initiative/Multilingual Corpus I)
Title:
ECI/MCI (European Corpus Initiative/Multilingual Corpus I)
ID:
ELRA-W0004
Link to the object:
Online:
Yes
Archive:
Date:
1996-09-01
Publisher:
ELRA (European Language Resources Association)
Description:
Written Corpora
The European Corpus Initiative (ECI) was founded to oversee the acquisition and preparation of a large multilingual corpus, and supports existing and projected national and international efforts to carefully design, collect and publish large-scale multilingual written and spoken corpora. ECI has produced the Multilingual Corpus I (ECI/MCI) of over 98 million words, covering most of the major European languages, as well as Turkish, Japanese, Russian, Chinese, Malay and more. The primary focus in this effort is on textual material of all kinds, including transcriptions of spoken material. Just a sampling of the contents of the CD-ROM: German newspaper texts from the Frankfurter Rundschau from July 1992 -March 1993. provided by Universit?t Gesamthochschule, Paderborn, Germany. Approximately 34 million words. French newspaper texts from Le Monde, consisting of material from September 1989, October 1989, and January 1990. Provided by LIMSI CNRS, France. Approximately 4.1 million words. Extracts from the Leiden Corpus of Dutch, consisting of newspapers, transcribed speech, etc. Provided by Institut voor Nederlandse Lexicologie, Leiden, Holland. Approximately 5.5 million words. International Labor Organisation (ILO) "Official Bulletin, B Series". Vols LXVII(1984) - LXXII(1989). Parallel texts in English, French and Spanish provided by the International Labor Organisation. Approximately 5 million words.
Over 98 million words, covering most of the major European languages, as well as Turkish, Japanese, Russian, Chinese, Malay and more.
Content language:
Turkish
Albanian
Bulgarian
Chinese
Czech
Dutch
English
Estonian
French
Scottish Gaelic
German
Modern Greek (1453-)
Italian
Japanese
Latin
Lithuanian
Malay (macrolanguage)
Spanish
Danish
Russian
Norwegian
Uzbek
Portuguese
Swedish
Linguistic type:
Primary text
DCMI type:
Text
Other format:
Downloadable
Other language:
Turkish
Albanian
Bulgarian
Chinese
Czech
Dutch, Flemish
English
Estonian
French
Gaelic, Scottish Gaelic
German
Greek, Modern (1453-)
Italian
Japanese
Latin
Lithuanian
Malay
Spanish, Castilian
Serbian
Danish
Russian
Norwegian
Uzbek
Portuguese
Swedish
Other rights:
Rights available for: Research Use
Complete OLAC record:
Link for this page:

Find Related Information:

Archive: ELRA Catalogue of Language Resources
Online: Yes
Linguistic type: Primary text
DCMI type: Text
Content language: Albanian
Content language: Bulgarian
Content language: Chinese
Content language: Czech
Content language: Danish
Date: 1950 - 1999
Date: 1990 - 1999
Date: 2000 - 2009
Date: 2000 and later
Date: 2010 - 2019
Publisher: ELRA (European Language Resources Association)
Title: ECI/MCI (European Corpus Initiative/Multilingual Corpus I)
Other format: Downloadable
Other language: Albanian
Other language: Bulgarian
Other language: Chinese
Other language: Czech
Other language: Danish
Other rights: Rights available for: Research Use