BRUSSELS, Jan. 18 (Xinhua) -- The European Commission
has made available for public use of a translation memory containing one million
sentences and their high quality translations in 22 languages.
The sentences or sentence fragments are from European
Union (EU) legal texts. It is possible to find their equivalent sentences in all
other official languages, with Irish as an exception.
The data are useful for the development of machine
translation systems and other linguistic software tools such as grammar and
spelling checkers, online dictionaries and multilingual text classification
systems.
"By this initiative the European Commission intends
to boost human language technologies, support multilingualism and make
computer-assisted translation easier, cheaper and more accessible," said Leonard
Orban, EU Commissioner for Multilingualism, Friday.
"This unique collection of language data contributes
to the creation of a new generation of software tools for human language
processing and helps foster the competitiveness of the language industry," said
Janez Potocnik, the European Research Commissioner.
The EU institutions have more multilingual texts than
any other organizations because of the requirements that EU law exist in each of
its 23 official languages.