Bibliographic Details
| Title: |
Exploiting the LDC Chinese-English Bilingual Wordlist for Cross Language Information Retrieval. |
| Authors: |
Kwok, K. L. |
| Source: |
International Journal of Computer Processing of Oriental Languages. Jun2001, Vol. 14 Issue 2, p173. 19p. |
| Subjects: |
Information retrieval, Polyglot dictionaries, Chinese language, English language |
| Abstract: |
We investigated using the LDC English/Chinese bilingual wordlists for EnglishChinese cross language retrieval. It is shown that the Chinese-to-English wordlist can be considered as both a phrase and word dictionary, and is preferable to the English-to-Chinese version in terms of phrase translation and word translation selection. Additional techniques such as frequency-based term selection, translation set weighting and term co-occurrence data were employed. Experiments show that within the TREC 5&6 Chinese corpus and retrieval environment, 74% of monolingual effectiveness is achievable for short queries of a few English words, and 85% for long queries of paragraph sizes. [ABSTRACT FROM AUTHOR] |
|
Copyright of International Journal of Computer Processing of Oriental Languages is the property of World Scientific Publishing Company and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) |
| Database: |
Engineering Source |