A classification approach for detecting cross-lingual biomedical term translations.
Saved in:
| Title: | A classification approach for detecting cross-lingual biomedical term translations. |
|---|---|
| Authors: | HAKAMI, H.1 hoda.h@tu.edu.sa, BOLLEGALA, D.2 danushka.bollegala@liverpool.ac.uk |
| Source: | Natural Language Engineering. Jan2017, Vol. 23 Issue 1, p31-51. 21p. |
| Subjects: | Medical language, Machine translating, Bilingualism, N-gram models (Computational linguistics), Accuracy of information |
| Abstract: | Finding translations for technical terms is an important problem in machine translation. In particular, in highly specialized domains such as biology or medicine, it is difficult to find bilingual experts to annotate sufficient cross-lingual texts in order to train machine translation systems. Moreover, new terms are constantly being generated in the biomedical community, which makes it difficult to keep the translation dictionaries up to date for all language pairs of interest. Given a biomedical term in one language (source language), we propose a method for detecting its translations in a different language (target language). Specifically, we train a binary classifier to determine whether two biomedical terms written in two languages are translations. Training such a classifier is often complicated due to the lack of common features between the source and target languages. We propose several feature space concatenation methods to successfully overcome this problem. Moreover, we study the effectiveness of contextual and character n-gram features for detecting term translations. Experiments conducted using a standard dataset for biomedical term translation show that the proposed method outperforms several competitive baseline methods in terms of mean average precision and top-k translation accuracy. [ABSTRACT FROM PUBLISHER] |
| Copyright of Natural Language Engineering is the property of Cambridge University Press and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
Be the first to leave a comment!