Dynamic language modeling for European Portuguese

Bibliographic Details
Title: Dynamic language modeling for European Portuguese
Authors: Martins, Ciro; Teixeira, António (ajst@ua.pt); Neto, João
Source: Computer Speech & Language. Oct. 2010, Vol. 24, Issue 4, pp. 750-773. 24 pp.
Subjects: Portuguese language, Dylan (Computer program language), Vocabulary, Speech perception, Language & languages, Syntax (Grammar), Broadcast journalism, Information storage & retrieval systems
Abstract: This paper reports on work on daily vocabulary and language model adaptation for a European Portuguese broadcast news transcription system. The proposed adaptation framework takes into account characteristics of European Portuguese, such as its high level of inflection and complex verbal system. A multi-pass speech recognition framework using contemporary written texts available daily on the Web is proposed. It uses morpho-syntactic knowledge (part-of-speech information) about an in-domain training corpus for the daily selection of an optimal vocabulary. Using an information retrieval engine, with the automatic speech recognition (ASR) hypotheses as query material, relevant documents are extracted from a large, dynamic dataset to generate a story-based language model. When applied to a daily live closed-captioning system for TV broadcasts, the framework was shown to be effective, with relative reductions of 69% in out-of-vocabulary word rate and 12.0% in word error rate (WER) compared with a baseline system using the same vocabulary size. [Copyright Elsevier]
Copyright of Computer Speech & Language is the property of Academic Press Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source