Close this search box.

Miranda J., Neto J.P., Black A.W.

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

pp 1026



In a growing number of applications, such as simultaneous interpretation, audio or text may be available conveying the same information in different languages. Thesedifferent views contain redundant information that can be explored to enhance the performance of speech and language processing applications. We propose a method that directly integrates ASR word graphs or lattices and phrase tables from an SMT system to combine such parallel speech data and improve ASR performance. We apply this technique to speeches from four European Parliament committees and obtain a 16.6% relative improvement (20.8% after a second iteration) in WER, when Portuguese and Spanish interpreted versions are combined with the original English speeches. Our results indicate that further improvements may be possible by including additional languages. Index Terms: multistream combination, speech recognition, machine translation