Loading…

Constructing a speech audio–video corpus by aligning long segments of speech and text

A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for le...

Full description

Saved in:
Bibliographic Details
Published in:Moscow University computational mathematics and cybernetics 2017-04, Vol.41 (2), p.97-103
Main Authors: Karpukhin, I. A., Konushin, A. S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for learning the Russian language.
ISSN:0278-6419
1934-8428
DOI:10.3103/S0278641917020030