Loading…
Constructing a speech audio–video corpus by aligning long segments of speech and text
A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for le...
Saved in:
Published in: | Moscow University computational mathematics and cybernetics 2017-04, Vol.41 (2), p.97-103 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for learning the Russian language. |
---|---|
ISSN: | 0278-6419 1934-8428 |
DOI: | 10.3103/S0278641917020030 |