Live subtitling with speech recognition: causes and consequences of text reduction
Abstract
Speech technology has made it possible to use speech recognition for simultaneous subtitling of live television broadcasts via the technique of respeaking. Despite considerable prior research into the quality of live subtitling with speech recognition, little research has focused on the quantitative aspects of subtitles. Although live subtitles are nearly always a reduced form of the spoken comments, the exact causes of text reduction remain largely unidentified. This study aims at a better understanding of the causes and consequences of text reduction in a live subtitling context. Three excerpts of an infotainment talk show were subtitled by twelve respeakers of the Flemish public television, who were instructed to do so under three different reduction conditions. Various subtitle features, such as reduction percentages and delay, as well as measures of the respeakers’ working memory, were collected. Both a quantitative and a qualitative analysis were carried out. For the quantitative analysis, we opted for a multilevel analysis to take into account the hierarchical nature of the data. In the qualitative analysis, we discuss the effects of commonly used reduction strategies. The results show that reduction is not a random process. On the contrary, it is largely determined by a number of external factors, viz. delay, amount of source text and the proportion of ‘full’ reductions. There is substantial evidence that respeakers prefer to omit certain comments entirely rather than reduce them partially. The decision to fully omit a comment also appears not to be based primarily on the amount of input, whereas the decision to partially reduce is. Differences in working memory capacity do not seem to affect text reduction as such. Finally, the qualitative analysis demonstrated that respeakers use a wide variety of strategies to reduce the spoken comments while limiting the loss of information as much as possible.
Bibliographic Info
Paper provided by University of Antwerp, Faculty of Applied Economics in its series Working Papers with number 2010010.
Length: 36 pages
Date of creation: May 2010
Date of revision:
Contact details of provider:
Postal: Prinsstraat 13, B-2000 Antwerpen
Web page: https://www.uantwerp.be/en/faculties/applied-economic-sciences/
Keywords: Real-time subtitling; Live subtitling; Respeaking; Voice-writing; Speech recognition; Keystroke logging; Reduction
This paper has been announced in the following NEP Reports:
- NEP-ALL-2010-05-22 (All new papers)