Video translation: weaving synthetic voices into the multilingual production workflow
-
PDF
568.505859375Kb
- Author(s): S.A.K. Weber and X. Bai Source: IBC 2016 Conference, 2016 page ()
- Conference: IBC 2016 Conference
- DOI: 10.1049/ibc.2016.0047
- ISBN: 978-1-78561-343-2
- Location: Amsterdam, Netherlands
- Conference date: 8-12 Sept. 2016
- Format: PDF
The production of media content across several languages and platforms is both time consuming and complex. Microphones, sound booths and arrays of editing software are typically required to generate translated audio tracks. This paper presents a one-stop solution to simplifying this workflow. With a particular focus on the translation of audio tracks contained in video files, this paper describes an innovative workflow that leverages commercialised Text-To-Speech voice synthesis and a prototypical system running in production. This workflow bypasses the need for microphones, video or audio editing software and allows a single editor to generate multiple mixed-gender voice-overs. A lightweight markup language is presented which helps editors to fine-tune synthetic voices. The balance between automation and editorial and linguistic quality will be also examined. The majority positive feedback received from journalists and audiences indicates that the prototype and its underlying language technology have the potential to become part of the multilingual video production process.
Inspec keywords: speech synthesis; video signal processing; television production; workflow management software; microphones; audio signal processing
Subjects: Optical, image and video signal processing; Speech processing techniques; Video signal processing; Television and video equipment, systems and applications; Speech recognition and synthesis
Related content
content/conferences/10.1049/ibc.2016.0047
pub_keyword,iet_inspecKeyword,pub_concept
6
6