Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/1656
Title: A comparison of multi-style DNN-based TTS approaches using small datasets
Authors: Suzić, Siniša
Delić, Tijana
Jovanović, Veljko
Sečujski, Milan 
Pekar, Darko 
Delić, Vlado 
Date of issue: 18-Apr-2018
Journal: MATEC Web of Conferences
Abstract: © The Authors, published by EDP Sciences. Studies have shown that people already perceive interaction with computers, robots and media in the same way as they perceive social communication with other people. For that reason, it is critical for a high-quality text-to-speech (TTS) system to sound as human-like as possible. However, a major obstacle in creating expressive TTS voices is that the amount of style-specific speech available for training such a system is often insufficient. This paper presents a comparison between different approaches to multi-style TTS, with a focus on cases where only a small dataset per style is available. The described approaches were originally proposed for efficient modelling of multiple speakers with a limited amount of data per speaker. Among the suggested approaches, the one based on style codes has emerged as the best, regardless of the target speech style.
URI: https://open.uns.ac.rs/handle/123456789/1656
DOI: 10.1051/matecconf/201816103005
Appears in collections: FTN Publikacije/Publications

Scopus citations: 4 (checked 20.11.2023)


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.