Audio samples for paper: Investigation of Effectively Synthesizing Code-switched Speech Using Highly Imbalanced Mix-lingual Data

Authors: Shaotong Guo, Longbiao Wang, Sheng Li, Ju Zhang, Cheng Gong, Yuguang Wang, Jianwu Dang, Kiyoshi Honda
Abstract: End-to-end text-to-speech (TTS) can synthesize monolingual speech with high naturalness and intelligibility. Recently, the end-to-end model has also been used in code-switching (CS) TTS and performs well on naturalness, intelligibility and speaker consistency. However, existing systems rely on skillful bilingual speakers to build a CS mix-lingual data set with a high Language-Mix-Ratio (LMR), while simply mixing monolingual data sets results in accent problems. To reduce the cost of recording and maintain the speaker consistency, in this paper, we investigate an effective method to use a low LMR imbalanced mix-lingual data set. Experiments show that it is possible to construct a CS TTS system with a low LMR imbalanced mix-lingual data set with diverse input text presentations, meanwhile produce acceptable synthetic CS speech with more than 4.0 Mean Opinion Score (MOS). We also find that the result will be improved if the mix-lingual data set is augmented with monolingual English data.

Text Represenations

PY-AP: Tonal pinyin for mandarin and alphabet for English.
PY-UP: Tonal pinyin for mandarin and uppercase for English..
PY-PY: Tonal pinyin for both Mandarin and English.
PY-PH: Tonal pinyin for mandarin and CMU-phonemes for English.
*_AUG: Data augmentation by pure English data.

Result

1. "黎贝卡office小程序。"

PY-AP PY-AP_AUG PY-UP PY-UP_AUG PY-PY PY-PY_AUG PY-PH PY-PH_AUG

2. "橄榄球靠往地上趴,叫Touch down 。"

PY-AP PY-AP_AUG PY-UP PY-UP_AUG PY-PY PY-PY_AUG PY-PH PY-PH_AUG

3. "我身边做 framework 的程序员。"

PY-AP PY-AP_AUG PY-UP PY-UP_AUG PY-PY PY-PY_AUG PY-PH PY-PH_AUG

4. "中国GDP增速在2019年上半年会遇到一定下行压力。"

PY-AP PY-AP_AUG PY-UP PY-UP_AUG PY-PY PY-PY_AUG PY-PH PY-PH_AUG

5. "明天我要take over所有的工作。"

PY-AP PY-AP_AUG PY-UP PY-UP_AUG PY-PY PY-PY_AUG PY-PH PY-PH_AUG

6. "You raise me up太好听了。"

PY-AP PY-AP_AUG PY-UP PY-UP_AUG PY-PY PY-PY_AUG PY-PH PY-PH_AUG