The Centre for Speech Technology Research, The University of Edinburgh

iFlytek Co., Ltd. release of audio recordings for Blizzard 2020

These data are only available to registered participants in the Blizzard Challenge 2020.

You should only request these data after your registration for the challenge has been accepted. Other requests will be ignored.

Speech data

These data are released under a license for non-commercial use only.

Information about the transcriptions

The Mandarin data for the hub task 2020-MH1 is provided with text transcriptions only.

The Shanghainese data for the spoke task 2020-SS1 is provided with both text and phonemic transcriptions. Neither is time-aligned.

Downloads

The following files are available to download after completing the corresponding license. Once we have received your Blizzard Challenge registration and your license, we will email you a password. All requests are manually checked. The MD5 checksums for the files are as follows:
8d40ac468b22e8abbd8f45a78f585770 mandarin_blizzard_release_2020_v1.zip
71caf50b612042a833650bb657c1f08e shanghainese_blizzard_release_2020_v1.zip
The sizes of the files are:
mandarin_blizzard_release_2020_v1.zip 2.4G
shanghainese_blizzard_release_2020_v1.zip 289M
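
The checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal Python sketch, not an official tool: it assumes the zip files have been saved under their original names in the current directory, and the function names md5_of_file and verify_downloads are illustrative only.

```python
import hashlib
from pathlib import Path

# Checksums copied from the list published on this page.
EXPECTED_MD5 = {
    "mandarin_blizzard_release_2020_v1.zip": "8d40ac468b22e8abbd8f45a78f585770",
    "shanghainese_blizzard_release_2020_v1.zip": "71caf50b612042a833650bb657c1f08e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a multi-gigabyte archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True if it exists in `directory`
    (assumed download location) and its MD5 digest matches, else False."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results


if __name__ == "__main__":
    for name, ok in verify_downloads().items():
        print(f"{name}: {'OK' if ok else 'MISSING OR MISMATCH'}")
```

If a file is reported as a mismatch, downloading it again is usually the first thing to try before getting in touch.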


Contact Simon King for more details.