[1]
|
Srikanth Ronanki, Oliver Watts, and Simon King.
A Hierarchical Encoder-Decoder Model for Statistical Parametric
Speech Synthesis.
In Proc. Interspeech 2017, August 2017.
[ bib |
.pdf |
Abstract ]
|
[2]
|
Felipe Espic, Cassia Valentini-Botinhao, and Simon King.
Direct modelling of magnitude and phase spectra for statistical
parametric speech synthesis.
In Proc. Interspeech, Stochohlm, Sweden, August 2017.
[ bib |
.PDF |
Abstract ]
|
[3]
|
Joseph Mendelson, Pilar Oplustil, Oliver Watts, and Simon King.
Nativization of foreign names in tts for automatic reading of world
news in swahili.
In Interspeech 2017, May 2017.
[ bib |
.pdf |
Abstract ]
|
[4]
|
Srikanth Ronanki, Oliver Watts, Simon King, and Gustav Eje Henter.
Median-Based Generation of Synthetic Speech Durations using a
Non-Parametric Approach.
In Proc. IEEE Workshop on Spoken Language Technology (SLT),
December 2016.
[ bib |
.pdf |
Abstract ]
|
[5]
|
Srikanth Ronanki, Siva Reddy, Bajibabu Bollepalli, and Simon King.
DNN-based Speech Synthesis for Indian Languages from ASCII text.
In Proc. 9th ISCA Speech Synthesis Workshop (SSW9), Sunnyvale,
CA, USA, September 2016.
[ bib |
.pdf |
Abstract ]
|
[6]
|
Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, and Simon King.
A template-based approach for speech synthesis intonation generation
using LSTMs.
In Proc. Interspeech, San Francisco, USA, September 2016.
[ bib |
.pdf |
Abstract ]
|
[7]
|
Srikanth Ronanki, Zhizheng Wu, Oliver Watts, and Simon King.
A Demonstration of the Merlin Open Source Neural Network Speech
Synthesis System.
In Proc. Speech Synthesis Workshop (SSW9), September 2016.
[ bib |
.pdf |
Abstract ]
|
[8]
|
Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, and Simon King.
Waveform generation based on signal reshaping for statistical
parametric speech synthesis.
In Proc. Interspeech, pages 2263-2267, San Francisco, CA, USA,
September 2016.
[ bib |
.PDF |
Abstract ]
|
[9]
|
Zhizheng Wu, Oliver Watts, and Simon King.
Merlin: An open source neural network speech synthesis system.
In 9th ISCA Speech Synthesis Workshop (2016), pages 218-223,
September 2016.
[ bib |
.pdf |
Abstract ]
|
[10]
|
Korin Richmond and Simon King.
Smooth talking: Articulatory join costs for unit selection.
In Proc. IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP), pages 5150-5154, March 2016.
[ bib |
.pdf |
Abstract ]
|
[11]
|
Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu,
and Simon King.
Robust TTS duration modelling using DNNs.
In Proc. ICASSP, volume 41, pages 5130-5134, Shanghai, China,
March 2016.
[ bib |
http |
.pdf |
Abstract ]
|
[12]
|
Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, and Simon King.
From HMMs to DNNs: where do the improvements come from?
In Proc. ICASSP, volume 41, pages 5505-5509, Shanghai, China,
March 2016.
[ bib |
http |
.pdf |
Abstract ]
|
[13]
|
Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts,
Rob Clark, and Simon King.
ALISA: An automatic lightly supervised speech segmentation and
alignment tool.
Computer Speech and Language, 35:116-133, 2016.
[ bib |
DOI |
http |
.pdf |
Abstract ]
|
[14]
|
Thomas Merritt, Robert A J Clark, Zhizheng Wu, Junichi Yamagishi, and Simon
King.
Deep neural network-guided unit selection synthesis.
In Proc. ICASSP, 2016.
[ bib |
.pdf |
Abstract ]
|
[15]
|
Lau Chee Yong, Oliver Watts, and Simon King.
Combining lightly-supervised learning and user feedback to construct
and improve a statistical parametric speech synthesizer for malay.
Research Journal of Applied Sciences, Engineering and
Technology, 11(11):1227-1232, December 2015.
[ bib |
.pdf |
Abstract ]
|
[16]
|
C. Valentini-Botinhao, Z. Wu, and S. King.
Towards minimum perceptual error training for DNN-based speech
synthesis.
In Proc. Interspeech, Dresden, Germany, September 2015.
[ bib |
.pdf |
Abstract ]
|
[17]
|
Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, and Simon King.
Deep neural network context embeddings for model selection in
rich-context HMM synthesis.
In Proc. Interspeech, Dresden, September 2015.
[ bib |
.pdf |
Abstract ]
|
[18]
|
Oliver Watts, Zhizheng Wu, and Simon King.
Sentence-level control vectors for deep neural network speech
synthesis.
In INTERSPEECH 2015 16th Annual Conference of the International
Speech Communication Association, pages 2217-2221. International Speech
Communication Association, September 2015.
[ bib |
.pdf |
Abstract ]
|
[19]
|
Marcus Tomalin, Mirjam Wester, Rasmus Dall, Bill Byrne, and Simon King.
A lattice-based approach to automatic filled pause insertion.
In Proc. DiSS 2015, Edinburgh, August 2015.
[ bib |
.pdf |
Abstract ]
|
[20]
|
Z. Wu, C. Valentini-Botinhao, O. Watts, and S. King.
Deep neural networks employing multi-task learning and stacked
bottleneck features for speech synthesis.
In Proc. ICASSP, pages 4460-4464, Brisbane, Australia, April
2015.
[ bib |
.pdf |
Abstract ]
|
[21]
|
Thomas Merritt, Javier Latorre, and Simon King.
Attributing modelling errors in HMM synthesis by stepping gradually
from natural to modelled speech.
In Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP), pages 4220-4224,
Brisbane, April 2015.
[ bib |
.pdf |
Abstract ]
|
[22]
|
Zhizheng Wu and Simon King.
Minimum trajectory error training for deep neural networks, combined
with stacked bottleneck features.
In Interspeech, 2015.
[ bib |
.pdf ]
|
[23]
|
Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, and Simon
King.
A study of speaker adaptation for DNN-based speech synthesis.
In Interspeech, 2015.
[ bib |
.pdf ]
|
[24]
|
Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, and Simon King.
Deep neural network employing multi-task learning and stacked
bottleneck features for speech synthesis.
In Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP), 2015.
[ bib |
.pdf ]
|
[25]
|
Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito,
Tomoki Toda, and Simon King.
SAS: A speaker verification spoofing database containing diverse
attacks.
In Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP), 2015.
[ bib |
.pdf ]
|
[26]
|
Victor Poblete, Felipe Espic, Simon King, Richard M. Stern, Fernando Huenupan,
Josue Fredes, and Nestor Becerra Yoma.
A perceptually-motivated low-complexity instantaneous linear channel
normalization technique applied to speaker verification.
Computer Speech & Language, 31(1):1 - 27, 2015.
[ bib |
DOI |
http |
.pdf |
Abstract ]
|
[27]
|
Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King.
Intelligibility enhancement of speech in noise.
In Proceedings of the Institute of Acoustics, volume 36 Pt. 2,
pages 96-103, Birmingham, UK, October 2014.
[ bib |
.pdf |
Abstract ]
|
[28]
|
Thomas Merritt, Tuomo Raitio, and Simon King.
Investigating source and filter contributions, and their interaction,
to statistical parametric speech synthesis.
In Proc. Interspeech, pages 1509-1513, Singapore, September
2014.
[ bib |
.pdf |
Abstract ]
|
[29]
|
Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, and Simon
King.
Measuring the perceptual effects of modelling assumptions in speech
synthesis using stimuli constructed from repeated natural speech.
In Proc. Interspeech, volume 15, pages 1504-1508, September
2014.
[ bib |
.pdf |
Abstract ]
|
[30]
|
Oliver Watts, Siva Gangireddy, Junichi Yamagishi, Simon King, Steve Renals,
Adriana Stan, and Mircea Giurgiu.
Neural net word representations for phrase-break prediction without a
part of speech tagger.
In Proc. ICASSP, pages 2618-2622, Florence, Italy, May 2014.
[ bib |
.pdf |
Abstract ]
|
[31]
|
Rasmus Dall, Junichi Yamagishi, and Simon King.
Rating naturalness in speech synthesis: The effect of style and
expectation.
In Proc. Speech Prosody, May 2014.
[ bib |
.pdf |
Abstract ]
|
[32]
|
C. Valentini-Botinhao, J. Yamagishi, S. King, and R. Maia.
Intelligibility enhancement of HMM-generated speech in additive
noise by modifying mel cepstral coefficients to increase the glimpse
proportion.
Computer Speech and Language, 28(2):665-686, 2014.
[ bib |
DOI |
.pdf |
Abstract ]
|
[33]
|
Moses Ekpenyong, Eno-Abasi Urua, Oliver Watts, Simon King, and Junichi
Yamagishi.
Statistical parametric speech synthesis for Ibibio.
Speech Communication, 56:243-251, January 2014.
[ bib |
DOI |
http |
.pdf |
Abstract ]
|
[34]
|
P. Lanchantin, M. J. F. Gales, S. King, and J. Yamagishi.
Multiple-average-voice-based speech synthesis.
In Proc. ICASSP, 2014.
[ bib |
Abstract ]
|
[35]
|
Rasmus Dall, Marcus Tomalin, Mirjam Wester, William Byrne, and Simon King.
Investigating automatic & human filled pause insertion for speech
synthesis.
In Proc. Interspeech, 2014.
[ bib |
.pdf |
Abstract ]
|
[36]
|
Herman Kamper, Aren Jansen, Simon King, and S. J. Goldwater.
Unsupervised lexical clustering of speech segments using
fixed-dimensional acoustic embeddings.
In Proc. SLT, 2014.
[ bib |
.pdf |
Abstract ]
|
[37]
|
C. Valentini-Botinhao, J. Yamagishi, S. King, and Y. Stylianou.
Combining perceptually-motivated spectral shaping with loudness and
duration modification for intelligibility enhancement of HMM-based synthetic
speech in noise.
In Proc. Interspeech, Lyon, France, August 2013.
[ bib |
.pdf ]
|
[38]
|
Cassia Valentini-Botinhao, Mirjam Wester, Junichi Yamagishi, and Simon King.
Using neighbourhood density and selective SNR boosting to increase
the intelligibility of synthetic speech in noise.
In 8th ISCA Workshop on Speech Synthesis, pages 133-138,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[39]
|
Thomas Merritt and Simon King.
Investigating the shortcomings of HMM synthesis.
In 8th ISCA Workshop on Speech Synthesis, pages 185-190,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[40]
|
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua
Ling, Simon King, and Thierry Dutoit.
Mage - reactive articulatory feature control of HMM-based
parametric speech synthesis.
In 8th ISCA Workshop on Speech Synthesis, pages 227-231,
Barcelona, Spain, August 2013.
[ bib |
.pdf ]
|
[41]
|
Adriana Stan, Peter Bell, Junichi Yamagishi, and Simon King.
Lightly supervised discriminative training of grapheme models for
improved sentence-level alignment of speech and text data.
In Proc. Interspeech, Lyon, France, August 2013.
[ bib |
.pdf |
Abstract ]
|
[42]
|
H. Christensen, M. Aniol, P. Bell, P. Green, T. Hain, S. King, and
P. Swietojanski.
Combining in-domain and out-of-domain speech data for automatic
recognition of disordered speech.
In Proc. Interspeech, Lyon, France, August 2013.
[ bib |
.pdf |
Abstract ]
|
[43]
|
Kayoko Yanagisawa, Javier Latorre, Vincent Wan, Mark J. F. Gales, and Simon
King.
Noise robustness in HMM-TTS speaker adaptation.
In 8th ISCA Workshop on Speech Synthesis, pages 139-144,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[44]
|
Rubén San-Segundo, Juan Manuel Montero, Mircea Giurgiu, Ioana Muresan, and
Simon King.
Multilingual number transcription for text-to-speech conversion.
In 8th ISCA Workshop on Speech Synthesis, pages 85-89,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[45]
|
Heng Lu, Simon King, and Oliver Watts.
Combining a vector space representation of linguistic context with a
deep neural network for text-to-speech synthesis.
In 8th ISCA Workshop on Speech Synthesis, pages 281-285,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[46]
|
Yoshitaka Mamiya, Adriana Stan, Junichi Yamagishi, Peter Bell, Oliver Watts,
Robert Clark, and Simon King.
Using adaptation to improve speech transcription alignment in noisy
and reverberant environments.
In 8th ISCA Workshop on Speech Synthesis, pages 61-66,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[47]
|
Oliver Watts, Adriana Stan, Rob Clark, Yoshitaka Mamiya, Mircea Giurgiu,
Junichi Yamagishi, and Simon King.
Unsupervised and lightly-supervised learning for rapid construction
of TTS systems in multiple languages from 'found' data: evaluation and
analysis.
In 8th ISCA Workshop on Speech Synthesis, pages 121-126,
Barcelona, Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[48]
|
Adriana Stan, Oliver Watts, Yoshitaka Mamiya, Mircea Giurgiu, Rob Clark,
Junichi Yamagishi, and Simon King.
TUNDRA: A Multilingual Corpus of Found Data for TTS Research Created
with Light Supervision.
In Proc. Interspeech, Lyon, France, August 2013.
[ bib |
.pdf |
Abstract ]
|
[49]
|
James Scobbie, Alice Turk, Christian Geng, Simon King, Robin Lickley, and Korin
Richmond.
The Edinburgh speech production facility DoubleTalk corpus.
In Proc. Interspeech, Lyon, France, August 2013.
[ bib |
.pdf |
Abstract ]
|
[50]
|
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua
Ling, Simon King, and Thierry Dutoit.
Mage - HMM-based speech synthesis reactively controlled by the
articulators.
In 8th ISCA Workshop on Speech Synthesis, page 243, Barcelona,
Spain, August 2013.
[ bib |
.pdf |
Abstract ]
|
[51]
|
Chee-Ming Ting, Simon King, Sh-Hussain Salleh, and A. K. Ariff.
Discriminative tandem features for HMM-based EEG classification.
In Proc. 35th Annual International Conference of the IEEE
Engineering in Medicine and Biology Society (EMBC 13), Osaka, Japan, July
2013.
[ bib |
.pdf |
Abstract ]
|
[52]
|
C. Valentini-Botinhao, E. Godoy, Y. Stylianou, B. Sauert, S. King, and
J. Yamagishi.
Improving intelligibility in noise of HMM-generated speech via
noise-dependent and -independent methods.
In Proc. ICASSP, Vancouver, Canada, May 2013.
[ bib |
.pdf ]
|
[53]
|
H. Lu and S. King.
Factorized context modelling for text-to-speech synthesis.
In Proc. ICASSP, Vancouver, Canada, May 2013.
[ bib |
.pdf |
Abstract ]
|
[54]
|
Mark Sinclair and Simon King.
Where are the challenges in speaker diarization?
In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE
International Conference on, Vancouver, British Columbia, USA, May 2013.
[ bib |
.pdf |
Abstract ]
|
[55]
|
John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro
Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu
Hirsimäki, Reima Karhila, and Mikko Kurimo.
Personalising speech-to-speech translation: Unsupervised
cross-lingual speaker adaptation for HMM-based speech synthesis.
Computer Speech and Language, 27(2):420-437, February 2013.
[ bib |
DOI |
http |
Abstract ]
|
[56]
|
Javier Tejedor, Doroteo T. Toledano, Dong Wang, Simon King, and Jose Colas.
Feature analysis for discriminative confidence estimation in spoken
term detection.
Computer Speech and Language, To appear, 2013.
[ bib |
.pdf |
Abstract ]
|
[57]
|
P. Lal and S. King.
Cross-lingual automatic speech recognition using tandem features.
IEEE Transactions on Audio, Speech, and Language Processing, To
appear, 2013.
[ bib |
DOI |
.pdf |
Abstract ]
|
[58]
|
Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A.J. Clark, Simon
King, and Adriana Stan.
Lightly supervised gmm vad to use audiobook for speech synthesiser.
In Proc. ICASSP, 2013.
[ bib |
.pdf |
Abstract ]
|
[59]
|
Christian Geng, Alice Turk, James M. Scobbie, Cedric Macmartin, Philip Hoole,
Korin Richmond, Alan Wrench, Marianne Pouplier, Ellen Gurman Bard, Ziggy
Campbell, Catherine Dickie, Eddie Dubourg, William Hardcastle, Evia Kainada,
Simon King, Robin Lickley, Satsuki Nakai, Steve Renals, Kevin White, and
Ronny Wiegand.
Recording speech articulation in dialogue: Evaluating a synchronized
double electromagnetic articulography setup.
Journal of Phonetics, 41(6):421 - 431, 2013.
[ bib |
DOI |
http |
.pdf |
Abstract ]
|
[60]
|
Adriana Stan, Peter Bell, and Simon King.
A grapheme-based method for automatic alignment of speech and text
data.
In Proc. IEEE Workshop on Spoken Language Technology, Miami,
Florida, USA, December 2012.
[ bib |
.pdf |
Abstract ]
|
[61]
|
Heng Lu and Simon King.
Using Bayesian networks to find relevant context features for
HMM-based speech synthesis.
In Proc. Interspeech, Portland, Oregon, USA, September 2012.
[ bib |
.pdf |
Abstract ]
|
[62]
|
Rasmus Dall, Christophe Veaux, Junichi Yamagishi, and Simon King.
Analysis of speaker clustering techniques for HMM-based speech
synthesis.
In Proc. Interspeech, September 2012.
[ bib |
.pdf |
Abstract ]
|
[63]
|
C. Valentini-Botinhao, J. Yamagishi, and S. King.
Evaluating speech intelligibility enhancement for HMM-based
synthetic speech in noise.
In Proc. Sapa Workshop, Portland, USA, September 2012.
[ bib |
.pdf |
Abstract ]
|
[64]
|
Ruben San-Segundo, Juan M. Montero, Veronica Lopez-Luden, and Simon King.
Detecting acronyms from capital letter sequences in spanish.
In Proc. Interspeech, Portland, Oregon, USA, September 2012.
[ bib |
.pdf |
Abstract ]
|
[65]
|
C. Valentini-Botinhao, J. Yamagishi, and S. King.
Mel cepstral coefficient modification based on the Glimpse
Proportion measure for improving the intelligibility of HMM-generated
synthetic speech in noise.
In Proc. Interspeech, Portland, USA, September 2012.
[ bib |
Abstract ]
|
[66]
|
C. Valentini-Botinhao, J. Yamagishi, and S. King.
Using an intelligibility measure to create noise robust cepstral
coefficients for HMM-based speech synthesis.
In Proc. LISTA Workshop, Edinburgh, UK, May 2012.
[ bib |
.pdf ]
|
[67]
|
C. Valentini-Botinhao, R. Maia, J. Yamagishi, S. King, and H. Zen.
Cepstral analysis based on the Glimpse proportion measure for
improving the intelligibility of HMM-based synthetic speech in noise.
In Proc. ICASSP, pages 3997-4000, Kyoto, Japan, March 2012.
[ bib |
DOI |
.pdf |
Abstract ]
|
[68]
|
Chen-Yu Yang, G. Brown, Liang Lu, J. Yamagishi, and S. King.
Noise-robust whispered speech recognition using a non-audible-murmur
microphone with vts compensation.
In Chinese Spoken Language Processing (ISCSLP), 2012 8th
International Symposium on, pages 220-223, 2012.
[ bib |
DOI |
Abstract ]
|
[69]
|
Jaime Lorenzo-Trueba, Oliver Watts, Roberto Barra-Chicote, Junichi Yamagishi,
Simon King, and Juan M Montero.
Simple4all proposals for the albayzin evaluations in speech
synthesis.
In Proc. Iberspeech 2012, 2012.
[ bib |
.pdf |
Abstract ]
|
[70]
|
Dong Wang, Javier Tejedor, Simon King, and Joe Frankel.
Term-dependent confidence normalization for out-of-vocabulary spoken
term detection.
Journal of Computer Science and Technology, 27(2), 2012.
[ bib |
DOI |
Abstract ]
|
[71]
|
Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, and Keiichi
Tokuda.
Analysis of unsupervised cross-lingual speaker adaptation for
HMM-based speech synthesis using KLD-based transform mapping.
Speech Communication, 54(6):703-714, 2012.
[ bib |
DOI |
http |
Abstract ]
|
[72]
|
Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, and Keiichi
Tokuda.
Impacts of machine translation and speech synthesis on
speech-to-speech translation.
Speech Communication, 54(7):857-866, 2012.
[ bib |
DOI |
http |
Abstract ]
|
[73]
|
Junichi Yamagishi, Christophe Veaux, Simon King, and Steve Renals.
Speech synthesis technologies for individuals with vocal
disabilities: Voice banking and reconstruction.
Acoustical Science and Technology, 33(1):1-5, 2012.
[ bib |
DOI |
http |
.pdf |
Abstract ]
|
[74]
|
Oliver Watts, Junichi Yamagishi, and Simon King.
Unsupervised continuous-valued word features for phrase-break
prediction without a part-of-speech tagger.
In Proc. Interspeech, pages 2157-2160, Florence, Italy, August
2011.
[ bib |
.pdf |
Abstract ]
|
[75]
|
Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King.
Can objective measures predict the intelligibility of modified
HMM-based synthetic speech in noise?
In Proc. Interspeech, August 2011.
[ bib |
.pdf |
Abstract ]
|
[76]
|
Korin Richmond, Phil Hoole, and Simon King.
Announcing the electromagnetic articulography (day 1) subset of the
mngu0 articulatory corpus.
In Proc. Interspeech, pages 1505-1508, Florence, Italy, August
2011.
[ bib |
.pdf |
Abstract ]
|
[77]
|
Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, and
Li-Rong Dai.
Formant-controlled HMM-based speech synthesis.
In Proc. Interspeech, pages 2777-2780, Florence, Italy, August
2011.
[ bib |
.pdf |
Abstract ]
|
[78]
|
S. Andraszewicz, J. Yamagishi, and S. King.
Vocal attractiveness of statistical speech synthesisers.
In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE
International Conference on, pages 5368-5371, May 2011.
[ bib |
DOI |
Abstract ]
|
[79]
|
Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King.
Evaluation of objective measures for intelligibility prediction of
HMM-based synthetic speech in noise.
In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE
International Conference on, pages 5112-5115, May 2011.
[ bib |
DOI |
.pdf |
Abstract ]
|
[80]
|
K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda.
An analysis of machine translation and speech synthesis in
speech-to-speech translation system.
In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE
International Conference on, pages 5108-5111, May 2011.
[ bib |
DOI |
Abstract ]
|
[81]
|
Dong Wang, Nicholas Evans, Raphael Troncy, and Simon King.
Handling overlaps in spoken term detection.
In Proc. International Conference on Acoustics, Speech and
Signal Processing, pages 5656-5659, May 2011.
[ bib |
DOI |
.pdf |
Abstract ]
|
[82]
|
Dong Wang and Simon King.
Letter-to-sound pronunciation prediction using conditional random
fields.
IEEE Signal Processing Letters, 18(2):122-125, February 2011.
[ bib |
DOI |
.pdf |
Abstract ]
|
[83]
|
J. Dines, J. Yamagishi, and S. King.
Measuring the gap between HMM-based ASR and TTS.
IEEE Selected Topics in Signal Processing, 2011.
(in press).
[ bib |
DOI |
Abstract ]
|
[84]
|
Adriana Stan, Junichi Yamagishi, Simon King, and Matthew Aylett.
The Romanian speech synthesis (RSS) corpus: Building a high
quality HMM-based speech synthesis system using a high sampling rate.
Speech Communication, 53(3):442-450, 2011.
[ bib |
DOI |
http |
Abstract ]
|
[85]
|
C. Mayo, R. A. J. Clark, and S. King.
Listeners' weighting of acoustic cues to synthetic speech
naturalness: A multidimensional scaling analysis.
Speech Communication, 53(3):311-326, 2011.
[ bib |
DOI |
Abstract ]
|
[86]
|
Dong Wang, Simon King, Nick Evans, and Raphael Troncy.
Direct posterior confidence for out-of-vocabulary spoken term
detection.
In Proc. ACM Multimedia 2010 Searching Spontaneous
Conversational Speech Workshop, October 2010.
[ bib |
DOI |
.pdf |
Abstract ]
|
[87]
|
Dong Wang, Simon King, Nick Evans, and Raphael Troncy.
CRF-based stochastic pronunciation modelling for out-of-vocabulary
spoken term detection.
In Proc. Interspeech, Makuhari, Chiba, Japan, September 2010.
[ bib |
Abstract ]
|
[88]
|
Oliver Watts, Junichi Yamagishi, and Simon King.
The role of higher-level linguistic features in HMM-based speech
synthesis.
In Proc. Interspeech, pages 841-844, Makuhari, Japan,
September 2010.
[ bib |
.pdf |
Abstract ]
|
[89]
|
Junichi Yamagishi, Oliver Watts, Simon King, and Bela Usabaev.
Roles of the average voice in speaker-adaptive HMM-based speech
synthesis.
In Proc. Interspeech, pages 418-421, Makuhari, Japan,
September 2010.
[ bib |
.pdf |
Abstract ]
|
[90]
|
Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi
Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong
Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka
Shiota, Jilei Tian, Keiichi Tokuda, and Junichi Yamagishi.
Speaker adaptation and the evaluation of speaker similarity in the
EMIME speech-to-speech translation project.
In Proc. 7th ISCA Speech Synthesis Workshop, Kyoto, Japan,
September 2010.
[ bib |
.pdf |
Abstract ]
|
[91]
|
Javier Tejedor, Doroteo T. Toledano, Miguel Bautista, Simon King, Dong Wang,
and Jose Colas.
Augmented set of features for confidence estimation in spoken term
detection.
In Proc. Interspeech, September 2010.
[ bib |
.pdf |
Abstract ]
|
[92]
|
Oliver Watts, Junichi Yamagishi, and Simon King.
Letter-based speech synthesis.
In Proc. Speech Synthesis Workshop 2010, pages 317-322, Nara,
Japan, September 2010.
[ bib |
.pdf |
Abstract ]
|
[93]
|
O. Watts, J. Yamagishi, S. King, and K. Berkling.
Synthesis of child speech with HMM adaptation and voice conversion.
Audio, Speech, and Language Processing, IEEE Transactions on,
18(5):1005-1016, July 2010.
[ bib |
DOI |
.pdf |
Abstract ]
|
[94]
|
Alice Turk, James Scobbie, Christian Geng, Barry Campbell, Catherine Dickie,
Eddie Dubourg, Ellen Gurman Bard, William Hardcastle, Mariam Hartinger, Simon
King, Robin Lickley, Cedric Macmartin, Satsuki Nakai, Steve Renals, Korin
Richmond, Sonja Schaeffler, Kevin White, Ronny Wiegand, and Alan Wrench.
An Edinburgh speech production facility.
Poster presented at the 12th Conference on Laboratory Phonology,
Albuquerque, New Mexico., July 2010.
[ bib |
.pdf ]
|
[95]
|
D. Wang, S. King, and J. Frankel.
Stochastic pronunciation modelling for out-of-vocabulary spoken term
detection.
Audio, Speech, and Language Processing, IEEE Transactions on,
PP(99), July 2010.
[ bib |
DOI |
Abstract ]
|
[96]
|
Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong
Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro
Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi
Tokuda, Mirjam Wester, Yi-Jian Wu, and Junichi Yamagishi.
Personalising speech-to-speech translation in the EMIME project.
In Proc. ACL 2010 System Demonstrations, Uppsala, Sweden, July
2010.
[ bib |
.pdf |
Abstract ]
|
[97]
|
J. Yamagishi, B. Usabaev, S. King, O. Watts, J. Dines, J. Tian, R. Hu, Y. Guan,
K. Oura, K. Tokuda, R. Karhila, and M. Kurimo.
Thousands of voices for HMM-based speech synthesis - analysis and
application of TTS systems built on various ASR corpora.
IEEE Transactions on Audio, Speech and Language Processing,
18(5):984-1004, July 2010.
[ bib |
DOI |
Abstract ]
|
[98]
|
R. Barra-Chicote, J. Yamagishi, S. King, J. Manuel Monero, and
J. Macias-Guarasa.
Analysis of statistical parametric and unit-selection speech
synthesis systems applied to emotional speech.
Speech Communication, 52(5):394-404, May 2010.
[ bib |
DOI |
Abstract ]
|
[99]
|
Dong Wang, Simon King, Joe Frankel, and Peter Bell.
Stochastic pronunciation modelling and soft match for
out-of-vocabulary spoken term detection.
In Proc. ICASSP, Dallas, Texas, USA, March 2010.
[ bib |
.pdf |
Abstract ]
|
[100]
|
Simon King.
Speech synthesis.
In Morgan and Ellis, editors, Speech and Audio Signal
Processing. Wiley, 2010.
[ bib |
Abstract ]
|
[101]
|
Steve Renals and Simon King.
Automatic speech recognition.
In William J. Hardcastle, John Laver, and Fiona E. Gibbon, editors,
Handbook of Phonetic Sciences, chapter 22. Wiley Blackwell, 2010.
[ bib ]
|
[102]
|
Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Mirjam Wester, and Simon
King.
Unsupervised cross-lingual speaker adaptation for HMM-based speech
synthesis.
In Proc. ICASSP, volume I, pages 4954-4957, 2010.
[ bib |
.pdf |
Abstract ]
|
[103]
|
Volker Strom and Simon King.
A classifier-based target cost for unit selection speech synthesis
trained on perceptual data.
In Proc. Interspeech, Makuhari, Japan, 2010.
[ bib |
.ps |
.pdf |
Abstract ]
|
[104]
|
Alice Turk, James Scobbie, Christian Geng, Cedric Macmartin, Ellen Bard, Barry
Campbell, Catherine Dickie, Eddie Dubourg, Bill Hardcastle, Phil Hoole, Evia
Kanaida, Robin Lickley, Satsuki Nakai, Marianne Pouplier, Simon King, Steve
Renals, Korin Richmond, Sonja Schaeffler, Ronnie Wiegand, Kevin White, and
Alan Wrench.
The Edinburgh Speech Production Facility's articulatory corpus of
spontaneous dialogue.
The Journal of the Acoustical Society of America,
128(4):2429-2429, 2010.
[ bib |
DOI |
Abstract ]
|
[105]
|
J. Yamagishi and S. King.
Simple methods for improving speaker-similarity of HMM-based speech
synthesis.
In Proc. ICASSP 2010, Dallas, Texas, USA, 2010.
[ bib |
.pdf ]
|
[106]
|
Simon King.
A tutorial on HMM speech synthesis (invited paper).
In Sadhana - Academy Proceedings in Engineering Sciences,
Indian Institute of Sciences, 2010.
[ bib |
.pdf |
Abstract ]
|
[107]
|
Peter Bell and Simon King.
Diagonal priors for full covariance speech recognition.
In Proc. IEEE Workshop on Automatic Speech Recognition and
Understanding, Merano, Italy, December 2009.
[ bib |
DOI |
.pdf |
Abstract ]
|
[108]
|
Dong Wang, Simon King, and Joe Frankel.
Stochastic pronunciation modelling for spoken term detection.
In Proc. Interspeech, pages 2135-2138, Brighton, UK, September
2009.
[ bib |
.pdf |
Abstract ]
|
[109]
|
Oliver Watts, Junichi Yamagishi, Simon King, and Kay Berkling.
HMM adaptation and voice conversion for the synthesis of child
speech: A comparison.
In Proc. Interspeech 2009, pages 2627-2630, Brighton, U.K.,
September 2009.
[ bib |
.pdf |
Abstract ]
|
[110]
|
Simon King and Vasilis Karaiskos.
The Blizzard Challenge 2009.
In Proc. Blizzard Challenge Workshop, Edinburgh, UK, September
2009.
[ bib |
.pdf |
Abstract ]
|
[111]
|
Dong Wang, Simon King, Joe Frankel, and Peter Bell.
Term-dependent confidence for out-of-vocabulary term detection.
In Proc. Interspeech, pages 2139-2142, Brighton, UK, September
2009.
[ bib |
.pdf |
Abstract ]
|
[112]
|
Junichi Yamagishi, Mike Lincoln, Simon King, John Dines, Matthew Gibson, Jilei
Tian, and Yong Guan.
Analysis of unsupervised and noise-robust speaker-adaptive
HMM-based speech synthesis systems toward a unified ASR and TTS
framework.
In Proc. Interspeech 2009, Edinburgh, U.K., September 2009.
[ bib |
Abstract ]
|
[113]
|
J. Dines, J. Yamagishi, and S. King.
Measuring the gap between HMM-based ASR and TTS.
In Proc. Interspeech, pages 1391-1394, Brighton, U.K.,
September 2009.
[ bib |
Abstract ]
|
[114]
|
Javier Tejedor, Dong Wang, Simon King, Joe Frankel, and Jose Colas.
A posterior probability-based system hybridisation and combination
for spoken term detection.
In Proc. Interspeech, pages 2131-2134, Brighton, UK, September
2009.
[ bib |
.pdf |
Abstract ]
|
[115]
|
J. Yamagishi, Bela Usabaev, Simon King, Oliver Watts, John Dines, Jilei Tian,
Rile Hu, Yong Guan, Keiichiro Oura, Keiichi Tokuda, Reima Karhila, and Mikko
Kurimo.
Thousands of voices for HMM-based speech synthesis.
In Proc. Interspeech, pages 420-423, Brighton, U.K., September
2009.
[ bib |
http |
Abstract ]
|
[116]
|
Dong Wang, Tejedor Tejedor, Joe Frankel, and Simon King.
Posterior-based confidence measures for spoken term detection.
In Proc. ICASSP09, Taiwan, April 2009.
[ bib |
.pdf |
Abstract ]
|
[117]
|
Matthew P. Aylett, Simon King, and Junichi Yamagishi.
Speech synthesis without a phone inventory.
In Interspeech, pages 2087-2090, 2009.
[ bib |
.pdf |
Abstract ]
|
[118]
|
Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhenhua Ling, Tomoki Toda, Keiichi
Tokuda, Simon King, and Steve Renals.
Robust speaker-adaptive HMM-based text-to-speech synthesis.
IEEE Transactions on Audio, Speech and Language Processing,
17(6):1208-1230, 2009.
[ bib |
http |
www: |
Abstract ]
|
[119]
|
R. Barra-Chicote, J. Yamagishi, J.M. Montero, S. King, S. Lutfi, and
J. Macias-Guarasa.
Generacion de una voz sintetica en Castellano basada en HSMM para
la Evaluacion Albayzin 2008: conversion texto a voz.
In V Jornadas en Tecnologia del Habla, pages 115-118, November
2008.
(in Spanish).
[ bib |
.pdf ]
|
[120]
|
Javier Tejedor, Dong Wang, Joe Frankel, Simon King, and José Colás.
A comparison of grapheme and phoneme-based units for Spanish spoken
term detection.
Speech Communication, 50(11-12):980-991, November 2008.
[ bib |
DOI |
Abstract ]
|
[121]
|
Oliver Watts, Junichi Yamagishi, Kay Berkling, and Simon King.
HMM-based synthesis of child speech.
In Proc. 1st Workshop on Child, Computer and Interaction
(ICMI'08 post-conference workshop), Crete, Greece, October 2008.
[ bib |
.pdf |
Abstract ]
|
[122]
|
Peter Bell and Simon King.
A shrinkage estimator for speech recognition with full covariance
HMMs.
In Proc. Interspeech, Brisbane, Australia, September 2008.
Shortlisted for best student paper award.
[ bib |
.pdf |
Abstract ]
|
[123]
|
Junichi Yamagishi, Zhenhua Ling, and Simon King.
Robustness of hmm-based speech synthesis.
In Proc. Interspeech 2008, pages 581-584, Brisbane, Australia,
September 2008.
[ bib |
.pdf |
Abstract ]
|
[124]
|
Dong Wang, Ivan Himawan, Joe Frankel, and Simon King.
A posterior approach for microphone array based speech recognition.
In Proc. Interspeech, pages 996-999, September 2008.
[ bib |
.pdf |
Abstract ]
|
[125]
|
Joe Frankel, Dong Wang, and Simon King.
Growing bottleneck features for tandem ASR.
In Proc. Interspeech, page 1549, September 2008.
[ bib |
.pdf |
Abstract ]
|
[126]
|
Simon King, Keiichi Tokuda, Heiga Zen, and Junichi Yamagishi.
Unsupervised adaptation for hmm-based speech synthesis.
In Proc. Interspeech, pages 1869-1872, Brisbane, Australia,
September 2008.
[ bib |
.PDF |
Abstract ]
|
[127]
|
Laszlo Toth, Joe Frankel, Gabor Gosztolya, and Simon King.
Cross-lingual portability of mlp-based tandem features - a case
study for english and hungarian.
In Proc. Interspeech, pages 2695-2698, Brisbane, Australia,
September 2008.
[ bib |
.PDF |
Abstract ]
|
[128]
|
Vasilis Karaiskos, Simon King, Robert A. J. Clark, and Catherine Mayo.
The blizzard challenge 2008.
In Proc. Blizzard Challenge Workshop, Brisbane, Australia,
September 2008.
[ bib |
.pdf |
Abstract ]
|
[129]
|
Peter Bell and Simon King.
Covariance updates for discriminative training by constrained line
search.
In Proc. Interspeech, Brisbane, Australia, September 2008.
[ bib |
.pdf |
Abstract ]
|
[130]
|
Olga Goubanova and Simon King.
Bayesian networks for phone duration prediction.
Speech Communication, 50(4):301-311, April 2008.
[ bib |
DOI |
Abstract ]
|
[131]
|
Dong Wang, Joe Frankel, Javier Tejedor, and Simon King.
A comparison of phone and grapheme-based spoken term detection.
In Proc. ICASSP, pages 4969-4972, March 2008.
[ bib |
DOI |
Abstract ]
|
[132]
|
Matthew P. Aylett and Simon King.
Single speaker segmentation and inventory selection using dynamic
time warping self organization and joint multigram mapping.
In SSW06, pages 258-263, 2008.
[ bib |
.pdf |
Abstract ]
|
[133]
|
Volker Strom and Simon King.
Investigating Festival's target cost function using perceptual
experiments.
In Proc. Interspeech, Brisbane, 2008.
[ bib |
.ps |
.pdf |
Abstract ]
|
[134]
|
J. Frankel and S. King.
Factoring Gaussian precision matrices for linear dynamic models.
Pattern Recognition Letters, 28(16):2264-2272, December 2007.
[ bib |
DOI |
.pdf |
Abstract ]
|
[135]
|
Ö. Çetin, M. Magimai-Doss, A. Kantor, S. King, C. Bartels, J. Frankel, and
K. Livescu.
Monolingual and crosslingual comparison of tandem features derived
from articulatory and phone MLPs.
In Proc. ASRU, Kyoto, December 2007. IEEE.
[ bib |
.pdf |
Abstract ]
|
[136]
|
J. Frankel, M. Wester, and S. King.
Articulatory feature recognition using dynamic Bayesian networks.
Computer Speech & Language, 21(4):620-640, October 2007.
[ bib |
.pdf |
Abstract ]
|
[137]
|
J. Frankel, M. Magimai-Doss, S. King, K. Livescu, and Ö. Çetin.
Articulatory feature classifiers trained on 2000 hours of telephone
speech.
In Proc. Interspeech, Antwerp, Belgium, August 2007.
[ bib |
.pdf |
Abstract ]
|
[138]
|
Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki
Toda, and Keiichi Tokuda.
Improved average-voice-based speech synthesis using gender-mixed
modeling and a parameter generation algorithm considering GV.
In Proc. 6th ISCA Workshop on Speech Synthesis (SSW-6), August
2007.
[ bib |
.pdf |
Abstract ]
|
[139]
|
Robert A. J. Clark, Monika Podsiadlo, Mark Fraser, Catherine Mayo, and Simon
King.
Statistical analysis of the Blizzard Challenge 2007 listening
test results.
In Proc. Blizzard 2007 (in Proc. Sixth ISCA Workshop on Speech
Synthesis), Bonn, Germany, August 2007.
[ bib |
.pdf |
Abstract ]
|
[140]
|
Mark Fraser and Simon King.
The Blizzard Challenge 2007.
In Proc. Blizzard 2007 (in Proc. Sixth ISCA Workshop on Speech
Synthesis), Bonn, Germany, August 2007.
[ bib |
.pdf |
Abstract ]
|
[141]
|
Volker Strom, Ani Nenkova, Robert Clark, Yolanda Vazquez-Alvarez, Jason
Brenier, Simon King, and Dan Jurafsky.
Modelling prominence and emphasis improves unit-selection synthesis.
In Proc. Interspeech 2007, Antwerp, Belgium, August 2007.
[ bib |
.pdf |
Abstract ]
|
[142]
|
Peter Bell and Simon King.
Sparse gaussian graphical models for speech recognition.
In Proc. Interspeech 2007, Antwerp, Belgium, August 2007.
[ bib |
.pdf |
Abstract ]
|
[143]
|
Ö. Çetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and
K. Livescu.
An articulatory feature-based tandem approach and factored
observation modeling.
In Proc. ICASSP, Honolulu, April 2007.
[ bib |
.pdf |
Abstract ]
|
[144]
|
K. Livescu, Ö. Çetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges,
A. Kantor, P. Lal, L. Yung, S. Bezman, Dawson-Haggerty, B. Woods, J. Frankel,
M. Magimai-Doss, and K. Saenko.
Articulatory feature-based methods for acoustic and audio-visual
speech recognition: Summary from the 2006 JHU Summer Workshop.
In Proc. ICASSP, Honolulu, April 2007.
[ bib |
.pdf |
Abstract ]
|
[145]
|
K. Livescu, A. Bezman, N. Borges, L. Yung, Ö. Çetin, J. Frankel, S. King,
M. Magimai-Doss, X. Chi, and L. Lavoie.
Manual transcription of conversational speech at the articulatory
feature level.
In Proc. ICASSP, Honolulu, April 2007.
[ bib |
.pdf |
Abstract ]
|
[146]
|
S. King, J. Frankel, K. Livescu, E. McDermott, K. Richmond, and M. Wester.
Speech production knowledge in automatic speech recognition.
Journal of the Acoustical Society of America, 121(2):723-742,
February 2007.
[ bib |
.pdf |
Abstract ]
|
[147]
|
J. Frankel and S. King.
Speech recognition using linear dynamic models.
IEEE Transactions on Speech and Audio Processing,
15(1):246-256, January 2007.
[ bib |
.ps |
.pdf |
Abstract ]
|
[148]
|
Robert A. J. Clark, Korin Richmond, and Simon King.
Multisyn: Open-domain unit selection for the Festival speech
synthesis system.
Speech Communication, 49(4):317-330, 2007.
[ bib |
DOI |
.pdf |
Abstract ]
|
[149]
|
Jithendra Vepa and Simon King.
Subjective evaluation of join cost and smoothing methods for unit
selection speech synthesis.
IEEE Transactions on Speech and Audio Processing,
14(5):1763-1771, September 2006.
[ bib |
.pdf |
Abstract ]
|
[150]
|
J. Frankel and S. King.
Observation process adaptation for linear dynamic models.
Speech Communication, 48(9):1192-1199, September 2006.
[ bib |
.ps |
.pdf |
Abstract ]
|
[151]
|
R. Clark, K. Richmond, V. Strom, and S. King.
Multisyn voices for the Blizzard Challenge 2006.
In Proc. Blizzard Challenge Workshop (Interspeech Satellite),
Pittsburgh, USA, September 2006.
(http://festvox.org/blizzard/blizzard2006.html).
[ bib |
.pdf |
Abstract ]
|
[152]
|
Robert A. J. Clark and Simon King.
Joint prosodic and segmental unit selection speech synthesis.
In Proc. Interspeech 2006, Pittsburgh, USA, September 2006.
[ bib |
.ps |
.pdf |
Abstract ]
|
[153]
|
Simon King.
Handling variation in speech and language processing.
In Keith Brown, editor, Encyclopedia of Language and
Linguistics. Elsevier, 2nd edition, 2006.
[ bib ]
|
[154]
|
Simon King.
Language variation in speech technologies.
In Keith Brown, editor, Encyclopedia of Language and
Linguistics. Elsevier, 2nd edition, 2006.
[ bib ]
|
[155]
|
Volker Strom, Robert Clark, and Simon King.
Expressive prosody for unit-selection speech synthesis.
In Proc. Interspeech, Pittsburgh, 2006.
[ bib |
.ps |
.pdf |
Abstract ]
|
[156]
|
Robert A.J. Clark, Korin Richmond, and Simon King.
Multisyn voices from ARCTIC data for the Blizzard challenge.
In Proc. Interspeech 2005, September 2005.
[ bib |
.pdf |
Abstract ]
|
[157]
|
C. Mayo, R. A. J. Clark, and S. King.
Multidimensional scaling of listener responses to synthetic speech.
In Proc. Interspeech 2005, Lisbon, Portugal, September 2005.
[ bib |
.pdf ]
|
[158]
|
J. Frankel and S. King.
A hybrid ANN/DBN approach to articulatory feature recognition.
In Proc. Eurospeech, Lisbon, September 2005.
[ bib |
.ps |
.pdf |
Abstract ]
|
[159]
|
Alexander Gutkin and Simon King.
Inductive String Template-Based Learning of Spoken
Language.
In Hugo Gamboa and Ana Fred, editors, Proc. 5th International
Workshop on Pattern Recognition in Information Systems (PRIS-2005), In
conjunction with the 7th International Conference on Enterprise Information
Systems (ICEIS-2005), pages 43-51, Miami, USA, May 2005. INSTICC Press.
[ bib |
.ps.gz |
.pdf |
Abstract ]
|
[160]
|
Alexander Gutkin and Simon King.
Detection of Symbolic Gestural Events in Articulatory
Data for Use in Structural Representations of Continuous Speech.
In Proc. IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP-05), volume I, pages 885-888, Philadelphia, PA,
USA, March 2005. IEEE Signal Processing Society Press.
[ bib |
.ps.gz |
.pdf |
Abstract ]
|
[161]
|
Simon King, Chris Bartels, and Jeff Bilmes.
Svitchboard 1: Small vocabulary tasks from switchboard 1.
In Proc. Interspeech 2005, Lisbon, Portugal, 2005.
[ bib |
.pdf |
Abstract ]
|
[162]
|
Olga Goubanova and Simon King.
Predicting consonant duration with Bayesian belief networks.
In Proc. Interspeech 2005, Lisbon, Portugal, 2005.
[ bib |
.pdf |
Abstract ]
|
[163]
|
M. Wester, J. Frankel, and S. King.
Asynchronous articulatory feature recognition using dynamic
Bayesian networks.
In Proc. IEICI Beyond HMM Workshop, Kyoto, December 2004.
[ bib |
.ps |
.pdf |
Abstract ]
|
[164]
|
Yoshinori Shiga and Simon King.
Source-filter separation for articulation-to-speech synthesis.
In Proc. ICSLP, Jeju, Korea, October 2004.
[ bib |
.ps |
.pdf |
Abstract ]
|
[165]
|
Jithendra Vepa and Simon King.
Subjective evaluation of join cost functions used in unit selection
speech synthesis.
In Proc. 8th International Conference on Spoken Language
Processing (ICSLP), Jeju, Korea, October 2004.
[ bib |
.pdf |
Abstract ]
|
[166]
|
Yoshinori Shiga and Simon King.
Estimating detailed spectral envelopes using articulatory clustering.
In Proc. ICSLP, Jeju, Korea, October 2004.
[ bib |
.ps |
.pdf |
Abstract ]
|
[167]
|
Alexander Gutkin and Simon King.
Phone classification in pseudo-Euclidean vector spaces.
In Proc. 8th International Conference on Spoken Language
Processing (ICSLP), volume II, pages 1453-1457, Jeju Island, Korea, October
2004.
[ bib |
.ps.gz |
.pdf |
Abstract ]
|
[168]
|
J. Frankel, M. Wester, and S. King.
Articulatory feature recognition using dynamic Bayesian networks.
In Proc. ICSLP, September 2004.
[ bib |
.ps |
.pdf |
Abstract ]
|
[169]
|
Alexander Gutkin and Simon King.
Structural Representation of Speech for Phonetic
Classification.
In Proc. 17th International Conference on Pattern Recognition
(ICPR), volume 3, pages 438-441, Cambridge, UK, August 2004. IEEE Computer
Society Press.
[ bib |
.ps.gz |
.pdf |
Abstract ]
|
[170]
|
J. Vepa and S. King.
Subjective evaluation of join cost and smoothing methods.
In Proc. 5th ISCA speech synthesis workshop, Pittsburgh, USA,
June 2004.
[ bib |
.pdf |
Abstract ]
|
[171]
|
Yoshinori Shiga and Simon King.
Accurate spectral envelope estimation for articulation-to-speech
synthesis.
In Proc. 5th ISCA Speech Synthesis Workshop, pages 19-24, CMU,
Pittsburgh, USA, June 2004.
[ bib |
.ps |
.pdf |
Abstract ]
|
[172]
|
Jithendra Vepa and Simon King.
Join cost for unit selection speech synthesis.
In Abeer Alwan and Shri Narayanan, editors, Speech Synthesis.
Prentice Hall, 2004.
[ bib |
.ps ]
|
[173]
|
Robert A.J. Clark, Korin Richmond, and Simon King.
Festival 2 - build your own general purpose unit selection speech
synthesiser.
In Proc. 5th ISCA workshop on speech synthesis, 2004.
[ bib |
.ps |
.pdf |
Abstract ]
|
[174]
|
Ben Gillett and Simon King.
Transforming F0 contours.
In Proc. Eurospeech, Geneva, September 2003.
[ bib |
.pdf |
Abstract ]
|
[175]
|
Yoshinori Shiga and Simon King.
Estimating the spectral envelope of voiced speech using multi-frame
analysis.
In Proc. Eurospeech-2003, volume 3, pages 1737-1740, Geneva,
Switzerland, September 2003.
[ bib |
.ps |
.pdf |
Abstract ]
|
[176]
|
James Horlock and Simon King.
Named entity extraction from word lattices.
In Proc. Eurospeech, Geneva, September 2003.
[ bib |
.pdf |
Abstract ]
|
[177]
|
James Horlock and Simon King.
Discriminative methods for improving named entity extraction on
speech data.
In Proc. Eurospeech, Geneva, September 2003.
[ bib |
.pdf |
Abstract ]
|
[178]
|
Ben Gillett and Simon King.
Transforming voice quality.
In Proc. Eurospeech, Geneva, September 2003.
[ bib |
.pdf |
Abstract ]
|
[179]
|
Yoshinori Shiga and Simon King.
Estimation of voice source and vocal tract characteristics based on
multi-frame analysis.
In Proc. Eurospeech, volume 3, pages 1749-1752, Geneva,
Switzerland, September 2003.
[ bib |
.ps |
.pdf |
Abstract ]
|
[180]
|
K. Richmond, S. King, and P. Taylor.
Modelling the uncertainty in recovering articulation from acoustics.
Computer Speech and Language, 17:153-172, 2003.
[ bib |
.pdf |
Abstract ]
|
[181]
|
Christophe Van Bael and Simon King.
An accent-independent lexicon for automatic speech recognition.
In Proc. ICPhS, pages 1165-1168, 2003.
[ bib |
.pdf |
Abstract ]
|
[182]
|
J. Vepa and S. King.
Kalman-filter based join cost for unit-selection speech synthesis.
In Proc. Eurospeech, Geneva, Switzerland, 2003.
[ bib |
.pdf |
Abstract ]
|
[183]
|
Simon King.
Dependence and independence in automatic speech recognition and
synthesis.
Journal of Phonetics, 31(3-4):407-411, 2003.
[ bib |
.pdf |
Abstract ]
|
[184]
|
J. Vepa, S. King, and P. Taylor.
Objective distance measures for spectral discontinuities in
concatenative speech synthesis.
In Proc. ICSLP, Denver, USA, September 2002.
[ bib |
.pdf |
Abstract ]
|
[185]
|
J. Vepa, S. King, and P. Taylor.
New objective distance measures for spectral discontinuities in
concatenative speech synthesis.
In Proc. IEEE 2002 workshop on speech synthesis, Santa
Monica, USA, September 2002.
[ bib |
.pdf |
Abstract ]
|
[186]
|
Jesper Salomon, Simon King, and Miles Osborne.
Framewise phone classification using support vector machines.
In Proceedings International Conference on Spoken Language
Processing, Denver, 2002.
[ bib |
.ps |
.pdf |
Abstract ]
|
[187]
|
J. Frankel and S. King.
ASR - articulatory speech recognition.
In Proc. Eurospeech, pages 599-602, Aalborg, Denmark,
September 2001.
[ bib |
.ps |
.pdf |
Abstract ]
|
[188]
|
J. Frankel and S. King.
Speech recognition in the articulatory domain: investigating an
alternative to acoustic HMMs.
In Proc. Workshop on Innovations in Speech Processing, April
2001.
[ bib |
.ps |
.pdf |
Abstract ]
|
[189]
|
J. Frankel, K. Richmond, S. King, and P. Taylor.
An automatic speech recognition system using neural networks and
linear dynamic models to recover and model articulatory traces.
In Proc. ICSLP, 2000.
[ bib |
.ps |
.pdf |
Abstract ]
|
[190]
|
S. King, P. Taylor, J. Frankel, and K. Richmond.
Speech recognition via phonetically-featured syllables.
In PHONUS, volume 5, pages 15-34, Institute of Phonetics,
University of the Saarland, 2000.
[ bib |
.ps |
.pdf |
Abstract ]
|
[191]
|
Simon King and Paul Taylor.
Detection of phonological features in continuous speech using neural
networks.
Computer Speech and Language, 14(4):333-353, 2000.
[ bib |
.ps |
.pdf |
Abstract ]
|
[192]
|
Simon King and Alan Wrench.
Dynamical system modelling of articulator movement.
In Proc. ICPhS 99, pages 2259-2262, San Francisco, August
1999.
[ bib |
.ps |
.pdf |
Abstract ]
|
[193]
|
Simon King, Todd Stephenson, Stephen Isard, Paul Taylor, and Alex Strachan.
Speech recognition via phonetically featured syllables.
In Proc. ICSLP `98, pages 1031-1034, Sydney, Australia,
December 1998.
[ bib |
.ps |
.pdf |
Abstract ]
|
[194]
|
Paul A. Taylor, S. King, S. D. Isard, and H. Wright.
Intonation and dialogue context as constraints for speech
recognition.
Language and Speech, 41(3):493-512, 1998.
[ bib |
.ps |
.pdf ]
|
[195]
|
Simon King.
Using Information Above the Word Level for Automatic Speech
Recognition.
PhD thesis, University of Edinburgh, 1998.
[ bib |
.ps |
.pdf |
Abstract ]
|
[196]
|
Simon King, Thomas Portele, and Florian Höfer.
Speech synthesis using non-uniform units in the Verbmobil project.
In Proc. Eurospeech 97, volume 2, pages 569-572, Rhodes,
Greece, September 1997.
[ bib |
.ps |
.pdf |
Abstract ]
|
[197]
|
Simon King.
Final report for Verbmobil Teilprojekt 4.4.
Technical Report ISSN 1434-8845, IKP, Universitaet Bonn, January
1997.
Verbmobil-Report 195 available at http://verbmobil.dfki.de.
[ bib |
Abstract ]
|
[198]
|
Paul A. Taylor, Simon King, Stephen Isard, Helen Wright, and Jacqueline Kowtko.
Using intonation to constrain language models in speech recognition.
In Proc. Eurospeech'97, Rhodes, 1997.
[ bib |
.pdf |
Abstract ]
|
[199]
|
Simon King.
Users Manual for Verbmobil Teilprojekt 4.4.
IKP, Universitaet Bonn, October 1996.
[ bib |
Abstract ]
|
[200]
|
Simon King.
Inventory design for Verbmobil Teilprojekt 4.4.
Technical report, IKP, Universität Bonn, October 1996.
[ bib |
Abstract ]
|
[201]
|
Paul A. Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, and Jacqueline
Kowtko.
Using prosodic information to constrain language models for spoken
dialogue.
In Proc. ICSLP `96, Philadelphia, 1996.
[ bib |
.ps |
.pdf |
Abstract ]
|
[202]
|
Stephen Isard, Simon King, Paul A. Taylor, and Jacqueline Kowtko.
Prosodic information in a speech recognition system intended for
dialogue.
In IEEE Workshop in speech recognition, Snowbird, Utah, 1995.
[ bib |
Abstract ]
|