Neural Text-to-Speech Synthesis

Lieferzeit: Lieferbar innerhalb 14 Tagen

160,49 

Artificial Intelligence: Foundations, Theory, and Algorithms

ISBN: 9819908299
ISBN 13: 9789819908295
Autor: Tan, Xu
Verlag: Springer Verlag GmbH
Umfang: xxv, 201 S., 24 farbige Illustr., 201 p. 24 illus. in color.
Erscheinungsdatum: 18.07.2024
Auflage: 1/2024
Produktform: Kartoniert
Einband: Kartoniert
Artikelnummer: 4062527 Kategorie:

Beschreibung

Autorenporträt

Xu Tan is a Principal Researcher and Research Manager at Microsoft Research Asia. His research interests cover deep learning and its applications in language/speech/music processing and digital human creation. He has rich research experience in text-to-speech synthesis. He has developed high-quality TTS systems such as FastSpeech 1/2 (widely used in the TTS community), DelightfulTTS (winning the champion of the Blizzard TTS Challenge), and NaturalSpeech (achieving human-level quality on the TTS benchmark dataset), and transferred many research works to improve the experience of Microsoft Azure TTS services. He has given a series of tutorials on TTS at top conferences such as IJCAI, ICASSP, and INTERSPEECH, and written a comprehensive survey paper on TTS. Besides speech synthesis, he has designed several popular language models (e.g., MASS) and AI music systems (e.g., Muzic), developed machine translation systems that achieved human parity in Chinese-English translation and won several champions in WMT machine translation competitions. He has published over 100 papers at prestigious conferences such as ICML, NeurIPS, ICLR, AAAI, IJCAI, ACL, EMNLP, NAACL, ICASSP, INTERSPEECH, KDD, and IEEE/ACM Transactions, and served as the area chair or action editor of some AI conferences and journals (e.g., NeurIPS, AAAI, ICASSP, TMLR).

Herstellerkennzeichnung:


Springer Verlag GmbH
Tiergartenstr. 17
69121 Heidelberg
DE

E-Mail: juergen.hartmann@springer.com

Das könnte Ihnen auch gefallen …