
Parakeet is a deep-learning-based text-to-speech toolkit built on the PaddlePaddle framework. It aims to provide a flexible, efficient, and state-of-the-art text-to-speech toolkit for the open-source community. It includes many influential TTS models proposed by Baidu Research and other research groups.

Parakeet consists mainly of the following components:

  1. Implementation of models and commonly used neural network layers.

  2. Dataset abstraction and common data preprocessing pipelines.

  3. Ready-to-run experiments.


Design of Parakeet
