Parakeet - PAddle PARAllel text-to-speech toolKIT

What is Parakeet?

Parakeet is a deep-learning-based text-to-speech (TTS) toolkit built on the PaddlePaddle framework. It aims to give the open-source community a flexible, efficient, state-of-the-art TTS toolkit. It includes many influential TTS models proposed by Baidu Research and other research groups.

What can Parakeet do?

Parakeet mainly consists of the following components:

  • Implementations of models and commonly used neural network layers.

  • Dataset abstraction and common data preprocessing pipelines.

  • Ready-to-run experiments.

Parakeet provides you with a complete TTS pipeline, including:

  • Text Frontend

    • Rule-based Chinese frontend.

  • Acoustic Models

    • FastSpeech2

    • SpeedySpeech

    • TransformerTTS

    • Tacotron2

  • Vocoders

    • Parallel WaveGAN

    • WaveFlow

  • Voice Cloning

    • Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

    • GE2E
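
The stages listed above are chained at synthesis time: the text frontend turns raw text into token ids, an acoustic model turns the ids into spectrogram frames, and a vocoder turns the frames into waveform samples. The sketch below only illustrates that data flow in plain Python. Every name in it (toy_frontend, toy_acoustic_model, toy_vocoder, TTSPipeline) is a hypothetical stand-in and not part of Parakeet's API, and the toy functions perform no real signal processing.

```python
from dataclasses import dataclass
from typing import Callable, List

# NOTE: all names here are hypothetical stand-ins used for illustration.
# They are NOT Parakeet classes or functions.


def toy_frontend(text: str) -> List[int]:
    """Text frontend stand-in: map raw text to a sequence of token ids."""
    return [ord(ch) % 256 for ch in text.lower() if not ch.isspace()]


def toy_acoustic_model(token_ids: List[int],
                       n_mels: int = 4,
                       frames_per_token: int = 2) -> List[List[float]]:
    """Acoustic model stand-in: expand each token id into a fixed number
    of spectrogram-like frames, each holding n_mels values."""
    frames = []
    for token_id in token_ids:
        for _ in range(frames_per_token):
            frames.append([token_id / 255.0] * n_mels)
    return frames


def toy_vocoder(frames: List[List[float]], hop_length: int = 3) -> List[float]:
    """Vocoder stand-in: expand each frame into hop_length waveform samples."""
    waveform = []
    for frame in frames:
        level = sum(frame) / len(frame)
        waveform.extend([level] * hop_length)
    return waveform


@dataclass
class TTSPipeline:
    """Chains the three stages: text -> token ids -> frames -> waveform."""
    frontend: Callable[[str], List[int]]
    acoustic_model: Callable[[List[int]], List[List[float]]]
    vocoder: Callable[[List[List[float]]], List[float]]

    def synthesize(self, text: str) -> List[float]:
        token_ids = self.frontend(text)
        frames = self.acoustic_model(token_ids)
        return self.vocoder(frames)


pipeline = TTSPipeline(toy_frontend, toy_acoustic_model, toy_vocoder)
wav = pipeline.synthesize("hi there")
# 7 non-space characters * 2 frames per token * 3 samples per frame
print(len(wav))  # prints 42
```

Because each stage is just a callable with a fixed input and output type, any acoustic model (for example FastSpeech2 or Tacotron2) could in principle be paired with any vocoder (for example Parallel WaveGAN or WaveFlow), provided both agree on the spectrogram format.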

Parakeet helps you train TTS models with simple commands.