parakeet.audio package

Submodules

parakeet.audio.audio module

class parakeet.audio.audio.AudioProcessor(sample_rate: int, n_fft: int, win_length: int, hop_length: int, n_mels: int = 80, fmin: int = 0, fmax: Optional[int] = None, window='hann', center=True, pad_mode='reflect', normalize=True)[source]

Bases: object

istft(D)[source]
mel_spectrogram(wav)[source]
read_wav(filename)[source]
spectrogram(wav)[source]
stft(wav)[source]
write_wav(path, wav)[source]

parakeet.audio.spec_normalizer module

This modules contains normalizers for spectrogram magnitude. Normalizers are invertible transformations. They can be used to process magnitude of spectrogram before training and can also be used to recover from the generated spectrogram so as to be used with vocoders like griffin lim.

The base class describe the interface. transform is used to perform transformation and inverse is used to perform the inverse transformation.

check issues: https://github.com/mozilla/TTS/issues/377

class parakeet.audio.spec_normalizer.LogMagnitude(min=1e-05)[source]

Bases: parakeet.audio.spec_normalizer.NormalizerBase

This is a simple normalizer used in Waveglow, Waveflow, tacotron2…

inverse(x)[source]
transform(x)[source]
class parakeet.audio.spec_normalizer.NormalizerBase[source]

Bases: object

inverse(normalized)[source]
transform(spec)[source]
class parakeet.audio.spec_normalizer.UnitMagnitude(min=1e-05)[source]

Bases: parakeet.audio.spec_normalizer.NormalizerBase

This is the normalizer used in the

inverse(x)[source]
transform(x)[source]

Module contents