TTS: [Bug] Tacotron 2 with DCA is broken

🐛 Description

I’ve trained vanilla T2+DCA for 300k steps on the current main branch and it sounds/looks broken.

To Reproduce

Training script Config

Environment

Environment

Additional context

Audio files from 300k steps

Screenshot 2022-02-21 at 10 52 15 Screenshot 2022-02-21 at 10 52 21 Screenshot 2022-02-21 at 10 52 08 Screenshot 2022-02-21 at 10 51 58

Train alignment figure:

trainfigure

About this issue

  • Original URL
  • State: closed
  • Created 2 years ago
  • Comments: 15 (13 by maintainers)

Most upvoted comments

all is good, closing this. Thanks @lexkoro

@erogol perfect thank you! I’m waiting for my training to run, posting results here tonight and closing the issue.

Might be because you are using np.log with spec_gain: 20, either try

spec_gain=20.0,
log_func="np.log10",

or

spec_gain=1.0,
log_func="np.log",

Started to train the LJSpeech DCA recipe. I share my updates here if there is an issue or not