TTS: [Bug] Noticing some anomalies in the training eval with YourTTS training
Describe the bug
The tensorboard shows that the first two or last two columns in the EvalFigures/evalspectrogram/real seem to be “multiplied” in some instances. This does not happen every time, but when it does, it shows up in the ‘diff’. This may potentially affect the training output and cause artifacts.
The images show that the multiplication occurs uniformly across the entire spectral band. I am investigating this issue and trying to determine its impact. I estimate that the affected area is around 6% (two out of 32 columns) or 12% (four out of 32 columns). I am not sure how many frames are impacted as the tensorboard only shows a snapshot of the particular step. However, this issue occurs frequently throughout different datasets.
Maybe it is related to the beginning or end of a file, or something along those lines and some padding has to be inserted to complete the segment? Am curious to know what it is 😃 I am using mixed precision, I’ll test later without to see if that makes any difference.
To Reproduce
Training on a YourTTS (and possibly VITS) model
Expected behavior
I would expect that in the ‘real’ data and ‘diff’, that there wouldn’t be an section of the spectrogram that intermittently has some consistent difference - as that would suggest that it’s not something from the training data itself.
Logs
No response
Environment
env_info.py
{
"CUDA": {
"GPU": [
"NVIDIA GeForce RTX 2080 Ti"
],
"available": true,
"version": "11.7"
},
"Packages": {
"PyTorch_debug": false,
"PyTorch_version": "1.13.1+cu117",
"TTS": "0.13.0",
"numpy": "1.23.5"
},
"System": {
"OS": "Linux",
"architecture": [
"64bit",
"ELF"
],
"processor": "x86_64",
"python": "3.10.6",
"version": "#42~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Tue Apr 18 17:40:00 UTC 2"
}
}
Additional context
No response
About this issue
- Original URL
- State: closed
- Created a year ago
- Comments: 15 (5 by maintainers)
You may find the answers you are seeking on the Coqui discord, it’s outside the scope of this issue
On Thu, Aug 3, 2023, 20:51 phamkhactu @.***> wrote:
@phamkhactu I’ve experimented with a couple solution attempts but haven’t solved it yet.
One attempt was with ChatGPT (didn’t seem to solve, thinking about trying similar with the mel functions that also use reflect):