TTS: [Bug] Portuguese TTS model on XTTS pronounces the "." (dot) character when it appears in the text
Describe the bug
Hello,
It seems a bit of an “oopsie” was made when handling the Portuguese dataset: the PTBR model now pronounces the “.” character as “ponto” every time we enter sentences like:
“Olá, sou seu novo clone de voz. Faça o possível para carregar um áudio de qualidade.”
Here is the output: https://vocaroo.com/1404xnr0Vkmc
It was not supposed to say “ponto”…
It goes like:
“Olá, sou seu novo clone de voz ponto Faça o possível para carregar um áudio de qualidade ponto”
But it should not be like that.
To Reproduce
Set the client to Portuguese (pt), then type any text that includes a “.” (dot).
Expected behavior
The dot should not be pronounced. The purpose of “.” is to indicate the end of a declarative sentence or to separate certain elements in written text.
Logs
None
Environment
git clone https://huggingface.co/spaces/coqui/xtts
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python app.py
Additional context
No response
About this issue
- State: closed
- Created 10 months ago
- Comments: 16 (3 by maintainers)
Italian has the same issue. Apart from workarounds, did you find a stable fix?
The “…” method does not work, and neither does “!”.
Thanks
PS: for Italian, replacing “.” with “\n” works.
In general, using “…” instead of “.” also works for Portuguese.
As a temporary workaround, replacing dots “.” with exclamation marks “!” fixes the problem.
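For anyone who wants to apply these workarounds automatically, here is a minimal sketch of a text preprocessing step. The helper name `replace_sentence_dots` is hypothetical and not part of Coqui TTS; it only rewrites the input string before it is handed to the model. It assumes that a sentence-ending dot is a single “.” followed by whitespace or the end of the text, so decimals like 3.14 and an existing “...” are left alone.

```python
import re

# Hypothetical helper, not part of Coqui TTS: it only rewrites the text
# before it is passed to the synthesizer.
# Matches a single "." that is followed by whitespace or the end of the
# text and is not preceded by another "." (so "..." is not touched).
_SENTENCE_DOT = re.compile(r"(?<!\.)\.(?=\s|$)")


def replace_sentence_dots(text: str, replacement: str = "\n") -> str:
    """Replace each sentence-ending "." with `replacement`."""
    out = _SENTENCE_DOT.sub(replacement, text)
    if replacement == "\n":
        # Drop the spaces left on either side of an inserted newline.
        out = re.sub(r"[ \t]*\n[ \t]*", "\n", out)
    # Remove leading/trailing whitespace, including a final newline.
    return out.strip()


if __name__ == "__main__":
    sample = (
        "Olá, sou seu novo clone de voz. "
        "Faça o possível para carregar um áudio de qualidade."
    )
    print(replace_sentence_dots(sample))        # newline variant
    print(replace_sentence_dots(sample, "!"))   # exclamation variant
    print(replace_sentence_dots(sample, "..."))  # ellipsis variant
```

Which replacement helps seems to depend on the language: in this thread “\n” is reported to work for Italian while “…” and “!” do not, and “…” and “!” are reported to work for Portuguese. The default here is “\n” for that reason, but it is worth listening to the output for your own language before relying on it.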
I experienced the same problem with xtts-v2 using German.
A similar error exists in other languages, such as French, Russian and Japanese. The problem appears with model xtts_v1.1, coqui 0.19.0, Python 3.11.5.