Cannot load the model

#1
by jordimas - opened

Hello

Two quick comments:

  1. On the sample there are these imports:

import tempfile
import gradio as gr
import numpy as np
import os
import json

I do not think they are needed, so there is no need to install all these dependencies (e.g. gradio).

  2. When trying to load the model, following the recipe in the README.md, Coqui gives the error below.

When I look at the contents of config.json, I do not see a plain-text file; instead I see "archive/data.pklFBZZZZZZZZZZZZZ".

Maybe I should load the model following specific instructions? Or maybe the model is published in an incorrect format? I do not know.

Code used:

def syntetize():
    from typing import Optional
    from TTS.config import load_config
    from TTS.utils.manage import ModelManager
    from TTS.utils.synthesizer import Synthesizer

    # Paths to the files downloaded from the model repository
    model_path = "aina/model/best_model.pth"
    config_path = "aina/model/config.json"
    speakers_file_path = "aina/model/speakers.pth"

    text = "Hola amics!"
    speaker_idx = "1"

    # Loading fails here, while the config file is being parsed as JSON
    synthesizer = Synthesizer(
        model_path, config_path, speakers_file_path, None, None, None,
    )
    wavs = synthesizer.tts(text, speaker_idx)

Thanks

Jordi

  File "/home/jordi/sc/aina-python/lib/python3.11/site-packages/TTS/config/__init__.py", line 89, in load_config
    data = json.load(f)
           ^^^^^^^^^^^^
  File "/usr/lib/python3.11/json/__init__.py", line 293, in load
    return loads(fp.read(),
                 ^^^^^^^^^
  File "<frozen codecs>", line 322, in decode
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 64: invalid start byte
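
The two symptoms reported above (config.json showing "archive/data.pkl" and the decode error on byte 0x80) are consistent with the file being a zip-based PyTorch checkpoint rather than JSON text, although the thread does not confirm this. A minimal sketch for checking what a supposed config file really contains, using only the standard library; the function name `describe_config_file` and its return labels are hypothetical and not part of Coqui TTS:

```python
import json
import zipfile
from pathlib import Path


def describe_config_file(path):
    """Return 'zip-archive', 'json', or 'unknown' for the file at `path`."""
    path = Path(path)
    # Recent PyTorch versions save checkpoints as zip containers by default,
    # so a zip signature suggests a checkpoint stored under the wrong name.
    if zipfile.is_zipfile(path):
        return "zip-archive"
    try:
        with path.open("r", encoding="utf-8") as f:
            json.load(f)
        return "json"
    except (UnicodeDecodeError, json.JSONDecodeError):
        # Neither a zip container nor valid UTF-8 JSON.
        return "unknown"


# Example call (path taken from the post above):
# describe_config_file("aina/model/config.json")
```

If this reports "zip-archive" for config.json, the JSON loader cannot read the file, and a real JSON config for the model is needed instead.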

Projecte Aina org

Hi Jordi,

We have stopped maintaining this model. Sorry, it is still available for administrative reasons, which causes confusion.

Was there a specific reason that you wanted to use the Coqui models? Our most advanced models can be tried here and the instructions to deploy them are in the About (Informació) tab. Let us know what would be the most appropriate way to help you. Thanks.

OK. The reason for using Coqui is that the project already uses Coqui for the other languages.
But it's understood. Thanks

jordimas changed discussion status to closed
Projecte Aina org

If Coqui is required for your setup, you can check this config file: https://huggingface.co/spaces/projecte-aina/tts-ca-coqui-vits-multispeaker/blob/main/config_multisteplr.json

However, we think that the matxa models have better quality and performance than the ones trained with Coqui.

wetdog changed discussion status to open

Thanks! The link gives a 404.

Is the Coqui model that you are sharing newer than the Catalan model provided by https://github.com/idiap/coqui-ai-TTS?
I'm trying to understand if there is value in using this model instead of the one currently provided by Coqui, which was also built with you :)

Thanks

Projecte Aina org

I'm sorry, it was pointing to a private space. You can find it here: https://huggingface.co/projecte-aina/tts-ca-coqui-vits-multispeaker/blob/main/model/config_multisteplr.json

This version was trained with more data and is more recent than the one uploaded to Coqui.
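
As a side note, the link above is a "blob" URL, which opens the file viewer on the Hugging Face website. To fetch the raw file from a script, the Hub serves the same path under "resolve" instead of "blob". A minimal sketch of that conversion, assuming the usual `/<owner>/<repo>/blob/<revision>/<path>` layout; the helper name `blob_to_resolve_url` is hypothetical:

```python
from urllib.parse import urlsplit, urlunsplit


def blob_to_resolve_url(url):
    """Turn a Hugging Face 'blob' (viewer) URL into a 'resolve' (download) URL."""
    parts = urlsplit(url)
    segments = parts.path.split("/")
    if "blob" not in segments:
        raise ValueError(f"not a blob URL: {url}")
    # Replace only the first 'blob' segment; this assumes the owner and
    # repository names are not themselves called 'blob'.
    segments[segments.index("blob")] = "resolve"
    return urlunsplit(parts._replace(path="/".join(segments)))


config_url = blob_to_resolve_url(
    "https://huggingface.co/projecte-aina/tts-ca-coqui-vits-multispeaker"
    "/blob/main/model/config_multisteplr.json"
)
```

The resulting URL can then be passed to any HTTP client to save the config locally before pointing Coqui at it.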

Thanks. I saw the fixes in the repo. I will try this in the next few days and let you know if there is a problem.

For your work to have maximum reach, I suggest you consider committing the new model to:
https://github.com/idiap/coqui-ai-TTS

It seems not too complicated:
https://github.com/idiap/coqui-ai-TTS/commit/cf765cb3f2c4b29f8c91eb5eda52ba2203a09eb3

Thanks again

Jordi
