KENZY: Text-to-Speech (TTS)

Parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| group | str | None | Device membership group |
| location | str | None | Location, e.g. "Living Room" |
| model.type | str | speecht5 | Options are: festival, speecht5 |
| model.target | str | gpu | SpeechT5 options: gpu, cpu |
| speaker | str | slt | SpeechT5 options: slt, clb, bdl, ksp, rms, jmk |
| external_player | str | None | External player command |
| cache.folder | str | ~/.kenzy/cache/speech | Folder for caching spoken phrases |
| offline | bool | false | Disables downloading of the models |
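
The defaults and allowed values listed above can be checked before a device is started. The following is a minimal sketch only: `check_tts_settings`, `DEFAULTS`, `MODEL_TYPES`, and `SPEECHT5_OPTIONS` are hypothetical names made up for illustration and are not part of Kenzy. It assumes that `model.target` and `speaker` only matter when `model.type` is `speecht5`, as the "SpeechT5 options" wording suggests.

```python
# Hypothetical helper, not part of Kenzy: the values mirror the parameter list.
DEFAULTS = {
    "group": None,
    "location": None,
    "model.type": "speecht5",
    "model.target": "gpu",
    "speaker": "slt",
    "external_player": None,
    "cache.folder": "~/.kenzy/cache/speech",
    "offline": False,
}

MODEL_TYPES = {"festival", "speecht5"}

# Assumption: these two settings are only relevant to the speecht5 model type.
SPEECHT5_OPTIONS = {
    "model.target": {"gpu", "cpu"},
    "speaker": {"slt", "clb", "bdl", "ksp", "rms", "jmk"},
}


def check_tts_settings(device):
    """Merge device settings over the defaults and collect any problems found."""
    merged = {**DEFAULTS, **device}
    problems = [f"unknown parameter: {key}" for key in device if key not in DEFAULTS]

    if merged["model.type"] not in MODEL_TYPES:
        problems.append(f"model.type: {merged['model.type']!r} is not one of {sorted(MODEL_TYPES)}")

    if merged["model.type"] == "speecht5":
        for key, allowed in SPEECHT5_OPTIONS.items():
            if merged[key] not in allowed:
                problems.append(f"{key}: {merged[key]!r} is not one of {sorted(allowed)}")

    if not isinstance(merged["offline"], bool):
        problems.append("offline: expected true or false")

    return merged, problems
```

Calling `check_tts_settings({"model.target": "cpu"})` returns the merged settings together with an empty problem list, while an unsupported speaker name produces one entry describing the allowed values.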

The speecht5 model type uses the microsoft/speecht5_tts model from Hugging Face. The festival model type calls the external festival program.

Note: Enable offline only after you have run the program at least once, so that all model files are fully downloaded. Once they are downloaded, turning offline mode on stops the program from trying to re-download the models, which allows it to run without an Internet connection.
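
As an illustration of what an offline switch can mean for a Hugging Face model, the `from_pretrained()` methods in the transformers library accept a `local_files_only` argument that reads from the local cache instead of contacting the Hub. The sketch below assumes the offline setting maps onto that argument; whether Kenzy does exactly this internally is an assumption, and `pretrained_kwargs` is a hypothetical helper name.

```python
def pretrained_kwargs(offline):
    """Build keyword arguments for a transformers from_pretrained() call."""
    # With local_files_only=True, transformers loads only from its local cache
    # and raises an error if the files were never downloaded.
    return {"local_files_only": bool(offline)}


# Usage sketch (needs the transformers package and, when offline, a populated cache):
# from transformers import SpeechT5Processor
# processor = SpeechT5Processor.from_pretrained(
#     "microsoft/speecht5_tts", **pretrained_kwargs(offline=True)
# )
```

This is why the first run should happen with offline disabled: with an empty cache, a load restricted to local files has nothing to read.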

The external_player option lets you specify a program, such as paplay, to play WAV files instead of the built-in Python player.
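
Handing a WAV file to an external player can be sketched with the Python standard library as follows. This is not Kenzy's actual implementation: `build_player_command` and `play_with_external_player` are hypothetical names, and the sketch assumes the player accepts the file path as its last argument, as paplay does.

```python
import shlex
import subprocess


def build_player_command(external_player, wav_path):
    """Turn the external_player setting and a WAV path into an argument list."""
    # shlex.split lets the setting carry its own flags, e.g. "aplay -q".
    return shlex.split(external_player) + [str(wav_path)]


def play_with_external_player(external_player, wav_path):
    """Run the player and wait for playback to finish."""
    # check=True raises CalledProcessError if the player exits with an error.
    subprocess.run(build_player_command(external_player, wav_path), check=True)
```

Passing an argument list rather than a single shell string avoids quoting problems when the cached file path contains spaces.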

Example YAML file

See Service Settings for options in the service group.

type: kenzy.tts

device: 
  location:                 Living Room
  group:                    Primary
  model.type:               speecht5
  model.target:             cpu
  speaker:                  slt
  cache.folder:             ~/.kenzy/cache/speech

service:
  host:                     0.0.0.0
  port:                     9702

Help & Support

Help and additional details are available at https://kenzy.ai