KENZY.Ai · GitHub license Python Versions Read the Docs GitHub release (latest by date)

This project is dedicated to building a "Synthetic Human" which is called Kenzy for which we have assigned the female gender pronoun of "she". She has intent determination (padatious) visual face recognition (opencv/opencv), speech transcription (whisper), and speech synthesis (speecht5/festival).

Visit our main site:

Kenzy's Architecture

Kenzy's architecture is divided into compartments. These compartments come with two main components: Servers and Devices. The servers focus on communication between other compartments and devices are designed to control input and output operations. Devices are always run within a server and a server can execute only one device. Servers talk to other servers using HTTP/HTTPS like standard web requests making customizing the communication fairly straightforward. The most important device is the kenzy.skillmanager which is a special type of device that collects data and provides the skill engine for reacting to inputs.

All options, configurations, and startup parameters are driven configuration files. There are a few examples available in the repository under the examples folder.

Python Module Overview

Class/Object Description
kenzy.core Core logic with inheritable objects for each device.
kenzy.extras Extra functions for UPNP/SSDP and other features.
kenzy.skillmanager Core skill manager (a.k.a. "The Brain")
kenzy.image Object/Face detection processing video capture (previously "Watcher")
kenzy.tts Text-to-speech models processing audio-output (previously "Speaker")
kenzy.stt Speech-to-text models processing audio-input (previously "Listener")


The quickest and easiest way to install Kenzy is to use our installation script:

wget -q -O && sh

Running the script exactly as shown above will install Kenzy and all components. If you want to be more selective you can add options as follows:

  • -b = Install skill manager dependencies (formerly the "Brain")
  • -l = Install stt dependencies (formerly the "Listener")
  • -s = Install tts dependencies (formerly the "Speaker")
  • -w = Install image dependencies (formerly the "Watcher")
  • -v [PATH] = Python virtual environment path (will create new if does not already exist)

Installer script has been tested on Ubuntu 22.04+ and Debian Buster.

Kenzy is available through pip, but to use the built-in devices there are a few extra libraries you may require. Please visit the Basic Install page for more details.

» HOWTO: Install

Web Control Panel

If everything is working properly you should be able to point your device to the web control panel running on the skillmanager device to test it out. The default URL is:

» http://localhost:9700/

Help & Support

Help and additional details is available at