Overview

Facephi Voice Service is a REST API service to which you can send audio files to be processed and get the result of the voice recognition process. The service offers a service to enroll a new voice, and another one to authenticate a voice.

The product is for speaker verification and voice liveness detection. It is based on the use of a voice template, which is a string that contains the voice biometric information. This voice template can be used to authenticate the voices in the future.

The audio format supported are:

WAV
MP3
Opus/OGG
AAC
WMA
PCM ulaw and mulaw
FLAC
ALAC (mov)
MP4
AIFF

The core distribution contains the following resources:

Docker container. The container is available in the Docker FacePhi repository and is based on Ubuntu 24.04. The container contains the service and all the dependencies needed to run it.
Online documentation. The documentation is available in the following URL: https://doc-voice-service.facephi.dev/