Overview

Facephi Voice Service is a REST API service to which you can send audio files to be processed and get the result of the voice recognition process. The service offers a service to enroll a new voice, and another one to authenticate a voice.

The product is for speaker verification and voice liveness detection. It is based on the use of a voice template, which is a string that contains the voice biometric information. This voice template can be used to authenticate the voices in the future.

The audio format supported are:

  • WAV
  • MP3
  • Opus/OGG
  • AAC
  • WMA
  • PCM ulaw and mulaw
  • FLAC
  • ALAC (mov)
  • MP4
  • AIFF

The core distribution contains the following resources:

  • Docker container. The container is available in the Docker FacePhi repository and is based on Ubuntu 24.04. The container contains the service and all the dependencies needed to run it.
  • Online documentation. The documentation is available in the following URL: https://doc-voice-service.facephi.dev/