Overview
Facephi Voice Service is a REST API service to which you can send audio files to be processed and get the result of the voice recognition process. The service offers a service to enroll a new voice, and another one to authenticate a voice.
The product is for speaker verification and voice liveness detection. It is based on the use of a voice template, which is a string that contains the voice biometric information. This voice template can be used to authenticate the voices in the future.
The audio format supported are:
- WAV
- MP3
- Opus/OGG
- AAC
- WMA
- PCM ulaw and mulaw
- FLAC
- ALAC (mov)
- MP4
- AIFF
The core distribution contains the following resources:
- Docker container. The container is available in the Docker FacePhi repository and is based on Ubuntu 24.04. The container contains the service and all the dependencies needed to run it.
- Online documentation. The documentation is available in the following URL: https://doc-voice-service.facephi.dev/