Hyperion: Speaker Recognition Toolkit

Hyperion is a speaker recognition toolkit based on PyTorch and NumPy. It provides:

x-Vector architectures: ResNet, Res2Net, Spine2Net, ECAPA-TDNN, EfficientNet, Transformers and others.
Embedding preprocessing tools: PCA, LDA, NAP, Centering/Whitening, Length Normalization, CORAL
Several flavours of PLDA back-ends: Full-rank PLDA, Simplified PLDA, PLDA
Calibration and Fusion tools
Recipes for popular datasets: VoxCeleb, NIST-SRE, VOiCES
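
To make the embedding preprocessing step above more concrete, here is a minimal sketch of centering, whitening, and length normalization written in plain NumPy. It does not use Hyperion's own classes; the function names (fit_centering_whitening, apply_preprocessing) are hypothetical and chosen only for illustration, and scaling to unit norm is an assumption, since some implementations scale to a different constant.

```python
import numpy as np


def fit_centering_whitening(x, eps=1e-10):
    """Estimate the mean and a whitening matrix from embeddings x of shape (n, d).

    Hypothetical helper for illustration; not part of Hyperion's API.
    """
    mu = x.mean(axis=0)
    cov = np.cov(x - mu, rowvar=False)
    # Eigendecomposition of the symmetric covariance matrix.
    eigval, eigvec = np.linalg.eigh(cov)
    eigval = np.maximum(eigval, eps)  # guard against tiny or negative eigenvalues
    # Scale each eigenvector (column) by 1/sqrt(eigenvalue).
    w = eigvec / np.sqrt(eigval)
    return mu, w


def apply_preprocessing(x, mu, w, eps=1e-10):
    """Center and whiten x, then scale every embedding to unit Euclidean norm."""
    y = (x - mu) @ w
    norms = np.linalg.norm(y, axis=1, keepdims=True)
    return y / np.maximum(norms, eps)


# Usage with synthetic embeddings (random data, for illustration only).
rng = np.random.default_rng(0)
train = rng.normal(size=(500, 4)) @ rng.normal(size=(4, 4))
mu, w = fit_centering_whitening(train)
processed = apply_preprocessing(train, mu, w)
```

In a typical pipeline the mean and whitening matrix would be estimated on a training set and then applied unchanged to enrollment and test embeddings before PLDA scoring.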
