Hyperion: Speaker Recognition Toolkit

Hyperion is a Speaker Recognition Toolkit based on PyTorch and numpy. It provides:
  • x-Vector architectures: ResNet, Res2Net, Spine2Net, ECAPA-TDNN, EfficientNet, Transformers and others.

  • Embedding preprocessing tools: PCA, LDA, NAP, Centering/Whitening, Length Normalization, CORAL

  • Several flavours of PLDA back-ends: Full-rank PLDA, Simplified PLDA, PLDA

  • Calibration and Fusion tools

  • Recipes for popular datasets: VoxCeleb, NIST-SRE, VOiCES

Contents:

Indices and tables