ai4bharat/IndicVoices
Viewer • Updated • 5.9M • 12k • 54
Fine-tuned speaker embedding model for Indian languages, based on pyannote wespeaker-voxceleb-resnet34-LM.
This model was fine-tuned on 112K+ audio samples from:
| Metric | Value |
|---|---|
| Best Val Accuracy | 91.4% |
| Best EER | 4.18% |
import torch
from pyannote.audio import Model
# Load base model
model = Model.from_pretrained("pyannote/wespeaker-voxceleb-resnet34-LM")
# Load fine-tuned weights
checkpoint = torch.load("checkpoint.pt")
# Note: This checkpoint includes a classification head for Indian languages
Base model
pyannote/wespeaker-voxceleb-resnet34-LM