Closed
Description
Line 22 in 07fafa3
Currently pyannote.audio is pinned to 3.0.0, but that release has been reported to run slower because the embedding model runs on CPU. Release 3.0.1 fixed this by replacing onnxruntime with onnxruntime-gpu.
It makes sense for whisperX to update pyannote.audio to 3.0.1. However, there is a conflict with faster_whisper over onnxruntime, as discussed here. Until it is resolved on the faster_whisper side, installing both leaves onnxruntime in CPU mode, so performance stays slow.
My current workaround is to run the following commands after installation:
pip install pyannote.audio==3.0.1
pip uninstall onnxruntime
pip install --force-reinstall onnxruntime-gpu
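To confirm that the commands above took effect, a quick check along these lines may help. It is only a sketch: the helper names (installed_version, has_cuda_provider, check_onnxruntime) are made up for illustration, and it assumes that seeing CUDAExecutionProvider in onnxruntime.get_available_providers() is a good enough signal that the GPU build is the one being imported. It does not prove that the CUDA libraries actually load when a session is created.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def has_cuda_provider(providers):
    """True if ONNX Runtime's CUDA execution provider is in the given list."""
    return "CUDAExecutionProvider" in providers


def check_onnxruntime():
    """Report which onnxruntime packages are installed and whether CUDA is listed."""
    report = {
        "onnxruntime": installed_version("onnxruntime"),
        "onnxruntime-gpu": installed_version("onnxruntime-gpu"),
        "providers": None,
        "cuda": False,
    }
    try:
        import onnxruntime as ort  # both packages share this import name
    except ImportError:
        return report  # neither package is importable
    report["providers"] = ort.get_available_providers()
    report["cuda"] = has_cuda_provider(report["providers"])
    return report


if __name__ == "__main__":
    print(check_onnxruntime())
```

If the report still shows a version for onnxruntime, or cuda is False, the CPU package is probably still shadowing the GPU one and the uninstall/reinstall steps may need to be repeated.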
Alternatively, use the old 2.1 model:
model = whisperx.DiarizationPipeline(model_name='pyannote/speaker-diarization@2.1', use_auth_token=YOUR_AUTH_TOKEN, device='cuda')