pyannote/speaker-diarization-3.0 runs slower than pyannote/speaker-diarization@2.1

https://github.com/m-bain/whisperX/blob/07fafa37b3ef7ce8628b194da302a5a996bb7d37/setup.py#L22
Currently pyannote.audio is pinned to 3.0.0, but it has been [reported](https://github.com/pyannote/pyannote-audio/issues/1481) that it performed slower because the embeddings model ran on CPU. As a result a new[ release 3.0.1](https://github.com/pyannote/pyannote-audio/pull/1478) fixed it by replacing`onnxruntime` with `onnxruntime-gpu`.

It makes sense for whisperX to update pyannote.audio to 3.0.1, however, there is a conflict with faster_whisper on `onnxruntime`, as discussed [here](https://github.com/guillaumekln/faster-whisper/issues/493). Until it is resolved on the faster_whisper side, installing both will end up `onnxruntime` still in CPU mode and thus slower performance.

My current workaround is running the following commands post installation
```sh
pip install pyannote.audio==3.0.1
pip uninstall onnxruntime
pip install --force-reinstall onnxruntime-gpu
```

Alternative, use the old 2.1 model.
```py
model = whisperx.DiarizationPipeline(model_name='pyannote/speaker-diarization@2.1', use_auth_token=YOUR_AUTH_TOKEN, device='cuda')
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

pyannote/speaker-diarization-3.0 runs slower than pyannote/[email protected] #499

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

pyannote/speaker-diarization-3.0 runs slower than pyannote/[email protected] #499

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions