Skip to content

feat: add offline diarization support with configuration file#1097

Open
semidark wants to merge 2 commits intom-bain:mainfrom
semidark:offline-diarization
Open

feat: add offline diarization support with configuration file#1097
semidark wants to merge 2 commits intom-bain:mainfrom
semidark:offline-diarization

Conversation

@semidark
Copy link
Copy Markdown

@semidark semidark commented Apr 3, 2025

  • Introduced OfflineDiarizationPipeline class for local speaker diarization using a specified configuration file.
  • Added command-line arguments to enable offline diarization and specify the config file path in transcribe.py.
  • Created pyannote_diarization_config.yaml for configuring the diarization pipeline parameters.

- Introduced `OfflineDiarizationPipeline` class for local speaker diarization using a specified configuration file.
- Added command-line arguments to enable offline diarization and specify the config file path in `transcribe.py`.
- Created `pyannote_diarization_config.yaml` for configuring the diarization pipeline parameters.
@ahmedcareweb
Copy link
Copy Markdown

Any updates on this one? I see there have been activity lately on the repo. I hope this one gets attention

@fcakyon
Copy link
Copy Markdown

fcakyon commented Jul 20, 2025

@Barabazs can you have a look at this pr? 😊

@ahmedcareweb
Copy link
Copy Markdown

I see that the repo is getting some attention back. 18 commits already in the last month. Can @semidark resolve the conflicts so this pr can be merged?

@semidark
Copy link
Copy Markdown
Author

semidark commented Mar 8, 2026

I can have a look at it next week. I was running my fork for 11 Months straight in my customers air gapped production environment. No problems there.

@LukeP-Flanagan
Copy link
Copy Markdown

This is exactly the fix I am currently looking for. I eagerly look forward to the conflict resolutions so that this can be merged. Or, alternatively, the updated fork being public.
Do you hard-code CPU-only processing? (In whisperx/offline_diarize.py --> class OfflineDiarizationPipeline:
def init(
self,
config_path,
device="cpu",
): )

Looking forward to hearing progress.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants