Skip to content

clamsproject/app-pyannotesd-wrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyAnnote Wrapper for Speaker Diarization

Description

This app is used to do speaker diarizaiton on audio and video files. This is a wrapper of PyAnnote 3.1.

User instruction

General user instructions for CLAMS apps are available at CLAMS Apps documentation.

System requirements

  • Requires Python3 with clams-python, torchaudio, pyannote.audio, and ffmpeg-python to run the app locally.
  • Requires an HTTP client utility (such as curl) to invoke and execute analysis.
  • Requires docker to run the app in a Docker container

Run pip install -r requirements.txt to install the requirements.

Speaker diarization for a file

PyAnnote requires users to agree with 2 user conditions to get access to their model on HuggingFace. Therefore, the user of this app must have satisfied this requirement with their own HuggingFace account. After agreeing with the conditions, to run this app in CLI, copy the access token and run:

python cli.py --huggingface_token <huggingface_access_token> <input_mmif_file_path> <output_mmif_file_path>

Configurable runtime parameter

For the full list of parameters, please refer to the app metadata from the CLAMS App Directory or the metadata.py file in this repository.

About

A wrapper for the speaker diarization model from pyannote

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published