mepc36/gentle

Gentle

Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

Getting Started

There are three ways to install Gentle.

  1. Download the pre-built Mac application. This package includes a GUI that starts the server and opens a browser. It works only on macOS.

  2. Use the Docker image. Just run docker run -P lowerquality/gentle. This works on all platforms supported by Docker.

  3. Download the source code and run ./install.sh. Then run python3 serve.py to start the server. This works on macOS and Linux.

Using Gentle

By default, the aligner listens at http://localhost:8765. That page has a graphical interface for transcribing audio, viewing results, and downloading data.

There is also a REST API, so you can use Gentle from your own programs. Here's an example of calling the API with curl:

curl -F "audio=@audio.mp3" -F "transcript=@words.txt" "http://localhost:8765/transcriptions?async=false"
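The same request can be sketched in Python using only the standard library. This is a minimal illustration, not part of Gentle itself: the helper names build_multipart and align are hypothetical, and it assumes the server answers the synchronous request with a JSON body.

```python
import json
import urllib.request
import uuid


def build_multipart(fields):
    """Encode {field_name: (filename, data_bytes)} as multipart/form-data.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    parts = []
    for name, (filename, data) in fields.items():
        header = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"; '
            f'filename="{filename}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode("utf-8")
        parts.append(header + data + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def align(audio_path, transcript_path, base_url="http://localhost:8765"):
    """POST audio and transcript to a running Gentle server (hypothetical helper).

    Mirrors the curl call: form fields "audio" and "transcript",
    endpoint /transcriptions?async=false.
    """
    with open(audio_path, "rb") as audio, open(transcript_path, "rb") as text:
        body, content_type = build_multipart({
            "audio": ("audio.mp3", audio.read()),
            "transcript": ("words.txt", text.read()),
        })
    request = urllib.request.Request(
        f"{base_url}/transcriptions?async=false",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        # Assumption: the response body is JSON.
        return json.loads(response.read().decode("utf-8"))
```

With the server running, align("audio.mp3", "words.txt") would return the parsed response. Long audio files can take a while to align, so a caller may want to pass a timeout to urlopen.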

If you've downloaded the source code you can also run the aligner as a command line program:

git clone https://github.com/lowerquality/gentle.git
cd gentle
./install.sh
python3 align.py audio.mp3 words.txt

By default, the aligner writes its JSON output to stdout. See python3 align.py --help for options.
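Because the command-line aligner prints JSON to stdout, it can be driven from another script. The sketch below is one way to do that; run_align is a hypothetical helper name, and it assumes that stdout contains only the JSON document and that the command is run from the gentle source directory.

```python
import json
import subprocess


def run_align(audio_path, transcript_path, cmd=("python3", "align.py")):
    """Run the aligner as a subprocess and parse its stdout as JSON.

    `cmd` is the command prefix; the audio and transcript paths are
    appended as positional arguments, as in the command shown above.
    """
    completed = subprocess.run(
        list(cmd) + [audio_path, transcript_path],
        stdout=subprocess.PIPE,  # capture only stdout; stderr passes through
        check=True,              # raise CalledProcessError on non-zero exit
    )
    # Assumption: stdout holds exactly one JSON document.
    return json.loads(completed.stdout.decode("utf-8"))
```

For example, result = run_align("audio.mp3", "words.txt") gives the parsed alignment as a Python object. The structure of that object is not described here, so inspect it before relying on particular keys.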
