-
Notifications
You must be signed in to change notification settings - Fork 3
Creating a diarization broadcast corpus
judyfong edited this page Jun 2, 2020
·
10 revisions
- Gecko or Audacity,
- rttm files (if you have linux then you can generate the rttm files yourselves)
- corresponding videos/audio of episode
Label speaker turns which last at least 1 minute.
Each speaker gets their own speaker number per recording.
Add all the speakers to one segment and copy over the list then remove them back all again to create initial list for the csv file.
Unknown speakers get labelled Unknown 01 etc.
- Generate the proposed rttm files for 28 episodes that week.
- Labelling - Gecko
- Open Gecko
- Upload the video file & rttm file
- Adjust the segment start and end times to match speaker turns.
- Add missing speaker turns.
- Correct speaker labels/numbers. Add new ones if necessary
- Write down the speaker names which correspond to each speaker number. These go in a csv file.
- Label music, foreign, or noise.
- Export as json, srt, and rttm.
- Labelling - Audacity
- Open Audacity
- Upload the video file & label file
- Adjust the segment start and end times to match speaker turns.
- Add missing speaker turns.
- Correct speaker labels/numbers. Add new ones if necessary
- Write down the speaker names which correspond to each speaker number. These go in a csv file.
- Label music, foreign, or noise in square brackets.
- Export label file.
- Repeat for a new episode.
- Turn in the corrected rttm, json, srt, and csv files then get new rttm and video files.
- Judy reports the new DER with that week's data.
format
<recording/episode id>, <speaker_number in rttm file>, <speaker name>
example
Fréttirkl1900-5022010T0,1, Bogi Águstsson