Allophoible

Allophoible is an extension of Version 2.0 of the PHOIBLE database with features for all phone segments that only occur as allophones and for additional phonemes that occur in eSpeak NG Output (From Version 1.51). We also include feature for some additional phones from Version 1.0 of the UCLA Phonetic Corpus that are not in PHOIBLE for future research. The extended database used in our work is included as allophoible.csv. Furthermore, we defined features for additional diacritics and two other symbols which were not included in the original PHOIBLE feature definition data. These feature definitions are included as additional-diacritic-features.csv for reference.

In addition to the raw feature data files, we modified and extended the feature generation code for automatically extending the base PHOIBLE database. We do not include additional code or resources for reproducing the original PHOIBLE database, which can be found here.

Format of new Phones

New phone definitions are appended as new rows to the original PHOIBLE database and assigned to the previously unused "InventoryID" 0. The "Glottocode", "ISO6393", "LanguageName" and "SpecificDialect" fields are set to NA, since the same phones can occur as allophones in inventories of different languages. The "Source" field for new phones indicates, whether phones were taken from the allophones from PHOIBLE ("phoible"), from eSpeak NG output ("espeak-ng") or from the UCLA Phonetic Corpus ("ucla").

Complex segments from PHOIBLE or eSpeak NG sometimes consist of a mixture of vowels and consonant, in which case the "SegmentClass" can be ambiguous. For PHOIBLE allophones, we disambiguate in these cases by taking the segment class from the phoneme that the phone is an allophone of. For eSpeak NG, the field is left as NA in these cases.

Reference

@inproceedings{glocker2023allophant,
    title={Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes},
    author={Glocker, Kevin and Herygers, Aaricia and Georges, Munir},
    year={2023},
    booktitle={{Proc. Interspeech 2023}},
    month={8}}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
allophoible		allophoible
data		data
raw-data		raw-data
scripts		scripts
LICENSE		LICENSE
README.md		README.md
all_missing.csv		all_missing.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Allophoible

Format of new Phones

Reference

About

Releases 1

Packages

Languages

License

Aariciah/allophoible

Folders and files

Latest commit

History

Repository files navigation

Allophoible

Format of new Phones

Reference

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages