CorpusTools 1.1.0 Release Notes¶
This is a major version release for Phonological CorpusTools.
Importing corpora¶
- Importing corpora functionality in the GUI received a large overhaul
- All types of corpora are imported through a single dialog
- PCT should autodetect many settings based on selected files or directories
- Autodetected settings can be edited and refined by the user
- Basic logging support saves parsing details entered by the user (i.e., multicharacter segments)
- Numbers in transcriptions can be parsed as stress, tone, or as a normal character (note that tone and stress are currently not supported in functions or phonological search)
Pronunciation variants¶
- All algorithms that analyze segments support four strategies for dealing with pronunciation variants: canonical forms, most frequent variants, separated tokens as types, and tokens weighted by their relative frequenies
- Algorithms that analyze words support two strategies for pronunciation variants: canonical forms and most frequent variants
- Exporting corpora can now export pronunciation variants (and their frequencies)
Functional load¶
- Added support for finding the average functional load of single segments
Phonotactic probability¶
- Fixed an issue where calculating biphone probabilities on single segment words would cause errors; now assigns a probability of 0 to those words
Kullback-Leibler divergence¶
- Added options to bring KL divergence in line with the other functions
- Added command line script for calculating KL divergence
GUI¶
- Added a dialog to the “View/change feature system” dialog to edit the categorization of segments into a coherent segment chart via features
- Features can be used as input to the analysis functions, i.e. functional load of voice in the corpus (segements that are +voice compared to segments that are -voice)
Segment selection¶
- Segment selection has been redone
- Segments can be selected via the inventory
- Features can be typed into the filter field, which will highlight segments that will be included with that feature selection
- Once a feature specification has been entered, that segment set can be locked in
Environments¶
- Environment creation has been revamped
- Users can select a set of center segments
- Right hand and left hand can be added, with multiple sets of segments on each side
Known issues¶
- Help pages for the Mac binary require internet connection to view, due to issues including .html files in the .app binary