Transcription

class corpustools.corpus.classes.lexicon.Transcription(seg_list)[source]

Transcription object, sequence of symbols

Parameters:

seg_list : list

List of segments that form the transcription. Elements in the list, can be Segments, strings, or BaseAnnotations

Attributes

_list (list) List of strings representing segment symbols
stress_pattern: dict Dictionary with keys of segment indices and values of the stress for that segment
boundaries (dict) Possible keys of ‘morpheme’ or ‘tone’ that keeps track of where morpheme or tone boundaries are inserted

Methods

__init__(seg_list)
find(environment) Find instances of an EnvironmentFilter in the Transcription
find_nonmatch(environment) Find all instances of an EnvironmentFilter in the Transcription
match_segments(segments) Returns a matching segments from a list of segments
with_word_boundaries() Return the string of segments with word boundaries surrounding them
find(environment)[source]

Find instances of an EnvironmentFilter in the Transcription

Parameters:

environment : EnvironmentFilter

EnvironmentFilter to search for

Returns:

list

List of Environments that fit the EnvironmentFilter

find_nonmatch(environment)[source]

Find all instances of an EnvironmentFilter in the Transcription that match in the middle segments, but don’t match on the sides

Parameters:

environment : EnvironmentFilter

EnvironmentFilter to search for

Returns:

list

List of Environments that fit the EnvironmentFilter’s middle but not the sides

match_segments(segments)[source]

Returns a matching segments from a list of segments

Parameters:

segments : list

List of Segments or strings to filter the Transcription

Returns:

list

List of segments (in their original order) that match the segment parameter

with_word_boundaries()[source]

Return the string of segments with word boundaries surrounding them

Returns:

list

Transcription with word boundaries