Research

Emotion Profile Refinery for Speech Emotion Classification
Human emotions are inherently ambiguous and impure. When designing systems to anticipate human emotions based on speech, the lack of emotional purity must be considered.
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder
Speech sound disorder (SSD) refers to a developmental disorder in which children encounter persistent difficulties in correctly pronouncing words.
Nagoya, Japan. 1993

An NN based tone classifier for Cantonese

by Tan Lee; P.C. Ching; L.W. Chan; B. Mak
Tone identification is undoubtedly an essential component of Chinese speech recognition, particularly for the Cantonese dialect, which is well known for being very rich in tones.
Using a large vocabulary containing 234 distinct syllables, the system performance for the single-speaker and multispeaker cases is found to be 89% and 87%, respectively.
For each particular speech unit, a fully connected recurrent neural network is built such that the static and dynamic speech characteristics are represented simultaneously by a specific temporal pattern of neuron activation states.
Detroit, USA. 1995

Recurrent neural networks for speech modeling and speech recognition

by Tan Lee; P.C. Ching; L.W. Chan; B. Mak