Research

Emotion Profile Refinery for Speech Emotion Classification

Human emotions are inherently ambiguous and impure. When designing systems to anticipate human emotions based on speech, the lack of emotional purity must be considered.

Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder

Speech sound disorder (SSD) refers to the developmental disorder in which children encounter persistent difficulties in correctly pronouncing words.

Nagoya, Japan. 1993

An NN based tone classifier for Cantonese

by Tan Lee; P.C. Ching; L.W. Chan; B. Mak

Tone identification is undoubtedly an essential component in the speech recognition problem of Chinese, specifically for the Cantonese dialect, which is well known for being very rich in tones.

Using a large vocabulary containing 234 distinct syllables, the system performance for the single-speaker and multispeaker cases is found to be 89% and 87%, respectively.

Detroit, USA. 1995

Recurrent neural networks for speech modeling and speech recognition

by Tan Lee; P.C. Ching; L.W. Chan; B. Mak

For each particular speech unit, a fully connected recurrent neural network is built such that the static and dynamic speech characteristics are represented simultaneously by a specific temporal pattern of neuron activation states.

More research on CUHK Portal

More research on Google Scholar