Research
Emotion Profile Refinery for Speech Emotion Classification
Human emotions are inherently ambiguous and impure. When
designing systems to anticipate human emotions based on
speech, the lack of emotional purity must be considered.
- 2020/8/12
- arXiv preprint arXiv:2008.05259
- Shuiyang Mao, PC Ching, Tan Lee
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder
Speech sound disorder (SSD) refers to the developmental disorder in which children encounter persistent difficulties in correctly pronouncing words.
- 2020/8/7
- arXiv preprint arXiv:2008.03193
- Si-Ioi Ng, Tan Lee
Nagoya, Japan. 1993
An NN based tone classifier for Cantonese
by Tan Lee; P.C. Ching; L.W. Chan; B. Mak
Tone identification is undoubtedly an essential component in the speech recognition problem of Chinese, specifically for the Cantonese dialect which is well known of being very rich in tones.
Using a large vocabulary containing 234 distinct syllables, the system performance for single-speaker and multispeaker cases are found to be 89% and 87% respectively.
For each particular speech unit, a fully connected recurrent neural network is built such that the static and dynamic speech characteristics are represented simultaneously by a specific temporal pattern of neuron activation states.