Real-Time End-to-End Speech Emotion Recognition with Cross-Domain Adaptation
Language resources are the main factor in deep-learning-based speech emotion recognition (SER) models. Thai is a low-resource language with less available data than high-resource languages such as German. This paper describes a framework that uses a pretrained-model-based front-end and back-end network to adapt feature spaces from the speech rec