Адрес e-mail:

Корейско-российский День науки и технологии в МФТИ

6 июня в 119 ГК в рамках корейско-российского Дня науки и технологии состоятся лекции профессора Корейского института передовых технологий (KAIST) Ли Су Янга и профессора Университета Корё (Korea University) Нам Кичуна.

이수영.jpg
С 10:30 до 11:00 пройдёт лекция Ли Су Янга на тему «Hierarchical Committee and Top-Down Attention for Robust Classification: Cases for Emotional Facial Expression and Noisy Speech Recognition». 

Abstract

This talk consists of two approaches to provide robust classification performance in real world applications, i.e., emotion recognition from facial expression and speech recognition in noisy environment.


The emotional facial expression recognition incorporating a hierarchical committee machine won the emotion-from-images competition at the third Emotion Recognition in the Wild ( EmotiW2015 ) challenge. We trained multiple deep convolutional neural networks (CNNs) as committee members and combined their decisions with two strategies:  in order to obtain diverse decisions from deep CNNs, we incorporated several different network architectures, input normalization, and random weight initializations for training these deep models, and in order to form a better committee in structural and decisional aspects, we constructed a hierarchical architecture of the committee with exponentially-weighted decision fusion. For the recognition of seven emotional categories in the wild, we achieved a test accuracy of 61.6 %. Moreover, on other public databases, our hierarchical committee of deep CNNs yielded superior performance, outperforming or competing with the state-of-the-art results for these databases.


To achieve high accuracy for noisy speech recognition, we also incorporated top-down attention which automatically assigned attention gain on input and/or hidden variables for higher confidence on the classification decision. [2=4] Although it is basically similar to recent attentive networks, unlike image recognition tasks in big background, the segmentation of speech even in noisy environment is relatively easy task and we applied the top-down attention only at test phase. Also, several top candidate classes were attended and only the class with the maximum confidence was selected as the final decision. This approach successfully resulted in sequential recognition of superimposed patterns and continuous speech recognition in noisy environment.


С 11:00 до 11:30 — лекция Нам Кичуна на тему «Learning Sciences of Brain and ICT».

Если вы заметили в тексте ошибку, выделите её и нажмите Ctrl+Enter.

МФТИ в социальных сетях

soc-vk soc-fb soc-tw soc-li soc-li soc-yt
Яндекс.Метрика