INFORMATION
탐색 건너뛰기 링크입니다.
RESEARCH
Y.J.KIM,  Dept. of  Computer Engineering, Hanbat National University
   International Conference
 IEEE Best Paper Award : A Voice Activity Detection Model composed of Bidirectional Long-Short Term Memory and Attention Mechanism.(IEEE Best Paper Award)
   
 

Abstract:

  • In this study, we proposed a deep learning model that consists of the bidirectional Long-Short Term Memory (bi-LSTM) and the attention mechanism to perform frame-wise Voice Activity Detection (VAD). The bi-LSTM extracts annotations of frame by summarizing information from both direction. The attention mechanism accepts the annotations to extracts such frames that are important to the voice activity judgement and aggregates the representation of those informative frames to form an attention distribution vector. It is used as features for frame classification by logistic classification approach. We constructed four comparative models to perform experiments with TIMIT corpus and noise signals. The excrement shows that the proposed model outperforms the conventional VAD with LSTM. And we showed how the attention mechanism can help VAD tasks by visualizing the attention distribution of the model.
  • https://ieeexplore.ieee.org/document/8666342

 

IEEE Best Paper ward

  • The International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (IEEE HNICEM 2018)
  • Best paper Ward
 
2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM), Baguio City, Philippines, 2018, pp. 1-5, doi: 10.1109/HNICEM.2018.8666342.  
  2018-12-01/2020-08-04/김윤중