语音识别

2017年1月26日 (四) 19:08Wyf wiki（讨论 | 贡献）的版本

(差异) ←上一版本 | 最后版本 (差异) | 下一版本→ (差异)

语音识别

语音识别，Automatic Speech Recognition，简称ASR

基本工具

LSTM

Long short term memory neural network

Long short term memory neural computation, Neural computation 9 (8), 1735-1780, 1997. LSTM

CTC

Connectionist temporal classification

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, ICML 2006.

GRU

Gated Recursive Unit

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, SSST-8, 2014.

研究

传统方法综述

Karpagavalli, S., and E. Chandra. "A Review on Automatic Speech Recognition Architecture and Approaches." International Journal of Signal Processing, Image Processing and Pattern Recognition 9, No. 4 (2016): 393-404.

Google

Alex Graves，DeepMind研究员，语音识别多项技术开创者

Speech recognition with deep recurrent neural networks, 2013.
Hybrid speech recognition with deep bidirectional LSTM, ASRU 2013.
Towards End-To-End Speech Recognition with Recurrent Neural Networks, ICML 2014.
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, ICML 2006.

Google Speech

Google Speech Processing from Mobile to Farfield, CHiME 2016. Google_Speech_Processing

最后修改于2017年1月26日 (星期四) 19:08