“大数据智能-论文阅读”版本间的差异
(以“阅读论文清单 基础 ImageNet: A Large-Scale Hierarchical Image Database, CVPR 2009. ImageNet Classification with Deep Convolutional Neural Networks, NIPS 20...”为内容创建页面) |
|||
第1行: | 第1行: | ||
阅读论文清单 | 阅读论文清单 | ||
− | + | 图像识别基础 | |
ImageNet: A Large-Scale Hierarchical Image Database, CVPR 2009. | ImageNet: A Large-Scale Hierarchical Image Database, CVPR 2009. | ||
第11行: | 第11行: | ||
Region-based convolutional networks for accurate object detection and segmentation, IEEE PAMI 2016. | Region-based convolutional networks for accurate object detection and segmentation, IEEE PAMI 2016. | ||
− | + | 语音识别基础 | |
Long short term memory neural computation, Neural computation 9 (8), 1735-1780, 1997. LSTM | Long short term memory neural computation, Neural computation 9 (8), 1735-1780, 1997. LSTM | ||
第18行: | 第18行: | ||
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, SSST-8, 2014. | On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, SSST-8, 2014. | ||
− | |||
2017年3月24日 (五) 10:40的版本
阅读论文清单
图像识别基础
ImageNet: A Large-Scale Hierarchical Image Database, CVPR 2009.
ImageNet Classification with Deep Convolutional Neural Networks, NIPS 2012.
DeepFace: Closing the Gap to Human-Level Performance in Face Verification, CVPR 2014.
Region-based convolutional networks for accurate object detection and segmentation, IEEE PAMI 2016.
语音识别基础
Long short term memory neural computation, Neural computation 9 (8), 1735-1780, 1997. LSTM
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, ICML 2006.
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, SSST-8, 2014.
第一次 Speech Recognition
Deep Speech 2 End-to-End Speech Recognition in English and Mandarin, JMLR 2016.
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding, ASRU 2015.
Learning the Speech Front-end With Raw Waveform CLDNNs, InterSpeech 2015.
Convolutional, long short-term memory, fully connected deep neural networks, ICASSP 2015.
Context dependent phone models for LSTM RNN acoustic modelling, ICASSP 2015.
Listen, attend and spell: A neural network for large vocabulary conversational speech recognition, ICASSP 2015.
Audio augmentation for speech recognition, InterSpeech 2015.
第二次 TTS (Text To Speech)
DeepVoice: Real-Time Neural Text-to-Speech
Wavenet: A generative model for raw audio
Char2wav: End-to-end speech synthesis
第二次 Recommender System
[x] Covington P, Adams J, Sargin E. Deep neural networks for youtube recommendations, ACM Recsys, 2016.
第三次 Reinforcement learning
Learning Locomotion Skills Using DeepRL: Does the Choice of Action Space Matter?
Off-Policy Neural Fitted Actor-Critic
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Guided Deep Reinforcement Learning for Additive Manufacturing Control Application
Deep Visual Foresight for Planning Robot Motion