Difference between revisions of "Big Data Intelligence: Paper Reading"

From iCenter Wiki

Revision as of 10:40, 24 March 2017 (Fri)

Paper reading list

Image recognition basics

ImageNet: A Large-Scale Hierarchical Image Database, CVPR 2009.

ImageNet Classification with Deep Convolutional Neural Networks, NIPS 2012.

DeepFace: Closing the Gap to Human-Level Performance in Face Verification, CVPR 2014.

Region-based convolutional networks for accurate object detection and segmentation, IEEE PAMI 2016.

Speech recognition basics

Long Short-Term Memory, Neural Computation 9 (8), 1735-1780, 1997. (LSTM)

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, ICML 2006.

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, SSST-8, 2014.


Session 1: Speech Recognition

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin, JMLR 2016.

EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding, ASRU 2015.

Learning the Speech Front-end With Raw Waveform CLDNNs, InterSpeech 2015.

Convolutional, long short-term memory, fully connected deep neural networks, ICASSP 2015.

Context dependent phone models for LSTM RNN acoustic modelling, ICASSP 2015.

Listen, attend and spell: A neural network for large vocabulary conversational speech recognition, ICASSP 2016.

Audio augmentation for speech recognition, InterSpeech 2015.


Session 2: TTS (Text To Speech)

DeepVoice: Real-Time Neural Text-to-Speech

Wavenet: A generative model for raw audio

Char2wav: End-to-end speech synthesis


Session 2: Recommender System

[x] Covington P, Adams J, Sargin E. Deep Neural Networks for YouTube Recommendations, ACM RecSys, 2016.




Session 3: Reinforcement Learning

Learning Locomotion Skills Using DeepRL: Does the Choice of Action Space Matter?

Off-Policy Neural Fitted Actor-Critic

A Deep Hierarchical Approach to Lifelong Learning in Minecraft

Guided Deep Reinforcement Learning for Additive Manufacturing Control Application

Deep Visual Foresight for Planning Robot Motion