1 |
J. Markoff, "Scientists See Promise in Deep-Learning Programs," New York Times. November 24, 2012.
|
2 |
G. Marcus, "Is 'Deep Learning' a Revolution in Artificial Intelligence?" The New Yorker, November 25, 2012.
|
3 |
G. Hinton, S. Osindero, Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation Vol.18, pp. 1527-1554, 2006.
DOI
|
4 |
M. Minsky, S. Papert, Perceptrons, Cambridge, MA: MIT Press, 1969.
|
5 |
D. E. Rumelhart, G. E. Hinton, R. J. Williams, "Learning internal representations by error propagation" in Parallel Distributed Processing, MIT Press, 1986, pp. 318-362.
|
6 |
K. Fukushima, "Neocognitron: A self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position," Biological Cybernetics, Vol. 36, No. 4, pp.193-202, 1980.
DOI
|
7 |
A. Graves, J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, Vol. 18, No. 5-6, pp.602-610, 2005.
DOI
|
8 |
P. Baldi, P. J. Sadowski, "Understanding dropout," Advances in Neural Information Processing Systems (NIPS), 2013, pp.2814-2822.
|
9 |
C. M. Bishop, Neural Networks for Pattern Recognition, Oxford: Oxford University Press, 1995.
|
10 |
M. Riesenhuber, T. Poggio, "Hierarchical models of object recognition in cortex," Nature Neuroscience, Vol. 2, No. 11, pp.1019-1025, 1999.
DOI
|
11 |
R. Kurzweil, How to Create a Mind: The Secret of Human Thought Revealed. Penguin Books, 2012.
|
12 |
Y. Bengio, E. Thibodeau-Laufer, G. Alain, J. Yosinski, "Deep Generative Stochastic Networks Trainable by Backprop," In Proc. of International Conference on Machine Learning (ICML), 2014.
|
13 |
A. Graves. "Generating Sequences With Recurrent Neural Networks," 2014.
|
14 |
D. Marr, "Vision: A Computational Investigation into Human Representation and Processing of Visual Information," Freeman, San Francisco, 1982.
|
15 |
R. A. Brooks, "Elephants Don't Play Chess," Robotics and Autonomous Systems, Vol. 6, pp.3-15, 1990.
DOI
|
16 |
J. Schmidhuber, "Deep Learning in Neural Networks: An Overview," Technical Report IDSIA-03-14, 2014.
|
17 |
C. Farabet, B. Martini, B. Corda, P. Akselrod, E. Culurciello, Y. LeCun, "NeuFlow: A Runtime Reconfigurable Dataflow Processor for Vision", in Proc. of the Fifth IEEE Workshop on Embedded Computer Vision (ECV), Colorado Springs, 2011.
|
18 |
http://spectrum.ieee.org/robotics/artificial-intelligence/machinelearning-maestro-michael-jordan-on-the-delusions-of-big-data-and-other-huge-engineering-efforts
|
19 |
L. Deng, "Three classes of deep learning architectures and their applications: a tutorial survey," APSIPA Transactions on Signal and Information Processing, 2012.
|
20 |
G. Hinton, R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science , Vol. 313, No. 5786, pp. 504-507, Jul. 2006.
DOI
|
21 |
A. Krizhevsky, I. Sutskever, G. Hinton, "ImageNet classification with deep convolutional neural networks," Advances in Neural Information Processing (NIPS), Lake Taho, NV, 2012.
|
22 |
J. Markoff, "How Many Computers to Identify a Cat? 16,000," New York Times. June 25, 2012.
|
23 |
P. Smolensky, "Information Processing in Dynamical Systems: Foundations of Harmony Theory," in Parallel distributed processing: Explorations in the microstructure of cognition , Vol. 1 , MIT Press, Cambridge, MA, 1986, pp. 194-281.
|
24 |
S. J. Thorpe, M. Fabre-Thorpe, "Seeking Categories in the Brain," Science . Vol. 291, No. 5502, pp.260-262, Jan. 2001.
DOI
|
25 |
A. Graves, A. Mohamed, G. Hinton. "Speech Recognition with Deep Recurrent Neural Networks," International Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2013, Vancouver, Canada.
|
26 |
Y. Bengio, A. Courville, P. Vincent, "Representation learning: A review and new perspectives," Pattern Analysis and Machine Intelligence, IEEE Transactions on , Vol. 35, No. 8, pp.1798-1828, 2013.
DOI
|
27 |
D. C. Ciresan, U. Meier, J. Masci, J. Schmidhuber, "A committee of neural networks for traffic sign classification," In Proc. of International Joint Conference on Neural Networks (IJCNN), 2011, pp.1918-1921.
|
28 |
Y. Bengio, P. Simard, P. Frasconi, "Learning long-term dependencies with gradient descent is difficult," IEEE Transactions on Neural Networks, Vol. 5, No. 2, pp.157-166, 1994.
DOI
|
29 |
Y. Jia, "Caffe: An Open Source Convolutional Architecture for Fast Feature Embedding," 2013, http://caffe.berkeleyvision.org/.
|
30 |
T. Mikolov, W.-T. Yih, G. Zweig, "Linguistic Regularities in Continuous Space Word Representations," In Proc. of NAACL HLT, 2013.
|
31 |
R. Socher, M. Ganjoo, H. Sridhar, O. Bastani, C. D. Manning, A. Y. Ng, "Zero-shot learning through cross-modal transfer," In Proc. of International Conference on Learning Representations (ICLR), Scottsdale, AZ, 2013.
|