Bishop, C. M. (2006) Pattern Recognition and Machine Learning. Chapter 5: Neural Networks.
Schmidhuber, J. (2015). Deep Learning in Neural Networks: An Overview. Neural Networks 61: 85-117.
Bengio, Y., LeCun, Y., Hinton, G. (2015). Deep Learning. Nature 521: 436-44.
Goodfellow, I., Bengio, Y. and Courville, A. (2016) Deep Learning. MIT Press.
A little bit more complete list of references and online resources is here.
An extensive list of references can also be found at https://github.com/terryum/awesome-deep-learning-papers and http://deeplearning.net/reading-list/