Resources, references and datasets
Non-exhaustive collection of resources that I find helpful, datasets that I’m using/have used as part of my work. Kept here for my own references and posterity. None are listed in any paticular order.
Datasets
Images
- MNIST Handwritten Numbers
- CelebA Large celebrities faces dataset with attributes
- 10,117 identities
- 202,599 face images
- MS-Celeb-1M
- ~10M images
- 100k celebrities
- SVHN Street View House Numbers
- ImageNet THE image dataset
Audio
- VCTK Corpus - English multi-speaker corpus
- AudioSet - Sound Vocabulary dataset with video
- Beethoven Sonata Dataset
- In Wayback Machine (archive.org)
- Download script based off SampleRNN, Mehri et al 2016
Autonomous Vehicles
- KITTI Dataset
- Multiple Cameras
- LiDAR
- GPS/IMU
- Object Labels and bounding boxes
- Citiscapes Dataset
- Multiple Cameras
- Pixel level annotation
- Various weather conditions
References
Online Resources
- Useful Formulae List - Maintained by Huihan chin and me
- NIPS 2016 Tutorial: Generative Adversarial Networks - Ian Goodfellow
- Momentum in gradient descent - Nice illustration and post by Gabriel Goh
- Tensorflow playground
Books
- Numerical Optimization Jorge Nocedal, Stephen J. Wright (Springer)
- Probabilistic Graphical Models: Principles and Techniques - Daphne Koller, Nir Friedman
- Pattern Recognition and Machine Learning - Christopher M. Bishop
- Deep Learning - Ian Goodfellow, Yoshua Bengio, Aaron Courville (online version)