Computer Science & AI · 2012
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
Overview
'AlexNet' showed that deep convolutional neural networks, trained on GPUs, could crush the competition at large-scale image recognition. Its dramatic win on the ImageNet benchmark ignited the deep-learning revolution. (Linked DOI is the journal version of record; the original appeared at NeurIPS 2012.)
Sparked the modern deep-learning era in artificial intelligence.
Key findings
Methods
A large convolutional neural network with five convolutional and three fully connected layers was trained on 1.2 million labelled ImageNet images using two GPUs, ReLU nonlinearities, dropout, and data augmentation.
Keywords
Related papers
Computer Science & AI
Highly Accurate Protein Structure Prediction with AlphaFold
AlphaFold solved a 50-year grand challenge in biology: predicting a protein's three-dimensional shape from its amino-acid sequence. Its accuracy approached that of experimental methods, transforming structural biology.
Computer Science & AI
Attention Is All You Need
The paper that introduced the Transformer, an architecture built entirely on attention mechanisms with no recurrence or convolution. It became the foundation of modern large language models such as GPT and BERT.
Computer Science & AI
Deep Learning
A landmark review by three pioneers of the field, synthesising how deep neural networks learn layered representations of data. It explains why depth works and surveys the advances that made deep learning dominant across vision, speech, and language.