- See Also
- Links
- “Artificial Intelligence-supported Screen Reading versus Standard Double Reading in the Mammography Screening With Artificial Intelligence Trial (MASAI): a Clinical Safety Analysis of a Randomised, Controlled, Non-inferiority, Single-blinded, Screening Accuracy Study”, Lång et al 2023
- “Hand-drawn Anime Line Drawing Colorization of Faces With Texture Details”, Akita et al 2023
- “High-Quality Synthetic Character Image Extraction via Distortion Recognition”, Sawada et al 2023
- “Loss of Plasticity in Deep Continual Learning”, Dohare et al 2023
- “Neural Networks Trained With SGD Learn Distributions of Increasing Complexity”, Refinetti et al 2023
- “Rosetta Neurons: Mining the Common Units in a Model Zoo”, Dravid et al 2023
- “Improving Neural Network Representations Using Human Similarity Judgments”, Muttenthaler et al 2023
- “U-Net CNN in APL: Exploring Zero-Framework, Zero-Library Machine Learning”, Hsu & Serrão 2023
- “VanillaNet: the Power of Minimalism in Deep Learning”, Chen et al 2023
- “Multi-Label Classification in Anime Illustrations Based on Hierarchical Attribute Relationships”, Lan et al 2023
- “ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification”, Taesiri et al 2023
- “Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration”, Lan et al 2023
- “Loss Landscapes Are All You Need: Neural Network Generalization Can Be Explained Without the Implicit Bias of Gradient Descent”, Chiang et al 2023
- “Does Progress on ImageNet Transfer to Real-world Datasets?”, Fang et al 2023
- “Pruning Compact ConvNets for Efficient Inference”, Ghosh et al 2023
- “EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers”, Mahdad et al 2022
- “Pretraining Without Attention”, Wang et al 2022
- “What Do Vision Transformers Learn? A Visual Exploration”, Ghiasi et al 2022
- “Simulated Automated Facial Recognition Systems As Decision-aids in Forensic Face Matching Tasks”, Carragher & Hancock 2022
- “Predicting Sex, Age, General Cognition and Mental Health With Machine Learning on Brain Structural Connectomes”, Yeung et al 2022
- “Interpreting Neural Networks through the Polytope Lens”, Black et al 2022
- “The Power of Ensembles for Active Learning in Image Classification”, Beluch et al 2022
- “GCN-Based Multi-Modal Multi-Label Attribute Classification in Anime Illustration Using Domain-Specific Semantic Features”, Lan et al 2022
- “The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data Regimes”, Kocsis et al 2022
- “Understanding the Covariance Structure of Convolutional Filters”, Trockman et al 2022
- “VICRegL: Self-Supervised Learning of Local Visual Features”, Bardes et al 2022
- “Omnigrok: Grokking Beyond Algorithmic Data”, Liu et al 2022
- “FastSiam: Resource-Efficient Self-supervised Learning on a Single GPU”, Pototzky et al 2022
- “Evaluation of Transfer Learning Methods for Detecting Alzheimer’s Disease With Brain MRI”, Dhinagar et al 2022
- “Reassessing Hierarchical Correspondences between Brain and Deep Networks through Direct Interface”, Sexton & Love 2022
- “Timesweeper: Accurately Identifying Selective Sweeps Using Population Genomic Time Series”, Whitehouse & Schrider 2022
- “RHO-LOSS: Prioritized Training on Points That Are Learnable, Worth Learning, and Not Yet Learnt”, Mindermann et al 2022
- “BigVGAN: A Universal Neural Vocoder With Large-Scale Training”, Lee et al 2022
- “Continual Pre-Training Mitigates Forgetting in Language and Vision”, Cossu et al 2022
- “Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs (RepLKNet)”, Ding et al 2022
- “Variational Autoencoders Without the Variation”, Daly et al 2022
- “On the Effectiveness of Dataset Watermarking in Adversarial Settings”, Tekgul & Asokan 2022
- “Approximating CNNs With Bag-of-local-Features Models Works Surprisingly Well on ImageNet”, Wiel et al 2022
- “ConvMixer: Patches Are All You Need?”, Trockman & Kolter 2022
- “HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning”, Zhmoginov et al 2022
- “ConvNeXt: A ConvNet for the 2020s”, Liu et al 2022
- “An Empirical Investigation of the Role of Pre-training in Lifelong Learning”, Mehta et al 2021
- “Noether Networks: Meta-Learning Useful Conserved Quantities”, Alet et al 2021
- “The Efficiency Misnomer”, Dehghani et al 2021
- “Logical Activation Functions: Logit-space Equivalents of Probabilistic Boolean Operators”, Lowe et al 2021
- “Evaluating Loss Functions for Illustration Super-Resolution Neural Networks”, Nepomuceno & Silva 2021
- “Deep Learning Models of Cognitive Processes Constrained by Human Brain Connectomes”, Zhang et al 2021
- “Decoupled Contrastive Learning”, Yeh et al 2021
- “Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm (DeCLIP)”, Li et al 2021
- “THINGSvision: A Python Toolbox for Streamlining the Extraction of Activations From Deep Neural Networks”, Muttenthaler & Hebart 2021
- “Predicting Phenotypes from Genetic, Environment, Management, and Historical Data Using CNNs”, Washburn et al 2021
- “Dataset Distillation With Infinitely Wide Convolutional Networks”, Nguyen et al 2021
- “Revisiting the Calibration of Modern Neural Networks”, Minderer et al 2021
- “Partial Success in Closing the Gap between Human and Machine Vision”, Geirhos et al 2021
- “CoAtNet: Marrying Convolution and Attention for All Data Sizes”, Dai et al 2021
- “Effect of Pre-Training Scale on Intra/Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images”, Cherti & Jitsev 2021
- “Embracing New Techniques in Deep Learning for Estimating Image Memorability”, Needell & Bainbridge 2021
- “Predicting Sex from Retinal Fundus Photographs Using Automated Deep Learning”, Korot et al 2021
- “Rethinking and Improving the Robustness of Image Style Transfer”, Wang et al 2021
- “The Surprising Impact of Mask-head Architecture on Novel Class Segmentation”, Birodkar et al 2021
- “Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, Northcutt et al 2021
- “Learning from Videos to Understand the World”, Zweig et al 2021
- “Fast and Accurate Model Scaling”, Dollár et al 2021
- “Transfer of Fully Convolutional Policy-Value Networks Between Games and Game Variants”, Soemers et al 2021
- “Hiding Data Hiding”, Wu et al 2021
- “Explaining Neural Scaling Laws”, Bahri et al 2021
- “NFNet: High-Performance Large-Scale Image Recognition Without Normalization”, Brock et al 2021
- “Brain2Pix: Fully Convolutional Naturalistic Video Reconstruction from Brain Activity”, Le et al 2021
- “E(3)-Equivariant Graph Neural Networks for Data-Efficient and Accurate Interatomic Potentials”, Batzner et al 2021
- “Meta Pseudo Labels”, Pham et al 2021
- “Converting Tabular Data into Images for Deep Learning With Convolutional Neural Networks”, Zhu et al 2021
- “Is MLP-Mixer a CNN in Disguise? As Part of This Blog Post, We Look at the MLP Mixer Architecture in Detail and Also Understand Why It Is Not Considered Convolution Free.”
- “Taming Transformers for High-Resolution Image Synthesis”, Esser et al 2020
- “Ensemble Learning of Convolutional Neural Network, Support Vector Machine, and Best Linear Unbiased Predictor for Brain Age Prediction: ARAMIS Contribution to the Predictive Analytics Competition 2019 Challenge”, Couvy-Duchesne et al 2020
- “Scaling down Deep Learning”, Greydanus 2020
- “Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images”, Child 2020
- “Understanding RL Vision: With Diverse Environments, We Can Analyze, Diagnose and Edit Deep Reinforcement Learning Models Using Attribution”, Hilton et al 2020
- “Fourier Neural Operator for Parametric Partial Differential Equations”, Li et al 2020
- “Deep Learning-based Classification of the Polar Emotions of ‘moe’-style Cartoon Pictures”, Cao et al 2020b
- “Demonstrating That Dataset Domains Are Largely Linearly Separable in the Feature Space of Common CNNs”, Dragan 2020
- “Optimal Peanut Butter and Banana Sandwiches”, Rosenthal 2020
- “Accuracy and Performance Comparison of Video Action Recognition Approaches”, Hutchinson et al 2020
- “A Digital Biomarker of Diabetes from Smartphone-based Vascular Signals”, Avram et al 2020
- “On Robustness and Transferability of Convolutional Neural Networks”, Djolonga et al 2020
- “NVAE: A Deep Hierarchical Variational Autoencoder”, Vahdat & Kautz 2020
- “CoCoNuT: Combining Context-Aware Neural Translation Models Using Ensemble for Program Repair”, Lutellier et al 2020
- “The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization”, Hendrycks et al 2020
- “SimCLRv2: Big Self-Supervised Models Are Strong Semi-Supervised Learners”, Chen et al 2020
- “FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining”, Dai et al 2020
- “Danny Hernandez on Forecasting and the Drivers of AI Progress”, Koehler et al 2020
- “Measuring the Algorithmic Efficiency of Neural Networks”, Hernandez & Brown 2020
- “AI and Efficiency: We’re Releasing an Analysis Showing That Since 2012 the Amount of Compute Needed to Train a Neural Net to the Same Performance on ImageNet Classification Has Been Decreasing by a Factor of 2 Every 16 Months”, Hernandez & Brown 2020
- “Reinforcement Learning With Augmented Data”, Laskin et al 2020
- “YOLOv4: Optimal Speed and Accuracy of Object Detection”, Bochkovskiy et al 2020
- “Scaling Laws from the Data Manifold Dimension”, Sharma & Kaplan 2020
- “Shortcut Learning in Deep Neural Networks”, Geirhos et al 2020
- “Evolving Normalization-Activation Layers”, Liu et al 2020
- “Conditional Convolutions for Instance Segmentation”, Tian et al 2020
- “Rethinking Parameter Counting in Deep Models: Effective Dimensionality Revisited”, Maddox et al 2020
- “Bayesian Deep Learning and a Probabilistic Perspective of Generalization”, Wilson & Izmailov 2020
- “Do We Need Zero Training Loss After Achieving Zero Training Error?”, Ishida et al 2020
- “A Simple Framework for Contrastive Learning of Visual Representations”, Chen et al 2020
- “Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving”, Song et al 2020
- “First-in-human Evaluation of a Hand-held Automated Venipuncture Device for Rapid Venous Blood Draws”, Leipheimer et al 2020
- “Deep-Eyes: Fully Automatic Anime Character Colorization With Painting of Details on Empty Pupils”, Akita et al 2020
- “CARN: Convolutional Anchored Regression Network for Fast and Accurate Single Image Super-Resolution”, Li 2020
- “Deep Double Descent: We Show That the Double Descent Phenomenon Occurs in CNNs, ResNets, and Transformers: Performance First Improves, Then Gets Worse, and Then Improves Again With Increasing Model Size, Data Size, or Training Time. This Effect Is Often Avoided through Careful Regularization. While This Behavior Appears to Be Fairly Universal, We Don’t yet Fully Understand Why It Happens, and View Further Study of This Phenomenon As an Important Research Direction.”, Nakkiran et al 2019
- “Anonymous Market Product Classification Based on Deep Learning”, Yang et al 2019b
- “The Origins and Prevalence of Texture Bias in Convolutional Neural Networks”, Hermann et al 2019
- “Taxonomy of Real Faults in Deep Learning Systems”, Humbatova et al 2019
- “DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames”, Wijmans et al 2019
- “Accelerating Deep Learning by Focusing on the Biggest Losers”, Jiang et al 2019
- “ANIL: Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML”, Raghu et al 2019
- “ObjectNet: A Large-scale Bias-controlled Dataset for Pushing the Limits of Object Recognition Models”, Barbu et al 2019
- “CAR: Learned Image Downscaling for Upscaling Using Content Adaptive Resampler”, Sun & Chen 2019
- “Human-level Performance in 3D Multiplayer Games With Population-based Reinforcement Learning”, Jaderberg et al 2019
- “ImageNet-Sketch: Learning Robust Global Representations by Penalizing Local Predictive Power”, Wang et al 2019
- “Improved Object Recognition Using Neural Networks Trained to Mimic the Brain’s Statistical Properties”, Federer et al 2019
- “Neural System Identification With Neural Information Flow”, Seeliger et al 2019
- “Percival: Making In-Browser Perceptual Ad Blocking Practical With Deep Learning”, Din et al 2019
- “CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features”, Yun et al 2019
- “Searching for MobileNetV3”, Howard et al 2019
- “Billion-scale Semi-supervised Learning for Image Classification”, Yalniz et al 2019
- “NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection”, Ghiasi et al 2019
- “COCO-GAN: Generation by Parts via Conditional Coordinating”, Lin et al 2019
- “Benchmarking Neural Network Robustness to Common Corruptions and Perturbations”, Hendrycks & Dietterich 2019
- “Semantic Image Synthesis With Spatially-Adaptive Normalization”, Park et al 2019
- “The Bitter Lesson”, Sutton 2019
- “Learning To Follow Directions in Street View”, Hermann et al 2019
- “SuperTML: Two-Dimensional Word Embedding for the Precognition on Structured Tabular Data”, Sun et al 2019
- “Real-time Continuous Transcription With Live Transcribe”, Savla 2019
- “Do We Train on Test Data? Purging CIFAR of Near-Duplicates”, Barz & Denzler 2019
- “Pay Less Attention With Lightweight and Dynamic Convolutions”, Wu et al 2019
- “Association Between Surgical Skin Markings in Dermoscopic Images and Diagnostic Performance of a Deep Learning Convolutional Neural Network for Melanoma Recognition”, Winkler et al 2019
- “Quantifying Generalization in Reinforcement Learning”, Cobbe et al 2018
- “ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware”, Cai et al 2018
- “ImageNet-trained CNNs Are Biased towards Texture; Increasing Shape Bias Improves Accuracy and Robustness”, Geirhos et al 2018
- “Evolving Space-Time Neural Architectures for Videos”, Piergiovanni et al 2018
- “AdVersarial: Perceptual Ad Blocking Meets Adversarial Machine Learning”, Tramèr et al 2018
- “FloWaveNet: A Generative Flow for Raw Audio”, Kim et al 2018
- “StreetNet: Preference Learning With Convolutional Neural Network on Urban Crime Perception”, Fu et al 2018
- “Understanding and Correcting Pathologies in the Training of Learned Optimizers”, Metz et al 2018
- “Graph Convolutional Reinforcement Learning”, Jiang et al 2018
- “Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization”, Narayan et al 2018
- “Human-Like Playtesting With Deep Learning”, Gudmundsson et al 2018
- “CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, Guo et al 2018
- “MnasNet: Platform-Aware Neural Architecture Search for Mobile”, Tan et al 2018
- “LEO: Meta-Learning With Latent Embedding Optimization”, Rusu et al 2018
- “Glow: Generative Flow With Invertible 1×1 Convolutions”, Kingma & Dhariwal 2018
- “Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations”, Hendrycks & Dietterich 2018
- “Confounding Variables Can Degrade Generalization Performance of Radiological Deep Learning Models”, Zech et al 2018
- “Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks”, Xiao et al 2018
- “More Than a Feeling: Learning to Grasp and Regrasp Using Vision and Touch”, Calandra et al 2018
- “Deep Learning Generalizes Because the Parameter-function Map Is Biased towards Simple Functions”, Valle-Pérez et al 2018
- “Bidirectional Learning for Robust Neural Networks”, Pontes-Filho & Liwicki 2018
- “BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”, Yu et al 2018
- “Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data”, Jean et al 2018
- “Exploring the Limits of Weakly Supervised Pretraining”, Mahajan et al 2018
- “YOLOv3: An Incremental Improvement”, Redmon & Farhadi 2018
- “Reptile: On First-Order Meta-Learning Algorithms”, Nichol et al 2018
- “Guess, Check and Fix: a Phenomenology of Improvisation in ‘neural’ Painting”, Choi 2018
- “Sim-to-Real Optimization of Complex Real World Mobile Network With Imperfect Information via Deep Reinforcement Learning from Self-play”, Tan et al 2018
- “Evolved Policy Gradients”, Houthooft et al 2018
- “Large-scale, High-resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-art Deep Artificial Neural Networks”, Rajalingham et al 2018
- “IMPALA: Scalable Distributed Deep-RL With Importance Weighted Actor-Learner Architectures”, Espeholt et al 2018
- “Active, Continual Fine Tuning of Convolutional Neural Networks for Reducing Annotation Efforts”, Zhou et al 2018
- “ArcFace: Additive Angular Margin Loss for Deep Face Recognition”, Deng et al 2018
- “Man against Machine: Diagnostic Performance of a Deep Learning Convolutional Neural Network for Dermoscopic Melanoma Recognition in Comparison to 58 Dermatologists”, Haenssle et al 2018
- “DeepGS: Predicting Phenotypes from Genotypes Using Deep Learning”, Ma et al 2017
- “SPP-Net: Deep Absolute Pose Regression With Synthetic Views”, Purkait et al 2017
- “Measuring the Tendency of CNNs to Learn Surface Statistical Regularities”, Jo & Bengio 2017
- “3D Semantic Segmentation With Submanifold Sparse Convolutional Networks”, Graham et al 2017
- “BlockDrop: Dynamic Inference Paths in Residual Networks”, Wu et al 2017
- “Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN”, Gao et al 2017
- “The Signature of Robot Action Success in EEG Signals of a Human Observer: Decoding and Visualization Using Deep Convolutional Neural Networks”, Behncke et al 2017
- “11K Hands: Gender Recognition and Biometric Identification Using a Large Dataset of Hand Images”, Afifi 2017
- “Learning to Play Chess With Minimal Lookahead and Deep Value Neural Networks”, Sabatelli 2017 (page 3)
- “High-Precision Automated Reconstruction of Neurons With Flood-filling Networks”, Januszewski et al 2017
- “Efficient K-shot Learning With Regularized Deep Networks”, Yoo et al 2017
- “NIMA: Neural Image Assessment”, Talebi & Milanfar 2017
- “Squeeze-and-Excitation Networks”, Hu et al 2017
- “What Does a Convolutional Neural Network Recognize in the Moon?”, Shoji 2017
- “SMASH: One-Shot Model Architecture Search through HyperNetworks”, Brock et al 2017
- “A Deep Architecture for Unified Esthetic Prediction”, Murray & Gordo 2017
- “Learning With Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback”, Li et al 2017
- “WebVision Database: Visual Learning and Understanding from Web Data”, Li et al 2017
- “Focal Loss for Dense Object Detection”, Lin et al 2017
- “Active Learning for Convolutional Neural Networks: A Core-Set Approach”, Sener & Savarese 2017
- “Learning to Infer Graphics Programs from Hand-Drawn Images”, Ellis et al 2017
- “A Downsampled Variant of ImageNet As an Alternative to the CIFAR Datasets”, Chrabaszcz et al 2017
- “Learning Transferable Architectures for Scalable Image Recognition”, Zoph et al 2017
- “Efficient Architecture Search by Network Transformation”, Cai et al 2017
- “A Simple Neural Attentive Meta-Learner”, Mishra et al 2017
- “Towards Deep Learning Models Resistant to Adversarial Attacks”, Madry et al 2017
- “Device Placement Optimization With Reinforcement Learning”, Mirhoseini et al 2017
- “A Simple Neural Network Module for Relational Reasoning”, Santoro et al 2017
- “Submanifold Sparse Convolutional Networks”, Graham & Maaten 2017
- “What Makes a Good Image? Airbnb Demand Analytics Leveraging Interpretable Image Features”, Zhang et al 2017
- “Picasso: A Modular Framework for Visualizing the Learning Process of Neural Network Image Classifiers”, Henderson & Rothe 2017
- “BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography”, Wilber et al 2017
- “Adversarial Neural Machine Translation”, Wu et al 2017
- “Multi-Scale Dense Networks for Resource Efficient Image Classification”, Huang et al 2017
- “Mask R-CNN”, He et al 2017
- “Using Human Brain Activity to Guide Machine Learning”, Fong et al 2017
- “Learned Optimizers That Scale and Generalize”, Wichrowska et al 2017
- “Prediction and Control With Temporal Segment Models”, Mishra et al 2017
- “Parallel Multiscale Autoregressive Density Estimation”, Reed et al 2017
- “Convolution Aware Initialization”, Aghajanyan 2017
- “Gender-From-Iris or Gender-From-Mascara?”, Kuehlkamp et al 2017
- “BrainNetCNN: Convolutional Neural Networks for Brain Networks; towards Predicting Neurodevelopment”, Kawahara et al 2017
- “PixelCNN++: Improving the PixelCNN With Discretized Logistic Mixture Likelihood and Other Modifications”, Salimans et al 2017
- “YOLO9000: Better, Faster, Stronger”, Redmon & Farhadi 2016
- “Language Modeling With Gated Convolutional Networks”, Dauphin et al 2016
- “LipNet: End-to-End Sentence-level Lipreading”, Assael et al 2016
- “Self-critical Sequence Training for Image Captioning”, Rennie et al 2016
- “Responses to Critiques on Machine Learning of Criminality Perceptions (Addendum of ArXiv:1611.04135)”, Wu & Zhang 2016
- “Understanding Deep Learning Requires Rethinking Generalization”, Zhang et al 2016
- “Designing Neural Network Architectures Using Reinforcement Learning”, Baker et al 2016
- “Video Pixel Networks”, Kalchbrenner et al 2016
- “HyperNetworks”, Ha et al 2016
- “Neural Photo Editing With Introspective Adversarial Networks”, Brock et al 2016
- “WaveNet: A Generative Model for Raw Audio”, Oord et al 2016
- “Direct Feedback Alignment Provides Learning in Deep Neural Networks”, Nøkland 2016
- “Deep Learning Human Mind for Automated Visual Classification”, Spampinato et al 2016
- “Temporal Convolutional Networks: A Unified Approach to Action Segmentation”, Lea et al 2016
- “DenseNet: Densely Connected Convolutional Networks”, Huang et al 2016
- “Deep Learning the City: Quantifying Urban Perception At A Global Scale”, Dubey et al 2016
- “Convolutional Neural Fabrics”, Saxena & Verbeek 2016
- “FractalNet: Ultra-Deep Neural Networks without Residuals”, Larsson et al 2016
- “Wide Residual Networks”, Zagoruyko & Komodakis 2016
- “ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning”, Kempka et al 2016
- “OHEM: Training Region-based Object Detectors With Online Hard Example Mining”, Shrivastava et al 2016
- “Deep Networks With Stochastic Depth”, Huang et al 2016
- “Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing”, Esser et al 2016
- “Do Deep Convolutional Nets Really Need to Be Deep and Convolutional?”, Urban et al 2016
- “XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks”, Rastegari et al 2016
- “Learning Hand-Eye Coordination for Robotic Grasping With Deep Learning and Large-Scale Data Collection”, Levine et al 2016
- “Network Morphism”, Wei et al 2016
- “Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning”, Szegedy et al 2016
- “PlaNet—Photo Geolocation With Convolutional Neural Networks”, Weyand et al 2016
- “Value Iteration Networks”, Tamar et al 2016
- “PixelRNN: Pixel Recurrent Neural Networks”, Oord et al 2016
- “Image Synthesis from Yahoo’s open_nsfw”, Goh 2016
- “Deep Residual Learning for Image Recognition”, He et al 2015
- “Adding Gradient Noise Improves Learning for Very Deep Networks”, Neelakantan et al 2015
- “Learning Visual Features from Large Weakly Supervised Data”, Arm et al 2015
- “BinaryConnect: Training Deep Neural Networks With Binary Weights during Propagations”, Courbariaux et al 2015
- “Predicting and Understanding Urban Perception With Convolutional Neural Networks”, Porzi et al 2015
- “A Neural Attention Model for Abstractive Sentence Summarization”, Rush et al 2015
- “LSUN: Construction of a Large-scale Image Dataset Using Deep Learning With Humans in the Loop”, Yu et al 2015
- “You Only Look Once: Unified, Real-Time Object Detection”, Redmon et al 2015
- “Clothing-1M: Learning from Massive Noisy Labeled Data for Image Classification”, Xiao et al 2015
- “Dropout As a Bayesian Approximation: Representing Model Uncertainty in Deep Learning”, Gal & Ghahramani 2015
- “Faster R-CNN: Towards Real-Time Object Detection With Region Proposal Networks”, Ren et al 2015
- “Cyclical Learning Rates for Training Neural Networks”, Smith 2015
- “Deep Learning”, LeCun et al 2015
- “Fast R-CNN”, Girshick 2015
- “End-to-End Training of Deep Visuomotor Policies”, Levine et al 2015
- “FaceNet: A Unified Embedding for Face Recognition and Clustering”, Schroff et al 2015
- “DeepID3: Face Recognition With Very Deep Neural Networks”, Sun et al 2015
- “Understanding Image Representations by Measuring Their Equivariance and Equivalence”, Lenc & Vedaldi 2014
- “Going Deeper With Convolutions”, Szegedy et al 2014
- “Very Deep Convolutional Networks for Large-Scale Image Recognition”, Simonyan & Zisserman 2014
- “ImageNet Large Scale Visual Recognition Challenge”, Russakovsky et al 2014
- “Deep Learning Face Representation by Joint Identification-Verification”, Sun et al 2014
- “R-CNN: Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation”, Girshick et al 2013
- “ImageNet Classification With Deep Convolutional Neural Networks”, Krizhevsky et al 2012
- “Multi-column Deep Neural Network for Traffic Sign Classification”, Cireşan et al 2012b
- “Multi-column Deep Neural Networks for Image Classification”, Cireşan et al 2012
- “DanNet: Flexible, High Performance Convolutional Neural Networks for Image Classification”, Ciresan et al 2011
- “Hierarchical Object Detection With Deep Reinforcement Learning”
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Links
“Artificial Intelligence-supported Screen Reading versus Standard Double Reading in the Mammography Screening With Artificial Intelligence Trial (MASAI): a Clinical Safety Analysis of a Randomised, Controlled, Non-inferiority, Single-blinded, Screening Accuracy Study”, Lång et al 2023
“Hand-drawn Anime Line Drawing Colorization of Faces With Texture Details”, Akita et al 2023
“Hand-drawn anime line drawing colorization of faces with texture details”
“High-Quality Synthetic Character Image Extraction via Distortion Recognition”, Sawada et al 2023
“High-Quality Synthetic Character Image Extraction via Distortion Recognition”
“Loss of Plasticity in Deep Continual Learning”, Dohare et al 2023
“Neural Networks Trained With SGD Learn Distributions of Increasing Complexity”, Refinetti et al 2023
“Neural networks trained with SGD learn distributions of increasing complexity”
“Rosetta Neurons: Mining the Common Units in a Model Zoo”, Dravid et al 2023
“Improving Neural Network Representations Using Human Similarity Judgments”, Muttenthaler et al 2023
“Improving neural network representations using human similarity judgments”
“U-Net CNN in APL: Exploring Zero-Framework, Zero-Library Machine Learning”, Hsu & Serrão 2023
“U-Net CNN in APL: Exploring Zero-Framework, Zero-Library Machine Learning”
“VanillaNet: the Power of Minimalism in Deep Learning”, Chen et al 2023
“Multi-Label Classification in Anime Illustrations Based on Hierarchical Attribute Relationships”, Lan et al 2023
“Multi-Label Classification in Anime Illustrations Based on Hierarchical Attribute Relationships”
“ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification”, Taesiri et al 2023
“Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration”, Lan et al 2023
“Loss Landscapes Are All You Need: Neural Network Generalization Can Be Explained Without the Implicit Bias of Gradient Descent”, Chiang et al 2023
“Does Progress on ImageNet Transfer to Real-world Datasets?”, Fang et al 2023
“Does progress on ImageNet transfer to real-world datasets?”
“Pruning Compact ConvNets for Efficient Inference”, Ghosh et al 2023
“EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers”, Mahdad et al 2022
“EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers”
“Pretraining Without Attention”, Wang et al 2022
“What Do Vision Transformers Learn? A Visual Exploration”, Ghiasi et al 2022
“Simulated Automated Facial Recognition Systems As Decision-aids in Forensic Face Matching Tasks”, Carragher & Hancock 2022
“Simulated automated facial recognition systems as decision-aids in forensic face matching tasks”
“Predicting Sex, Age, General Cognition and Mental Health With Machine Learning on Brain Structural Connectomes”, Yeung et al 2022
“Interpreting Neural Networks through the Polytope Lens”, Black et al 2022
“The Power of Ensembles for Active Learning in Image Classification”, Beluch et al 2022
“The Power of Ensembles for Active Learning in Image Classification”
“GCN-Based Multi-Modal Multi-Label Attribute Classification in Anime Illustration Using Domain-Specific Semantic Features”, Lan et al 2022
“The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data Regimes”, Kocsis et al 2022
“The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data Regimes”
“Understanding the Covariance Structure of Convolutional Filters”, Trockman et al 2022
“Understanding the Covariance Structure of Convolutional Filters”
“VICRegL: Self-Supervised Learning of Local Visual Features”, Bardes et al 2022
“VICRegL: Self-Supervised Learning of Local Visual Features”
“Omnigrok: Grokking Beyond Algorithmic Data”, Liu et al 2022
“FastSiam: Resource-Efficient Self-supervised Learning on a Single GPU”, Pototzky et al 2022
“FastSiam: Resource-Efficient Self-supervised Learning on a Single GPU”
“Evaluation of Transfer Learning Methods for Detecting Alzheimer’s Disease With Brain MRI”, Dhinagar et al 2022
“Evaluation of Transfer Learning Methods for Detecting Alzheimer’s Disease with Brain MRI”
“Reassessing Hierarchical Correspondences between Brain and Deep Networks through Direct Interface”, Sexton & Love 2022
“Reassessing hierarchical correspondences between brain and deep networks through direct interface”
“Timesweeper: Accurately Identifying Selective Sweeps Using Population Genomic Time Series”, Whitehouse & Schrider 2022
“Timesweeper: Accurately Identifying Selective Sweeps Using Population Genomic Time Series”
“RHO-LOSS: Prioritized Training on Points That Are Learnable, Worth Learning, and Not Yet Learnt”, Mindermann et al 2022
“RHO-LOSS: Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt”
“BigVGAN: A Universal Neural Vocoder With Large-Scale Training”, Lee et al 2022
“BigVGAN: A Universal Neural Vocoder with Large-Scale Training”
“Continual Pre-Training Mitigates Forgetting in Language and Vision”, Cossu et al 2022
“Continual Pre-Training Mitigates Forgetting in Language and Vision”
“Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs (RepLKNet)”, Ding et al 2022
“Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs (RepLKNet)”
“Variational Autoencoders Without the Variation”, Daly et al 2022
“On the Effectiveness of Dataset Watermarking in Adversarial Settings”, Tekgul & Asokan 2022
“On the Effectiveness of Dataset Watermarking in Adversarial Settings”
“Approximating CNNs With Bag-of-local-Features Models Works Surprisingly Well on ImageNet”, Wiel et al 2022
“Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet”
“ConvMixer: Patches Are All You Need?”, Trockman & Kolter 2022
“HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning”, Zhmoginov et al 2022
“HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning”
“ConvNeXt: A ConvNet for the 2020s”, Liu et al 2022
“An Empirical Investigation of the Role of Pre-training in Lifelong Learning”, Mehta et al 2021
“Noether Networks: Meta-Learning Useful Conserved Quantities”, Alet et al 2021
“The Efficiency Misnomer”, Dehghani et al 2021
“Logical Activation Functions: Logit-space Equivalents of Probabilistic Boolean Operators”, Lowe et al 2021
“Evaluating Loss Functions for Illustration Super-Resolution Neural Networks”, Nepomuceno & Silva 2021
“Deep Learning Models of Cognitive Processes Constrained by Human Brain Connectomes”, Zhang et al 2021
“Decoupled Contrastive Learning”, Yeh et al 2021
“Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm (DeCLIP)”, Li et al 2021
“THINGSvision: A Python Toolbox for Streamlining the Extraction of Activations From Deep Neural Networks”, Muttenthaler & Hebart 2021
“Predicting Phenotypes from Genetic, Environment, Management, and Historical Data Using CNNs”, Washburn et al 2021
“Dataset Distillation With Infinitely Wide Convolutional Networks”, Nguyen et al 2021
“Revisiting the Calibration of Modern Neural Networks”, Minderer et al 2021
“Partial Success in Closing the Gap between Human and Machine Vision”, Geirhos et al 2021
“CoAtNet: Marrying Convolution and Attention for All Data Sizes”, Dai et al 2021
“Effect of Pre-Training Scale on Intra/Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images”, Cherti & Jitsev 2021
“Embracing New Techniques in Deep Learning for Estimating Image Memorability”, Needell & Bainbridge 2021
“Predicting Sex from Retinal Fundus Photographs Using Automated Deep Learning”, Korot et al 2021
“Rethinking and Improving the Robustness of Image Style Transfer”, Wang et al 2021
“The Surprising Impact of Mask-head Architecture on Novel Class Segmentation”, Birodkar et al 2021
“Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, Northcutt et al 2021
“Learning from Videos to Understand the World”, Zweig et al 2021
“Fast and Accurate Model Scaling”, Dollár et al 2021
“Transfer of Fully Convolutional Policy-Value Networks Between Games and Game Variants”, Soemers et al 2021
“Hiding Data Hiding”, Wu et al 2021
“Explaining Neural Scaling Laws”, Bahri et al 2021
“NFNet: High-Performance Large-Scale Image Recognition Without Normalization”, Brock et al 2021
“Brain2Pix: Fully Convolutional Naturalistic Video Reconstruction from Brain Activity”, Le et al 2021
“E(3)-Equivariant Graph Neural Networks for Data-Efficient and Accurate Interatomic Potentials”, Batzner et al 2021
“Meta Pseudo Labels”, Pham et al 2021
“Converting Tabular Data into Images for Deep Learning With Convolutional Neural Networks”, Zhu et al 2021
“Is MLP-Mixer a CNN in Disguise? As Part of This Blog Post, We Look at the MLP Mixer Architecture in Detail and Also Understand Why It Is Not Considered Convolution Free.”
“Taming Transformers for High-Resolution Image Synthesis”, Esser et al 2020
“Ensemble Learning of Convolutional Neural Network, Support Vector Machine, and Best Linear Unbiased Predictor for Brain Age Prediction: ARAMIS Contribution to the Predictive Analytics Competition 2019 Challenge”, Couvy-Duchesne et al 2020
“Scaling down Deep Learning”, Greydanus 2020
“Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images”, Child 2020
“Understanding RL Vision: With Diverse Environments, We Can Analyze, Diagnose and Edit Deep Reinforcement Learning Models Using Attribution”, Hilton et al 2020
“Fourier Neural Operator for Parametric Partial Differential Equations”, Li et al 2020
“Deep Learning-based Classification of the Polar Emotions of ‘moe’-style Cartoon Pictures”, Cao et al 2020b
“Demonstrating That Dataset Domains Are Largely Linearly Separable in the Feature Space of Common CNNs”, Dragan 2020
“Optimal Peanut Butter and Banana Sandwiches”, Rosenthal 2020
“Accuracy and Performance Comparison of Video Action Recognition Approaches”, Hutchinson et al 2020
“A Digital Biomarker of Diabetes from Smartphone-based Vascular Signals”, Avram et al 2020
“On Robustness and Transferability of Convolutional Neural Networks”, Djolonga et al 2020
“NVAE: A Deep Hierarchical Variational Autoencoder”, Vahdat & Kautz 2020
“CoCoNuT: Combining Context-Aware Neural Translation Models Using Ensemble for Program Repair”, Lutellier et al 2020
“The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization”, Hendrycks et al 2020
“SimCLRv2: Big Self-Supervised Models Are Strong Semi-Supervised Learners”, Chen et al 2020
“FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining”, Dai et al 2020
“Danny Hernandez on Forecasting and the Drivers of AI Progress”, Koehler et al 2020
“Measuring the Algorithmic Efficiency of Neural Networks”, Hernandez & Brown 2020
“AI and Efficiency: We’re Releasing an Analysis Showing That Since 2012 the Amount of Compute Needed to Train a Neural Net to the Same Performance on ImageNet Classification Has Been Decreasing by a Factor of 2 Every 16 Months”, Hernandez & Brown 2020
“Reinforcement Learning With Augmented Data”, Laskin et al 2020
“YOLOv4: Optimal Speed and Accuracy of Object Detection”, Bochkovskiy et al 2020
“Shortcut Learning in Deep Neural Networks”, Geirhos et al 2020
“Evolving Normalization-Activation Layers”, Liu et al 2020
“Conditional Convolutions for Instance Segmentation”, Tian et al 2020
“Rethinking Parameter Counting in Deep Models: Effective Dimensionality Revisited”, Maddox et al 2020
“Bayesian Deep Learning and a Probabilistic Perspective of Generalization”, Wilson & Izmailov 2020
“Do We Need Zero Training Loss After Achieving Zero Training Error?”, Ishida et al 2020
“A Simple Framework for Contrastive Learning of Visual Representations”, Chen et al 2020
“Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving”, Song et al 2020
“First-in-human Evaluation of a Hand-held Automated Venipuncture Device for Rapid Venous Blood Draws”, Leipheimer et al 2020
“Deep-Eyes: Fully Automatic Anime Character Colorization With Painting of Details on Empty Pupils”, Akita et al 2020
“CARN: Convolutional Anchored Regression Network for Fast and Accurate Single Image Super-Resolution”, Li, 2020 {#li,-2020-section .link-annotated-not}
“Deep Double Descent: We Show That the Double Descent Phenomenon Occurs in CNNs, ResNets, and Transformers: Performance First Improves, Then Gets Worse, and Then Improves Again With Increasing Model Size, Data Size, or Training Time. This Effect Is Often Avoided through Careful Regularization. While This Behavior Appears to Be Fairly Universal, We Don’t yet Fully Understand Why It Happens, and View Further Study of This Phenomenon As an Important Research Direction.”, Nakkiran et al 2019
“Anonymous Market Product Classification Based on Deep Learning”, Yang et al 2019b
“The Origins and Prevalence of Texture Bias in Convolutional Neural Networks”, Hermann et al 2019
“Taxonomy of Real Faults in Deep Learning Systems”, Humbatova et al 2019
“DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames”, Wijmans et al 2019
“Accelerating Deep Learning by Focusing on the Biggest Losers”, Jiang et al 2019
“ANIL: Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML”, Raghu et al 2019
“ObjectNet: A Large-scale Bias-controlled Dataset for Pushing the Limits of Object Recognition Models”, Barbu et al 2019
“CAR: Learned Image Downscaling for Upscaling Using Content Adaptive Resampler”, Sun & Chen 2019
“Human-level Performance in 3D Multiplayer Games With Population-based Reinforcement Learning”, Jaderberg et al 2019
“ImageNet-Sketch: Learning Robust Global Representations by Penalizing Local Predictive Power”, Wang et al 2019
“Improved Object Recognition Using Neural Networks Trained to Mimic the Brain’s Statistical Properties”, Federer et al 2019
“Neural System Identification With Neural Information Flow”, Seeliger et al 2019
“Percival: Making In-Browser Perceptual Ad Blocking Practical With Deep Learning”, Din et al 2019
“CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features”, Yun et al 2019
“Searching for MobileNetV3”, Howard et al 2019
“Billion-scale Semi-supervised Learning for Image Classification”, Yalniz et al 2019
“NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection”, Ghiasi et al 2019
“COCO-GAN: Generation by Parts via Conditional Coordinating”, Lin et al 2019
“Benchmarking Neural Network Robustness to Common Corruptions and Perturbations”, Hendrycks & Dietterich 2019
“Semantic Image Synthesis With Spatially-Adaptive Normalization”, Park et al 2019
“The Bitter Lesson”, Sutton 2019
“Learning To Follow Directions in Street View”, Hermann et al 2019
“SuperTML: Two-Dimensional Word Embedding for the Precognition on Structured Tabular Data”, Sun et al 2019
“Real-time Continuous Transcription With Live Transcribe”, Savla 2019
“Do We Train on Test Data? Purging CIFAR of Near-Duplicates”, Barz & Denzler 2019
“Pay Less Attention With Lightweight and Dynamic Convolutions”, Wu et al 2019
“Association Between Surgical Skin Markings in Dermoscopic Images and Diagnostic Performance of a Deep Learning Convolutional Neural Network for Melanoma Recognition”, Winkler et al 2019
“Quantifying Generalization in Reinforcement Learning”, Cobbe et al 2018
“ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware”, Cai et al 2018
“ImageNet-trained CNNs Are Biased towards Texture; Increasing Shape Bias Improves Accuracy and Robustness”, Geirhos et al 2018
“Evolving Space-Time Neural Architectures for Videos”, Piergiovanni et al 2018
“AdVersarial: Perceptual Ad Blocking Meets Adversarial Machine Learning”, Tramèr et al 2018
“FloWaveNet: A Generative Flow for Raw Audio”, Kim et al 2018
“StreetNet: Preference Learning With Convolutional Neural Network on Urban Crime Perception”, Fu et al 2018
“Understanding and Correcting Pathologies in the Training of Learned Optimizers”, Metz et al 2018
“Graph Convolutional Reinforcement Learning”, Jiang et al 2018
“Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization”, Narayan et al 2018
“Human-Like Playtesting With Deep Learning”, Gudmundsson et al 2018
“CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, Guo et al 2018
“MnasNet: Platform-Aware Neural Architecture Search for Mobile”, Tan et al 2018
“LEO: Meta-Learning With Latent Embedding Optimization”, Rusu et al 2018
“Glow: Generative Flow With Invertible 1×1 Convolutions”, Kingma & Dhariwal 2018
“Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations”, Hendrycks & Dietterich 2018
“Confounding Variables Can Degrade Generalization Performance of Radiological Deep Learning Models”, Zech et al 2018
“Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks”, Xiao et al 2018
“More Than a Feeling: Learning to Grasp and Regrasp Using Vision and Touch”, Calandra et al 2018
“Deep Learning Generalizes Because the Parameter-function Map Is Biased towards Simple Functions”, Valle-Pérez et al 2018
“Bidirectional Learning for Robust Neural Networks”, Pontes-Filho & Liwicki 2018
“BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”, Yu et al 2018
“Tile2Vec: Unsupervised Representation Learning for Spatially Distributed Data”, Jean et al 2018
“Exploring the Limits of Weakly Supervised Pretraining”, Mahajan et al 2018
“YOLOv3: An Incremental Improvement”, Redmon & Farhadi 2018
“Reptile: On First-Order Meta-Learning Algorithms”, Nichol et al 2018
“Guess, Check and Fix: a Phenomenology of Improvisation in ‘neural’ Painting”, Choi 2018
“Sim-to-Real Optimization of Complex Real World Mobile Network With Imperfect Information via Deep Reinforcement Learning from Self-play”, Tan et al 2018
“Evolved Policy Gradients”, Houthooft et al 2018
“Large-scale, High-resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-art Deep Artificial Neural Networks”, Rajalingham et al 2018
“IMPALA: Scalable Distributed Deep-RL With Importance Weighted Actor-Learner Architectures”, Espeholt et al 2018
“Active, Continual Fine Tuning of Convolutional Neural Networks for Reducing Annotation Efforts”, Zhou et al 2018
“ArcFace: Additive Angular Margin Loss for Deep Face Recognition”, Deng et al 2018
“Man against Machine: Diagnostic Performance of a Deep Learning Convolutional Neural Network for Dermoscopic Melanoma Recognition in Comparison to 58 Dermatologists”, Haenssle et al 2018
“DeepGS: Predicting Phenotypes from Genotypes Using Deep Learning”, Ma et al 2017
“SPP-Net: Deep Absolute Pose Regression With Synthetic Views”, Purkait et al 2017
“Measuring the Tendency of CNNs to Learn Surface Statistical Regularities”, Jo & Bengio 2017
“3D Semantic Segmentation With Submanifold Sparse Convolutional Networks”, Graham et al 2017
“BlockDrop: Dynamic Inference Paths in Residual Networks”, Wu et al 2017
“Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN”, Gao et al 2017
“The Signature of Robot Action Success in EEG Signals of a Human Observer: Decoding and Visualization Using Deep Convolutional Neural Networks”, Behncke et al 2017
“11K Hands: Gender Recognition and Biometric Identification Using a Large Dataset of Hand Images”, Afifi 2017
“Learning to Play Chess With Minimal Lookahead and Deep Value Neural Networks”, Sabatelli 2017 (page 3)
“High-Precision Automated Reconstruction of Neurons With Flood-filling Networks”, Januszewski et al 2017
“Efficient K-shot Learning With Regularized Deep Networks”, Yoo et al 2017
“NIMA: Neural Image Assessment”, Talebi & Milanfar 2017
“Squeeze-and-Excitation Networks”, Hu et al 2017
“What Does a Convolutional Neural Network Recognize in the Moon?”, Shoji 2017
“SMASH: One-Shot Model Architecture Search through HyperNetworks”, Brock et al 2017
“A Deep Architecture for Unified Esthetic Prediction”, Murray & Gordo 2017
“Learning With Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback”, Li et al 2017
“WebVision Database: Visual Learning and Understanding from Web Data”, Li et al 2017
“Focal Loss for Dense Object Detection”, Lin et al 2017
“Active Learning for Convolutional Neural Networks: A Core-Set Approach”, Sener & Savarese 2017
“Learning to Infer Graphics Programs from Hand-Drawn Images”, Ellis et al 2017
“A Downsampled Variant of ImageNet As an Alternative to the CIFAR Datasets”, Chrabaszcz et al 2017
“Learning Transferable Architectures for Scalable Image Recognition”, Zoph et al 2017
“Efficient Architecture Search by Network Transformation”, Cai et al 2017
“A Simple Neural Attentive Meta-Learner”, Mishra et al 2017
“Towards Deep Learning Models Resistant to Adversarial Attacks”, Madry et al 2017
“Device Placement Optimization With Reinforcement Learning”, Mirhoseini et al 2017
“A Simple Neural Network Module for Relational Reasoning”, Santoro et al 2017
“Submanifold Sparse Convolutional Networks”, Graham & Maaten 2017
“What Makes a Good Image? Airbnb Demand Analytics Leveraging Interpretable Image Features”, Zhang et al 2017
“Picasso: A Modular Framework for Visualizing the Learning Process of Neural Network Image Classifiers”, Henderson & Rothe 2017
“BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography”, Wilber et al 2017
“Adversarial Neural Machine Translation”, Wu et al 2017
“Multi-Scale Dense Networks for Resource Efficient Image Classification”, Huang et al 2017
“Mask R-CNN”, He et al 2017
“Using Human Brain Activity to Guide Machine Learning”, Fong et al 2017
“Learned Optimizers That Scale and Generalize”, Wichrowska et al 2017
“Prediction and Control With Temporal Segment Models”, Mishra et al 2017
“Parallel Multiscale Autoregressive Density Estimation”, Reed et al 2017
“Convolution Aware Initialization”, Aghajanyan 2017
“Gender-From-Iris or Gender-From-Mascara?”, Kuehlkamp et al 2017
“BrainNetCNN: Convolutional Neural Networks for Brain Networks; towards Predicting Neurodevelopment”, Kawahara et al 2017
“PixelCNN++: Improving the PixelCNN With Discretized Logistic Mixture Likelihood and Other Modifications”, Salimans et al 2017
“YOLO9000: Better, Faster, Stronger”, Redmon & Farhadi 2016
“Language Modeling With Gated Convolutional Networks”, Dauphin et al 2016
“LipNet: End-to-End Sentence-level Lipreading”, Assael et al 2016
“Self-critical Sequence Training for Image Captioning”, Rennie et al 2016
“Responses to Critiques on Machine Learning of Criminality Perceptions (Addendum of ArXiv:1611.04135)”, Wu & Zhang 2016
“Understanding Deep Learning Requires Rethinking Generalization”, Zhang et al 2016
“Designing Neural Network Architectures Using Reinforcement Learning”, Baker et al 2016
“Video Pixel Networks”, Kalchbrenner et al 2016
“HyperNetworks”, Ha et al 2016
“Neural Photo Editing With Introspective Adversarial Networks”, Brock et al 2016
“WaveNet: A Generative Model for Raw Audio”, Oord et al 2016
“Direct Feedback Alignment Provides Learning in Deep Neural Networks”, Nøkland 2016
“Deep Learning Human Mind for Automated Visual Classification”, Spampinato et al 2016
“Temporal Convolutional Networks: A Unified Approach to Action Segmentation”, Lea et al 2016
“DenseNet: Densely Connected Convolutional Networks”, Huang et al 2016
“Deep Learning the City: Quantifying Urban Perception At A Global Scale”, Dubey et al 2016
“Convolutional Neural Fabrics”, Saxena & Verbeek 2016
“FractalNet: Ultra-Deep Neural Networks without Residuals”, Larsson et al 2016
“Wide Residual Networks”, Zagoruyko & Komodakis 2016
“ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning”, Kempka et al 2016
“OHEM: Training Region-based Object Detectors With Online Hard Example Mining”, Shrivastava et al 2016
“Deep Networks With Stochastic Depth”, Huang et al 2016
“Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing”, Esser et al 2016
“Do Deep Convolutional Nets Really Need to Be Deep and Convolutional?”, Urban et al 2016
“XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks”, Rastegari et al 2016
“Learning Hand-Eye Coordination for Robotic Grasping With Deep Learning and Large-Scale Data Collection”, Levine et al 2016
“Network Morphism”, Wei et al 2016
“Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning”, Szegedy et al 2016
“PlaNet—Photo Geolocation With Convolutional Neural Networks”, Weyand et al 2016
“Value Iteration Networks”, Tamar et al 2016
“PixelRNN: Pixel Recurrent Neural Networks”, Oord et al 2016
“Image Synthesis from Yahoo’s open_nsfw”, Goh 2016
“Deep Residual Learning for Image Recognition”, He et al 2015
“Adding Gradient Noise Improves Learning for Very Deep Networks”, Neelakantan et al 2015
“Learning Visual Features from Large Weakly Supervised Data”, Joulin et al 2015
“BinaryConnect: Training Deep Neural Networks With Binary Weights during Propagations”, Courbariaux et al 2015
“Predicting and Understanding Urban Perception With Convolutional Neural Networks”, Porzi et al 2015
“A Neural Attention Model for Abstractive Sentence Summarization”, Rush et al 2015
“LSUN: Construction of a Large-scale Image Dataset Using Deep Learning With Humans in the Loop”, Yu et al 2015
“You Only Look Once: Unified, Real-Time Object Detection”, Redmon et al 2015
“Clothing-1M: Learning from Massive Noisy Labeled Data for Image Classification”, Xiao et al 2015
“Dropout As a Bayesian Approximation: Representing Model Uncertainty in Deep Learning”, Gal & Ghahramani 2015
“Faster R-CNN: Towards Real-Time Object Detection With Region Proposal Networks”, Ren et al 2015
“Cyclical Learning Rates for Training Neural Networks”, Smith 2015
“Deep Learning”, LeCun et al 2015
“Fast R-CNN”, Girshick 2015
“End-to-End Training of Deep Visuomotor Policies”, Levine et al 2015
“FaceNet: A Unified Embedding for Face Recognition and Clustering”, Schroff et al 2015
“DeepID3: Face Recognition With Very Deep Neural Networks”, Sun et al 2015
“Understanding Image Representations by Measuring Their Equivariance and Equivalence”, Lenc & Vedaldi 2014
“Going Deeper With Convolutions”, Szegedy et al 2014
“Very Deep Convolutional Networks for Large-Scale Image Recognition”, Simonyan & Zisserman 2014
“ImageNet Large Scale Visual Recognition Challenge”, Russakovsky et al 2014
“Deep Learning Face Representation by Joint Identification-Verification”, Sun et al 2014
“R-CNN: Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation”, Girshick et al 2013
“ImageNet Classification With Deep Convolutional Neural Networks”, Krizhevsky et al 2012
“Multi-column Deep Neural Network for Traffic Sign Classification”, Cireşan et al 2012b
“Multi-column Deep Neural Networks for Image Classification”, Cireşan et al 2012
“DanNet: Flexible, High Performance Convolutional Neural Networks for Image Classification”, Ciresan et al 2011
“Hierarchical Object Detection With Deep Reinforcement Learning”
Wikipedia
Miscellaneous
- /doc/ai/nn/cnn/2023-hsu-appendix-acompleteunetnnimplementationinapl.png
- /doc/ai/nn/cnn/2022-mindermann-figure1-18xspeedupfromactivelearningofclothing1mdataset.jpg
- /doc/ai/nn/cnn/2020-07-24-gwern-meme-moneyprinter-bitterlesson-gpt3.png
- /doc/ai/nn/cnn/2019-humbatova-figure1-taxonomyofrealfaultsindeeplearningsystems.png
- /doc/ai/nn/cnn/2019-boazbarak-deepdoubledescent-expandedversion-degree1000spline-goodoverfitting.png
- /doc/ai/nn/cnn/2015-joulin-figure2-flickrpascalvoc2007precisionscalingwithflickr100mnscaling.png
- https://github.com/ultralytics/yolov5/issues/6998#issue-1170533269
- https://twitter.com/chriswolfvision/status/1313059518574718977
- https://www.quantamagazine.org/sparse-neural-networks-point-physicists-to-useful-data-20230608/
- https://www.wired.com/story/beauty-is-in-the-eye-of-the-beholder-but-memorability-may-be-universal/
Link Bibliography
- https://arxiv.org/abs/2306.09346: “Rosetta Neurons: Mining the Common Units in a Model Zoo”, Amil Dravid, Yossi Gandelsman, Alexei A. Efros, Assaf Shocher
- https://dl.acm.org/doi/pdf/10.1145/3589246.3595371: “U-Net CNN in APL: Exploring Zero-Framework, Zero-Library Machine Learning”, Aaron W. Hsu, Rodrigo Girão Serrão
- https://arxiv.org/abs/2305.12972: “VanillaNet: the Power of Minimalism in Deep Learning”, Hanting Chen, Yunhe Wang, Jianyuan Guo, Dacheng Tao
- https://arxiv.org/abs/2304.05538: “ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification”, Mohammad Reza Taesiri, Giang Nguyen, Sarra Habchi, Cor-Paul Bezemer, Anh Nguyen
- https://ieeexplore.ieee.org/abstract/document/10097719: “Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration”, Ziwen Lan, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
- https://arxiv.org/abs/2212.10544: “Pretraining Without Attention”, Junxiong Wang, Jing Nathan Yan, Albert Gu, Alexander M. Rush
- https://arxiv.org/abs/2212.06727: “What Do Vision Transformers Learn? A Visual Exploration”
- 2022-carragher.pdf: “Simulated Automated Facial Recognition Systems As Decision-aids in Forensic Face Matching Tasks”, Daniel J. Carragher, Peter J. B. Hancock
- 2022-lan.pdf: “GCN-Based Multi-Modal Multi-Label Attribute Classification in Anime Illustration Using Domain-Specific Semantic Features”, Ziwen Lan, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
- https://arxiv.org/abs/2210.03651: “Understanding the Covariance Structure of Convolutional Filters”, Asher Trockman, Devin Willmott, J. Zico Kolter
- 2022-pototzky.pdf: “FastSiam: Resource-Efficient Self-supervised Learning on a Single GPU”, Daniel Pototzky, Azhar Sultan, Lars Schmidt-Thieme
- https://www.science.org/doi/10.1126/sciadv.abm2219: “Reassessing Hierarchical Correspondences between Brain and Deep Networks through Direct Interface”, Nicholas J. Sexton, Bradley C. Love
- https://arxiv.org/abs/2206.07137: “RHO-LOSS: Prioritized Training on Points That Are Learnable, Worth Learning, and Not Yet Learnt”
- https://arxiv.org/abs/2206.04658#nvidia: “BigVGAN: A Universal Neural Vocoder With Large-Scale Training”, Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon
- https://arxiv.org/abs/2203.06717: “Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs (RepLKNet)”, Xiaohan Ding, Xiangyu Zhang, Yizhuang Zhou, Jungong Han, Guiguang Ding, Jian Sun
- https://openreview.net/forum?id=SkfMWhAqYQ: “Approximating CNNs With Bag-of-local-Features Models Works Surprisingly Well on ImageNet”, Wieland Brendel, Matthias Bethge
- https://arxiv.org/abs/2201.09792
: “ConvMixer: Patches Are All You Need?”, Asher Trockman, J. Zico Kolter -
https://arxiv.org/abs/2201.03545#facebook
: “ConvNeXt: A ConvNet for the 2020s”, Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie -
https://arxiv.org/abs/2110.06848
: “Decoupled Contrastive Learning”, Chun-Hsiao Yeh, Cheng-Yao Hong, Yen-Chi Hsu, Tyng-Luh Liu, Yubei Chen, Yann LeCun -
https://arxiv.org/abs/2110.05208
: “Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm (DeCLIP)”, Yangguang Li, Feng Liang, Lichen Zhao, Yufeng Cui, Wanli Ouyang, Jing Shao, Fengwei Yu, Junjie Yan -
https://www.frontiersin.org/articles/10.3389/fninf.2021.679838/full
: “<code>THINGSvision</code>: A Python Toolbox for Streamlining the Extraction of Activations From Deep Neural Networks”, Lukas Muttenthaler, Martin N. Hebart -
https://arxiv.org/abs/2106.07411
: “Partial Success in Closing the Gap between Human and Machine Vision”, -
https://arxiv.org/abs/2106.04803#google
: “CoAtNet: Marrying Convolution and Attention for All Data Sizes”, Zihang Dai, Hanxiao Liu, Quoc V. Le, Mingxing Tan -
https://arxiv.org/abs/2106.00116
: “Effect of Pre-Training Scale on Intra/Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images”, Mehdi Cherti, Jenia Jitsev -
https://arxiv.org/abs/2104.05623
: “Rethinking and Improving the Robustness of Image Style Transfer”, Pei Wang, Yijun Li, Nuno Vasconcelos -
https://arxiv.org/abs/2104.00613
: “The Surprising Impact of Mask-head Architecture on Novel Class Segmentation”, Vighnesh Birodkar, Zhichao Lu, Siyang Li, Vivek Rathod, Jonathan Huang -
https://arxiv.org/abs/2103.14749
: “Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, Curtis G. Northcutt, Anish Athalye, Jonas Mueller -
https://ai.facebook.com/blog/learning-from-videos-to-understand-the-world/
: “Learning from Videos to Understand the World”, Geoffrey Zweig, Polina Kuznetsova, Michael Auli, Francois Fagan -
https://arxiv.org/abs/2102.06171#deepmind
: “NFNet: High-Performance Large-Scale Image Recognition Without Normalization”, Andrew Brock, Soham De, Samuel L. Smith, Karen Simonyan -
https://arxiv.org/abs/2003.10580#google
: “Meta Pseudo Labels”, Hieu Pham, Zihang Dai, Qizhe Xie, Minh-Thang Luong, Quoc V. Le -
https://greydanus.github.io/2020/12/01/scaling-down/
: “Scaling down Deep Learning”, Sam Greydanus -
https://arxiv.org/abs/2011.10650#openai
: “Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images”, Rewon Child -
https://www.ethanrosenthal.com/2020/08/25/optimal-peanut-butter-and-banana-sandwiches/
: “Optimal Peanut Butter and Banana Sandwiches”, Ethan Rosenthal -
https://arxiv.org/abs/2008.09037
: “Accuracy and Performance Comparison of Video Action Recognition Approaches”, -
https://arxiv.org/abs/2007.03898#nvidia
: “NVAE: A Deep Hierarchical Variational Autoencoder”, Arash Vahdat, Jan Kautz -
https://arxiv.org/abs/2004.10934
: “YOLOv4: Optimal Speed and Accuracy of Object Detection”, Alexey Bochkovskiy, Chien-Yao Wang, Hong-Yuan Mark Liao -
https://arxiv.org/abs/2004.10802
: “Scaling Laws from the Data Manifold Dimension”, Utkarsh Sharma, Jared Kaplan -
https://arxiv.org/abs/2002.05709#google
: “A Simple Framework for Contrastive Learning of Visual Representations”, Ting Chen, Simon Kornblith, Mohammad Norouzi, Geoffrey Hinton -
https://arxiv.org/abs/2002.03629
: “Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving”, Yang Song, Chenlin Meng, Renjie Liao, Stefano Ermon -
https://openai.com/research/deep-double-descent
: “Deep Double Descent: We Show That the Double Descent Phenomenon Occurs in CNNs, ResNets, and Transformers: Performance First Improves, Then Gets Worse, and Then Improves Again With Increasing Model Size, Data Size, or Training Time. This Effect Is Often Avoided through Careful Regularization. While This Behavior Appears to Be Fairly Universal, We Don’t yet Fully Understand Why It Happens, and View Further Study of This Phenomenon As an Important Research Direction.”, Preetum Nakkiran, Gal Kaplun, Yamini Bansal, Tristan Yang, Boaz Barak, Ilya Sutskever -
https://arxiv.org/abs/1911.00357#facebook
: “DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames”, Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra -
2019-jaderberg.pdf#deepmind
: “Human-level Performance in 3D Multiplayer Games With Population-based Reinforcement Learning”, -
https://arxiv.org/abs/1905.04899
: “CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features”, Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, Youngjoon Yoo -
https://arxiv.org/abs/1905.00546#facebook
: “Billion-scale Semi-supervised Learning for Image Classification”, I. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, Dhruv Mahajan -
https://arxiv.org/abs/1811.02155
: “FloWaveNet: A Generative Flow for Raw Audio”, Sungwon Kim, Sang-gil Lee, Jongyoon Song, Jaehyeon Kim, Sungroh Yoon -
2018-fu.pdf
: “StreetNet: Preference Learning With Convolutional Neural Network on Urban Crime Perception”, Kaiqun Fu, Zhiqian Chen, Chang-Tien Lu -
2018-gudmundsson.pdf
: “Human-Like Playtesting With Deep Learning”, -
https://arxiv.org/abs/1808.01097
: “CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, Sheng Guo, Weilin Huang, Haozhi Zhang, Chenfan Zhuang, Dengke Dong, Matthew R. Scott, Dinglong Huang -
https://arxiv.org/abs/1805.00932#facebook
: “Exploring the Limits of Weakly Supervised Pretraining”, -
https://arxiv.org/abs/1804.02767
: “YOLOv3: An Incremental Improvement”, Joseph Redmon, Ali Farhadi -
https://arxiv.org/abs/1803.02999#openai
: “Reptile: On First-Order Meta-Learning Algorithms”, Alex Nichol, Joshua Achiam, John Schulman -
2018-choi.pdf
: “Guess, Check and Fix: a Phenomenology of Improvisation in ‘neural’ Painting”, Suk Kyoung Choi -
2017-sabatelli.pdf#page=3
: “Learning to Play Chess With Minimal Lookahead and Deep Value Neural Networks”, Matthia Sabatelli -
https://arxiv.org/abs/1708.05344
: “SMASH: One-Shot Model Architecture Search through HyperNetworks”, Andrew Brock, Theodore Lim, J. M. Ritchie, Nick Weston -
https://arxiv.org/abs/1707.08819
: “A Downsampled Variant of ImageNet As an Alternative to the CIFAR Datasets”, Patryk Chrabaszcz, Ilya Loshchilov, Frank Hutter -
https://arxiv.org/abs/1706.06083
: “Towards Deep Learning Models Resistant to Adversarial Attacks”, Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu -
https://arxiv.org/abs/1706.01427#deepmind
: “A Simple Neural Network Module for Relational Reasoning”, -
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2976021
: “What Makes a Good Image? Airbnb Demand Analytics Leveraging Interpretable Image Features”, Shunyuan Zhang, Dokyun Lee, Param Vir Singh, Kannan Srinivasan -
https://arxiv.org/abs/1703.06870#facebook
: “Mask R-CNN”, Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick -
2017-kawahara.pdf
: “BrainNetCNN: Convolutional Neural Networks for Brain Networks; towards Predicting Neurodevelopment”, -
https://arxiv.org/abs/1612.08242
: “YOLO9000: Better, Faster, Stronger”, Joseph Redmon, Ali Farhadi -
https://arxiv.org/abs/1612.00563
: “Self-critical Sequence Training for Image Captioning”, Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jarret Ross, Vaibhava Goel -
https://arxiv.org/abs/1605.07146
: “Wide Residual Networks”, Sergey Zagoruyko, Nikos Komodakis -
https://arxiv.org/abs/1603.05279
: “XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks”, Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, Ali Farhadi -
https://arxiv.org/abs/1511.02251#facebook
: “Learning Visual Features from Large Weakly Supervised Data”, Arm, Joulin, Laurens van der Maaten, Allan Jabri, Nicolas Vasilache -
2015-porzi.pdf
: “Predicting and Understanding Urban Perception With Convolutional Neural Networks”, Lorenzo Porzi, Samuel Rota Bulò, Bruno Lepri, Elisa Ricci -
https://arxiv.org/abs/1506.02640
: “You Only Look Once: Unified, Real-Time Object Detection”, Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi -
https://openaccess.thecvf.com/content_cvpr_2015/papers/Xiao_Learning_From_Massive_2015_CVPR_paper.pdf#baidu
: “Clothing-1M: Learning from Massive Noisy Labeled Data for Image Classification”, Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang -
https://arxiv.org/abs/1506.01497#microsoft
: “Faster R-CNN: Towards Real-Time Object Detection With Region Proposal Networks”, Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun -
https://arxiv.org/abs/1506.01186
: “Cyclical Learning Rates for Training Neural Networks”, Leslie N. Smith -
https://arxiv.org/abs/1504.08083#microsoft
: “Fast R-CNN”, Ross Girshick -
https://arxiv.org/abs/1311.2524
: “R-CNN: Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation”, Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik -
https://arxiv.org/abs/1102.0183#schmidhuber
: “DanNet: Flexible, High Performance Convolutional Neural Networks for Image Classification”, Dan Claudiu Ciresan, Ueli Meier, Jonathan Masci, Luca Maria Gambardella, Jürgen Schmidhuber