- See Also
- Links
- “Interactive-Chain-Prompting (INTERCPT): Ambiguity Resolution for Crosslingual Conditional Generation With Interaction”, Et Al 2023
- “The Semantic Scholar Open Data Platform”, Et Al 2023
- “How Close Is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection”, Et Al 2023
- “Med-PaLM: Large Language Models Encode Clinical Knowledge”, Et Al 2022
- “Unnatural Instructions: Tuning Language Models With (Almost) No Human Labor”, Et Al 2022
- “A Whack-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others”, Et Al 2022
- “The Stack: 3 TB of Permissively Licensed Source Code”, Et Al 2022
- “UniSumm: Unified Few-shot Summarization With Multi-Task Pre-Training and Prefix-Tuning”, Et Al 2022
- “A Creative Industry Image Generation Dataset Based on Captions”, Et Al 2022
- “AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities”, Et Al 2022
- “MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation”, Et Al 2022
- “AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies”, Et Al 2022
- “BLOOMZ/mT0: Crosslingual Generalization through Multitask Finetuning”, Et Al 2022
- “Dungeons and Data: A Large-Scale NetHack Dataset”, Et Al 2022
- “Will We Run out of Data? An Analysis of the Limits of Scaling Datasets in Machine Learning”, Et Al 2022
- “Large Language Models Can Self-Improve”, Et Al 2022
- “CARP: Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning”, Et Al 2022
- “MTEB: Massive Text Embedding Benchmark”, Et Al 2022
- “Most Language Models Can Be Poets Too: An AI Writing Assistant and Constrained Text Generation Studio”, Et Al 2022
- “Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning”, Et Al 2022
- “Brain Imaging Generation With Latent Diffusion Models”, Et Al 2022
- “PaLI: A Jointly-Scaled Multilingual Language-Image Model”, Et Al 2022
- “FOLIO: Natural Language Reasoning With First-Order Logic”, Et Al 2022
- “Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Et Al 2022
- “Bugs in the Data: How ImageNet Misrepresents Biodiversity”, 2022
- “Discovering Bugs in Vision Models Using Off-the-shelf Image Generation and Captioning”, Et Al 2022
- “Benchmarking Compositionality With Formal Languages”, Et Al 2022
- “Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP”, Et Al 2022
- “Few-shot Adaptation Works With UnpredicTable Data”, Et Al 2022
- “Language Models Can Teach Themselves to Program Better”, Et Al 2022
- “NewsStories: Illustrating Articles With Visual Summaries”, Et Al 2022
- “CelebV-HQ: A Large-Scale Video Facial Attributes Dataset”, Et Al 2022
- “Why Do Tree-based Models Still Outperform Deep Learning on Tabular Data?”, Et Al 2022
- “Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset”, Et Al 2022
- “RST: ReStructured Pre-training”, 2022
- “Learning to Generate Artistic Character Line Drawing”, Et Al 2022
- “Dataset Condensation via Efficient Synthetic-Data Parameterization”, Et Al 2022
- “Fine-grained Image Captioning With CLIP Reward”, Et Al 2022
- “InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning”, Et Al 2022
- “Flexible Diffusion Modeling of Long Videos”, Et Al 2022
- “Instruction Induction: From Few Examples to Natural Language Task Descriptions”, Et Al 2022
- “Housekeep: Tidying Virtual Households Using Commonsense Reasoning”, Et Al 2022
- “Down and Across: Introducing Crossword-Solving As a New NLP Benchmark”, Et Al 2022
- “Automated Crossword Solving”, Et Al 2022
- “Dialog Inpainting: Turning Documents into Dialogues”, Et Al 2022
- “SymphonyNet: Symphony Generation With Permutation Invariant Language Model”, Et Al 2022
- “When Does Dough Become a Bagel? Analyzing the Remaining Mistakes on ImageNet”, Et Al 2022
- “Building Machine Translation Systems for the Next Thousand Languages”, Et Al 2022
- “Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP)”, Et Al 2022
- “A Challenging Benchmark of Anime Style Recognition”, Et Al 2022
- “TK-Instruct: Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks”, Et Al 2022
- “Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality”, Et Al 2022
- “ByT5 Model for Massively Multilingual Grapheme-to-phoneme Conversion”, Et Al 2022
- “KNN-Diffusion: Image Generation via Large-Scale Retrieval”, Et Al 2022
- “STaR: Bootstrapping Reasoning With Reasoning”, Et Al 2022
- “CLIP Meets GamePhysics: Towards Bug Identification in Gameplay Videos Using Zero-shot Transfer Learning”, Et Al 2022
- “Bamboo: Building Mega-Scale Vision Dataset Continually With Human-Machine Synergy”, Et Al 2022
- “Self-Distilled StyleGAN: Towards Generation from Internet Photos”, Et Al 2022
- “RuCLIP—new Models and Experiments: a Technical Report”, Et Al 2022
- “Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework”, Et Al 2022
- “ROME: Locating and Editing Factual Associations in GPT”, Et Al 2022
- “DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers”, Et Al 2022
- “PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts”, Et Al 2022
- “StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets”, Et Al 2022
- “Can Wikipedia Help Offline Reinforcement Learning?”, Et Al 2022
- “BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation”, Et Al 2022
- “SWAG: Revisiting Weakly Supervised Pre-Training of Visual Perception Models”, Et Al 2022
- “CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities”, Et Al 2022
- “WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation”, Et Al 2022
- “SynthBio: A Case Study in Faster Curation of Text Datasets”, Et Al 2022
- “BigDatasetGAN: Synthesizing ImageNet With Pixel-wise Annotations”, Et Al 2022
- “ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation”, Et Al 2021
- “A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision”, Et Al 2021
- “GLIDE: Towards Photorealistic Image Generation and Editing With Text-Guided Diffusion Models”, Et Al 2021
- “WebGPT: Browser-assisted Question-answering With Human Feedback”, Et Al 2021
- “Models in the Loop: Aiding Crowdworkers With Generative Annotation Assistants”, Et Al 2021
- “FRUIT: Faithfully Reflecting Updated Information in Text”, IV Et Al 2021
- “MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions”, Et Al 2021
- “BASIC: Combined Scaling for Open-Vocabulary Image Classification”, Et Al 2021
- “Few-Shot Self-Rationalization With Natural Language Prompts”, Et Al 2021
- “Solving Probability and Statistics Problems by Program Synthesis”, Et Al 2021
- “AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment”, Et Al 2021
- “LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs”, Et Al 2021
- “An Explanation of In-context Learning As Implicit Bayesian Inference”, Et Al 2021
- “Training Verifiers to Solve Math Word Problems”, Et Al 2021
- “A Connectome of The Drosophila Central Complex Reveals Network Motifs Suitable for Flexible Navigation and Context-dependent Action Selection”, Et Al 2021
- “HTCN: Harmonious Text Colorization Network for Visual-Textual Presentation Design”, Et Al 2021
- “T0: Multitask Prompted Training Enables Zero-Shot Task Generalization”, Et Al 2021
- “MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research”, Et Al 2021
- “LAION-400-Million Open Dataset”, 2021
- “Transfer Learning for Pose Estimation of Illustrated Characters”, 2021
- “Scaling Vision Transformers”, Et Al 2021
- “SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network”, Et Al 2021
- “Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, Et Al 2021
- “NaturalProofs: Mathematical Theorem Proving in Natural Language”, Et Al 2021
- “Get Your Vitamin C! Robust Fact Verification With Contrastive Evidence (VitaminC)”, Et Al 2021
- “Are NLP Models Really Able to Solve Simple Math Word Problems?”, Et Al 2021
- “Measuring Mathematical Problem Solving With the MATH Dataset”, Et Al 2021
- “WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning”, Et Al 2021
- “A Massive 7T FMRI Dataset to Bridge Cognitive and Computational Neuroscience”, Et Al 2021
- “Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts”, Et Al 2021
- “ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision”, Et Al 2021
- “Scaling Laws for Transfer”, Et Al 2021
- “MSR-VTT: A Large Video Description Dataset for Bridging Video and Language”, Et Al 2021
- “CLIP: Connecting Text and Images: We’re Introducing a Neural Network Called CLIP Which Efficiently Learns Visual Concepts from Natural Language Supervision. CLIP Can Be Applied to Any Visual Classification Benchmark by Simply Providing the Names of the Visual Categories to Be Recognized, Similar to the “Zero-shot” Capabilities of GPT-2 and GPT-3”, Et Al 2021
- “Learning Transferable Visual Models From Natural Language Supervision”, Et Al 2021
- “The Pile: An 800GB Dataset of Diverse Text for Language Modeling”, Et Al 2021
- “End-to-End Chinese Landscape Painting Creation Using Generative Adversarial Networks”, 2020
- “Open-Domain Question Answering Goes Conversational via Question Rewriting”, Et Al 2020
- “Digital Voicing of Silent Speech”, 2020
- “The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization”, Et Al 2020
- “The NetHack Learning Environment”, Et Al 2020
- “Anime Crop Datasets: Faces, Figures, & Hands”, Et Al 2020
- “TyDiQA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages”, Et Al 2020
- “SAYCam: A Large, Longitudinal Audiovisual Dataset Recorded from the Infant’s Perspective”, Et Al 2020
- “ImageNet-A: Natural Adversarial Examples”, Et Al 2020
- “Libri-Light: A Benchmark for ASR With Limited or No Supervision”, Et Al 2019
- “How Can We Know What Language Models Know?”, Et Al 2019
- “Compressive Transformers for Long-Range Sequence Modelling”, Et Al 2019
- “T5: Exploring the Limits of Transfer Learning With a Unified Text-to-Text Transformer”, Et Al 2019
- “CATER: A Diagnostic Dataset for Compositional Actions and TEmporal Reasoning”, 2019
- “PubMedQA: A Dataset for Biomedical Research Question Answering”, Et Al 2019
- “ObjectNet: A Large-scale Bias-controlled Dataset for Pushing the Limits of Object Recognition Models”, Et Al 2019
- “No Press Diplomacy: Modeling Multi-Agent Gameplay”, Et Al 2019
- “LVIS: A Dataset for Large Vocabulary Instance Segmentation”, Et Al 2019
- “A Large Single-participant FMRI Dataset for Probing Brain Responses to Naturalistic Stimuli in Space and Time”, Et Al 2019
- “OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge”, Et Al 2019
- “ImageNet-Sketch: Learning Robust Global Representations by Penalizing Local Predictive Power”, Et Al 2019
- “SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems”, Et Al 2019
- “ProductNet: a Collection of High-Quality Datasets for Product Representation Learning”, Et Al 2019
- “Benchmarking Neural Network Robustness to Common Corruptions and Perturbations”, 2019
- “A Replication Study: Machine Learning Models Are Capable of Predicting Sexual Orientation From Facial Images”, 2019
- “Language Models Are Unsupervised Multitask Learners”, Et Al 2019
- “Do We Train on Test Data? Purging CIFAR of Near-Duplicates”, 2019
- “FIGR: Few-shot Image Generation With Reptile”, 2019
- “A Style-Based Generator Architecture for Generative Adversarial Networks”, Et Al 2018
- “ImageNet-trained CNNs Are Biased towards Texture; Increasing Shape Bias Improves Accuracy and Robustness”, Et Al 2018
- “The Open Images Dataset V4: Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale”, Et Al 2018
- “CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge”, Et Al 2018
- “Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization”, Et Al 2018
- “A Short Note about Kinetics-600”, Et Al 2018
- “CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, Et Al 2018
- “Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations”, 2018
- “Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning”, Et Al 2018
- “BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”, Et Al 2018
- “Exploring the Limits of Weakly Supervised Pretraining”, Et Al 2018
- “GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding”, Et Al 2018
- “The Sound of Pixels”, Et Al 2018
- “Think You Have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge”, Et Al 2018
- “FEVER: a Large-scale Dataset for Fact Extraction and VERification”, Et Al 2018
- “11K Hands: Gender Recognition and Biometric Identification Using a Large Dataset of Hand Images”, 2017
- “WebVision Database: Visual Learning and Understanding from Web Data”, Et Al 2017
- “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, Et Al 2017
- “Driver Identification Using Automobile Sensor Data from a Single Turn”, Et Al 2017
- “The Kinetics Human Action Video Dataset”, Et Al 2017
- “Dense-Captioning Events in Videos”, Et Al 2017
- “RACE: Large-scale ReAding Comprehension Dataset From Examinations”, Et Al 2017
- “Pointer Sentinel Mixture Models”, Et Al 2016
- “Solving General Arithmetic Word Problems”, 2016
- “The LAMBADA Dataset: Word Prediction Requiring a Broad Discourse Context”, Et Al 2016
- “Convolutional Sketch Inversion”, Et Al 2016
- “Danbooru2021: A Large-Scale Crowdsourced and Tagged Anime Illustration Dataset”, 2015
- “Neural Module Networks”, Et Al 2015
- “Sketch-based Manga Retrieval Using Manga109 Dataset”, Et Al 2015
- “LSUN: Construction of a Large-scale Image Dataset Using Deep Learning With Humans in the Loop”, Et Al 2015
- “YFCC100M: The New Data in Multimedia Research”, Et Al 2015
- “ImageNet Large Scale Visual Recognition Challenge”, Et Al 2014
- “Microsoft COCO: Common Objects in Context”, Et Al 2014
- “UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild”, Et Al 2012
- “Solving Math Word Problems: We’ve Trained a System That Solves Grade School Math Problems With Nearly Twice the Accuracy of a Fine-tuned GPT-3 Model. It Solves about 90% As Many Problems As Real Kids: a Small Sample of 9-12 Year Olds Scored 60% on a Test from Our Dataset, While Our System Scored 55% on Those Same Problems. This Is Important Because Today’s AI Is Still Quite Weak at Commonsense Multistep Reasoning, Which Is Easy Even for Grade School Kids. We Achieved These Results by Training Our Model to Recognize Its Mistakes, so That It Can Try Repeatedly Until It Finds a Solution That Works”
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Links
“Interactive-Chain-Prompting (INTERCPT): Ambiguity Resolution for Crosslingual Conditional Generation With Interaction”, Et Al 2023
“Interactive-Chain-Prompting (INTERCPT): Ambiguity Resolution for Crosslingual Conditional Generation with Interaction”, 2023-01-24
“The Semantic Scholar Open Data Platform”, Et Al 2023
“The Semantic Scholar Open Data Platform”, 2023-01-24
“How Close Is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection”, Et Al 2023
“How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection”, 2023-01-18
“Med-PaLM: Large Language Models Encode Clinical Knowledge”, Et Al 2022
“Med-PaLM: Large Language Models Encode Clinical Knowledge”, 2022-12-26
“Unnatural Instructions: Tuning Language Models With (Almost) No Human Labor”, Et Al 2022
“Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor”, 2022-12-19
“A Whack-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others”, Et Al 2022
“A Whack-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others”, 2022-12-09
“The Stack: 3 TB of Permissively Licensed Source Code”, Et Al 2022
“The Stack: 3 TB of permissively licensed source code”, 2022-11-20
“UniSumm: Unified Few-shot Summarization With Multi-Task Pre-Training and Prefix-Tuning”, Et Al 2022
“UniSumm: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning”, 2022-11-17
“A Creative Industry Image Generation Dataset Based on Captions”, Et Al 2022
“A Creative Industry Image Generation Dataset Based on Captions”, 2022-11-16
“AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities”, Et Al 2022
“AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities”, 2022-11-12
“MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation”, Et Al 2022
“MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation”, 2022-11-10
“AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies”, Et Al 2022
“AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies”, 2022-11-10
“BLOOMZ/mT0: Crosslingual Generalization through Multitask Finetuning”, Et Al 2022
“BLOOMZ/mT0: Crosslingual Generalization through Multitask Finetuning”, 2022-11-03
“Dungeons and Data: A Large-Scale NetHack Dataset”, Et Al 2022
“Dungeons and Data: A Large-Scale NetHack Dataset”, 2022-11-01
“Will We Run out of Data? An Analysis of the Limits of Scaling Datasets in Machine Learning”, Et Al 2022
“Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning”, 2022-10-26
“Large Language Models Can Self-Improve”, Et Al 2022
“Large Language Models Can Self-Improve”, 2022-10-20
“CARP: Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning”, Et Al 2022
“CARP: Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning”, 2022-10-14
“MTEB: Massive Text Embedding Benchmark”, Et Al 2022
“MTEB: Massive Text Embedding Benchmark”, 2022-10-13
“Most Language Models Can Be Poets Too: An AI Writing Assistant and Constrained Text Generation Studio”, Et Al 2022
“Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio”, 2022-10-12
“Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning”, Et Al 2022
“Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning”, 2022-09-29
“Brain Imaging Generation With Latent Diffusion Models”, Et Al 2022
“Brain Imaging Generation with Latent Diffusion Models”, 2022-09-15
“PaLI: A Jointly-Scaled Multilingual Language-Image Model”, Et Al 2022
“PaLI: A Jointly-Scaled Multilingual Language-Image Model”, 2022-09-14
“FOLIO: Natural Language Reasoning With First-Order Logic”, Et Al 2022
“FOLIO: Natural Language Reasoning with First-Order Logic”, 2022-09-02
“Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Et Al 2022
“Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, 2022-08-25
“Bugs in the Data: How ImageNet Misrepresents Biodiversity”, 2022
“Bugs in the Data: How ImageNet Misrepresents Biodiversity”, 2022-08-24
“Discovering Bugs in Vision Models Using Off-the-shelf Image Generation and Captioning”, Et Al 2022
“Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning”, 2022-08-18
“Benchmarking Compositionality With Formal Languages”, Et Al 2022
“Benchmarking Compositionality with Formal Languages”, 2022-08-17
“Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP”, Et Al 2022
“Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP”, 2022-08-10
“Few-shot Adaptation Works With UnpredicTable Data”, Et Al 2022
“Few-shot Adaptation Works with UnpredicTable Data”, 2022-08-01
“Language Models Can Teach Themselves to Program Better”, Et Al 2022
“Language Models Can Teach Themselves to Program Better”, 2022-07-29
“NewsStories: Illustrating Articles With Visual Summaries”, Et Al 2022
“NewsStories: Illustrating articles with visual summaries”, 2022-07-26
“CelebV-HQ: A Large-Scale Video Facial Attributes Dataset”, Et Al 2022
“CelebV-HQ: A Large-Scale Video Facial Attributes Dataset”, 2022-07-25
“Why Do Tree-based Models Still Outperform Deep Learning on Tabular Data?”, Et Al 2022
“Why do tree-based models still outperform deep learning on tabular data?”, 2022-07-18
“Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset”, Et Al 2022
“Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset”, 2022-07-01
“RST: ReStructured Pre-training”, 2022
“RST: reStructured Pre-training”, 2022-06-22
“Learning to Generate Artistic Character Line Drawing”, Et Al 2022
“Learning to Generate Artistic Character Line Drawing”, 2022-06-07
“Dataset Condensation via Efficient Synthetic-Data Parameterization”, Et Al 2022
“Dataset Condensation via Efficient Synthetic-Data Parameterization”, 2022-05-30
“Fine-grained Image Captioning With CLIP Reward”, Et Al 2022
“Fine-grained Image Captioning with CLIP Reward”, 2022-05-26
“InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning”, Et Al 2022
“InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning”, 2022-05-25
“Flexible Diffusion Modeling of Long Videos”, Et Al 2022
“Flexible Diffusion Modeling of Long Videos”, 2022-05-23
“Instruction Induction: From Few Examples to Natural Language Task Descriptions”, Et Al 2022
“Instruction Induction: From Few Examples to Natural Language Task Descriptions”, 2022-05-22
“Housekeep: Tidying Virtual Households Using Commonsense Reasoning”, Et Al 2022
“Housekeep: Tidying Virtual Households using Commonsense Reasoning”, 2022-05-22
“Down and Across: Introducing Crossword-Solving As a New NLP Benchmark”, Et Al 2022
“Down and Across: Introducing Crossword-Solving as a New NLP Benchmark”, 2022-05-20
“Automated Crossword Solving”, Et Al 2022
“Automated Crossword Solving”, 2022-05-19
“Dialog Inpainting: Turning Documents into Dialogues”, Et Al 2022
“Dialog Inpainting: Turning Documents into Dialogues”, 2022-05-18
“SymphonyNet: Symphony Generation With Permutation Invariant Language Model”, Et Al 2022
“SymphonyNet: Symphony Generation with Permutation Invariant Language Model”, 2022-05-10
“When Does Dough Become a Bagel? Analyzing the Remaining Mistakes on ImageNet”, Et Al 2022
“When does dough become a bagel? Analyzing the remaining mistakes on ImageNet”, 2022-05-09
“Building Machine Translation Systems for the Next Thousand Languages”, Et Al 2022
“Building Machine Translation Systems for the Next Thousand Languages”, 2022-05-09
“Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP)”, Et Al 2022
“Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP)”, 2022-05-03
“A Challenging Benchmark of Anime Style Recognition”, Et Al 2022
“A Challenging Benchmark of Anime Style Recognition”, 2022-04-29
“TK-Instruct: Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks”, Et Al 2022
“Tk-Instruct: Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks”, 2022-04-16
“Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality”, Et Al 2022
“Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality”, 2022-04-07
“ByT5 Model for Massively Multilingual Grapheme-to-phoneme Conversion”, Et Al 2022
“ByT5 model for massively multilingual grapheme-to-phoneme conversion”, 2022-04-06
“KNN-Diffusion: Image Generation via Large-Scale Retrieval”, Et Al 2022
“KNN-Diffusion: Image Generation via Large-Scale Retrieval”, 2022-04-06
“STaR: Bootstrapping Reasoning With Reasoning”, Et Al 2022
“STaR: Bootstrapping Reasoning With Reasoning”, 2022-03-28
“CLIP Meets GamePhysics: Towards Bug Identification in Gameplay Videos Using Zero-shot Transfer Learning”, Et Al 2022
“CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot transfer learning”, 2022-03-21
“Bamboo: Building Mega-Scale Vision Dataset Continually With Human-Machine Synergy”, Et Al 2022
“Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy”, 2022-03-15
“Self-Distilled StyleGAN: Towards Generation from Internet Photos”, Et Al 2022
“Self-Distilled StyleGAN: Towards Generation from Internet Photos”, 2022-02-24
“RuCLIP—new Models and Experiments: a Technical Report”, Et Al 2022
“RuCLIP—new models and experiments: a technical report”, 2022-02-22
“Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework”, Et Al 2022
“Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework”, 2022-02-14
“ROME: Locating and Editing Factual Associations in GPT”, Et Al 2022
“ROME: Locating and Editing Factual Associations in GPT”, 2022-02-10
“DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers”, Et Al 2022
“DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers”, 2022-02-08
“PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts”, Et Al 2022
“PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts”, 2022-02-02
“StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets”, Et Al 2022
“StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets”, 2022-02-01
“Can Wikipedia Help Offline Reinforcement Learning?”, Et Al 2022
“Can Wikipedia Help Offline Reinforcement Learning?”, 2022-01-28
“BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation”, Et Al 2022
“BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation”, 2022-01-28
“SWAG: Revisiting Weakly Supervised Pre-Training of Visual Perception Models”, Et Al 2022
“SWAG: Revisiting Weakly Supervised Pre-Training of Visual Perception Models”, 2022-01-20
“CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities”, Et Al 2022
“CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities”, 2022-01-18
“WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation”, Et Al 2022
“WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation”, 2022-01-16
“SynthBio: A Case Study in Faster Curation of Text Datasets”, Et Al 2022
“SynthBio: A Case Study in Faster Curation of Text Datasets”, 2022-01-13
“BigDatasetGAN: Synthesizing ImageNet With Pixel-wise Annotations”, Et Al 2022
“BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations”, 2022-01-12
“ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation”, Et Al 2021
“ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation”, 2021-12-31
“A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision”, Et Al 2021
“A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision”, 2021-12-27
“GLIDE: Towards Photorealistic Image Generation and Editing With Text-Guided Diffusion Models”, Et Al 2021
“GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models”, 2021-12-20
“WebGPT: Browser-assisted Question-answering With Human Feedback”, Et Al 2021
“WebGPT: Browser-assisted question-answering with human feedback”, 2021-12-16
“Models in the Loop: Aiding Crowdworkers With Generative Annotation Assistants”, Et Al 2021
“Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants”, 2021-12-16
“FRUIT: Faithfully Reflecting Updated Information in Text”, IV Et Al 2021
“FRUIT: Faithfully Reflecting Updated Information in Text”, 2021-12-16
“MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions”, Et Al 2021
“MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions”, 2021-12-01
“BASIC: Combined Scaling for Open-Vocabulary Image Classification”, Et Al 2021
“BASIC: Combined Scaling for Open-Vocabulary Image Classification”, 2021-11-19
“Few-Shot Self-Rationalization With Natural Language Prompts”, Et Al 2021
“Few-Shot Self-Rationalization with Natural Language Prompts”, 2021-11-16
“Solving Probability and Statistics Problems by Program Synthesis”, Et Al 2021
“Solving Probability and Statistics Problems by Program Synthesis”, 2021-11-16
“AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment”, Et Al 2021
“AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment”, 2021-11-15
“LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs”, Et Al 2021
“LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs”, 2021-11-03
“An Explanation of In-context Learning As Implicit Bayesian Inference”, Et Al 2021
“An Explanation of In-context Learning as Implicit Bayesian Inference”, 2021-11-03
“Training Verifiers to Solve Math Word Problems”, Et Al 2021
“Training Verifiers to Solve Math Word Problems”, 2021-10-27
“A Connectome of The Drosophila Central Complex Reveals Network Motifs Suitable for Flexible Navigation and Context-dependent Action Selection”, Et Al 2021
“A connectome of the Drosophila central complex reveals network motifs suitable for flexible navigation and context-dependent action selection”, 2021-10-26
“HTCN: Harmonious Text Colorization Network for Visual-Textual Presentation Design”, Et Al 2021
“HTCN: Harmonious Text Colorization Network for Visual-Textual Presentation Design”, 2021-10-22
“T0: Multitask Prompted Training Enables Zero-Shot Task Generalization”, Et Al 2021
“T0: Multitask Prompted Training Enables Zero-Shot Task Generalization”, 2021-10-15
“MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research”, Et Al 2021
“MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research”, 2021-09-27
“LAION-400-Million Open Dataset”, 2021
“LAION-400-Million Open Dataset”, 2021-08-20
“Transfer Learning for Pose Estimation of Illustrated Characters”, 2021
“Transfer Learning for Pose Estimation of Illustrated Characters”, 2021-08-04
“Scaling Vision Transformers”, Et Al 2021
“Scaling Vision Transformers”, 2021-06-08
“SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network”, Et Al 2021
“SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network”, 2021-04-05
“Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, Et Al 2021
“Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, 2021-03-26
“NaturalProofs: Mathematical Theorem Proving in Natural Language”, Et Al 2021
“NaturalProofs: Mathematical Theorem Proving in Natural Language”, 2021-03-24
“Get Your Vitamin C! Robust Fact Verification With Contrastive Evidence (VitaminC)”, Et Al 2021
“Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence (VitaminC)”, 2021-03-15
“Are NLP Models Really Able to Solve Simple Math Word Problems?”, Et Al 2021
“Are NLP Models really able to Solve Simple Math Word Problems?”, 2021-03-12
“Measuring Mathematical Problem Solving With the MATH Dataset”, Et Al 2021
“Measuring Mathematical Problem Solving With the MATH Dataset”, 2021-03-05
“WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning”, Et Al 2021
“WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning”, 2021-03-02 ( ; similar)
“A Massive 7T FMRI Dataset to Bridge Cognitive and Computational Neuroscience”, Et Al 2021
“A massive 7T fMRI dataset to bridge cognitive and computational neuroscience”, 2021-02-22 ( ; backlinks; similar)
“Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts”, Et Al 2021
“Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts”, 2021-02-17 ( ; similar)
“ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision”, Et Al 2021
“ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision”, 2021-02-11 ( ; similar; bibliography)
“Scaling Laws for Transfer”, Et Al 2021
“Scaling Laws for Transfer”, 2021-02-02 ( ; similar)
“MSR-VTT: A Large Video Description Dataset for Bridging Video and Language”, Et Al 2021
“MSR-VTT: A Large Video Description Dataset for Bridging Video and Language”, 2021-01-17 ( ; similar)
“CLIP: Connecting Text and Images: We’re Introducing a Neural Network Called CLIP Which Efficiently Learns Visual Concepts from Natural Language Supervision. CLIP Can Be Applied to Any Visual Classification Benchmark by Simply Providing the Names of the Visual Categories to Be Recognized, Similar to the “Zero-shot” Capabilities of GPT-2 and GPT-3”, Et Al 2021
“CLIP: Connecting Text and Images: We’re introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision. CLIP can be applied to any visual classification benchmark by simply providing the names of the visual categories to be recognized, similar to the “zero-shot” capabilities of GPT-2 and GPT-3”, 2021-01-05 ( ; backlinks; similar; bibliography)
“Learning Transferable Visual Models From Natural Language Supervision”, Et Al 2021
“Learning Transferable Visual Models From Natural Language Supervision”, 2021-01-05 ( ; backlinks; similar; bibliography)
“The Pile: An 800GB Dataset of Diverse Text for Language Modeling”, Et Al 2021
“The Pile: An 800GB Dataset of Diverse Text for Language Modeling”, 2021 ( ; similar; bibliography)
“End-to-End Chinese Landscape Painting Creation Using Generative Adversarial Networks”, 2020
“End-to-End Chinese Landscape Painting Creation Using Generative Adversarial Networks”, 2020-11-11 ( ; backlinks; similar)
“Open-Domain Question Answering Goes Conversational via Question Rewriting”, Et Al 2020
“Open-Domain Question Answering Goes Conversational via Question Rewriting”, 2020-10-10 ( ; similar)
“Digital Voicing of Silent Speech”, 2020
“Digital Voicing of Silent Speech”, 2020-10-06 ( ; similar)
“The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization”, Et Al 2020
“The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization”, 2020-06-29 ( ; backlinks; similar)
“The NetHack Learning Environment”, Et Al 2020
“The NetHack Learning Environment”, 2020-06-24 ( ; similar)
“Anime Crop Datasets: Faces, Figures, & Hands”, Et Al 2020
“Anime Crop Datasets: Faces, Figures, & Hands”, 2020-05-10 ( ; backlinks; bibliography)
“TyDiQA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages”, Et Al 2020
“TyDiQA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages”, 2020-03-10 (backlinks; similar)
“SAYCam: A Large, Longitudinal Audiovisual Dataset Recorded from the Infant’s Perspective”, Et Al 2020
“SAYCam: A large, longitudinal audiovisual dataset recorded from the infant’s perspective”, 2020-01-14 ( ; similar)
“ImageNet-A: Natural Adversarial Examples”, Et Al 2020
“ImageNet-A: Natural Adversarial Examples”, 2020-01-08 ( ; backlinks; similar)
“Libri-Light: A Benchmark for ASR With Limited or No Supervision”, Et Al 2019
“Libri-Light: A Benchmark for ASR with Limited or No Supervision”, 2019-12-17 ( ; similar)
“How Can We Know What Language Models Know?”, Et Al 2019
“How Can We Know What Language Models Know?”, 2019-11-28 ( ; backlinks; similar)
“Compressive Transformers for Long-Range Sequence Modelling”, Et Al 2019
“Compressive Transformers for Long-Range Sequence Modelling”, 2019-11-13 ( ; similar)
“T5: Exploring the Limits of Transfer Learning With a Unified Text-to-Text Transformer”, Et Al 2019
“T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer”, 2019-10-23 ( ; similar)
“CATER: A Diagnostic Dataset for Compositional Actions and TEmporal Reasoning”, 2019
“CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning”, 2019-10-10 ( ; backlinks; similar)
“PubMedQA: A Dataset for Biomedical Research Question Answering”, Et Al 2019
“PubMedQA: A Dataset for Biomedical Research Question Answering”, 2019-09-13 ( ; backlinks; similar)
“ObjectNet: A Large-scale Bias-controlled Dataset for Pushing the Limits of Object Recognition Models”, Et Al 2019
“ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models”, 2019-09-06 ( ; backlinks; similar)
“No Press Diplomacy: Modeling Multi-Agent Gameplay”, Et Al 2019
“No Press Diplomacy: Modeling Multi-Agent Gameplay”, 2019-09-04 ( ; similar)
“LVIS: A Dataset for Large Vocabulary Instance Segmentation”, Et Al 2019
“LVIS: A Dataset for Large Vocabulary Instance Segmentation”, 2019-08-08 (similar)
“A Large Single-participant FMRI Dataset for Probing Brain Responses to Naturalistic Stimuli in Space and Time”, Et Al 2019
“A large single-participant fMRI dataset for probing brain responses to naturalistic stimuli in space and time”, 2019-07-02 ( ; similar)
“OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge”, Et Al 2019
“OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge”, 2019-05-31 ( ; similar)
“ImageNet-Sketch: Learning Robust Global Representations by Penalizing Local Predictive Power”, Et Al 2019
“ImageNet-Sketch: Learning Robust Global Representations by Penalizing Local Predictive Power”, 2019-05-29 ( ; backlinks; similar)
“SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems”, Et Al 2019
“SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems”, 2019-05-02 ( ; backlinks; similar; bibliography)
“ProductNet: a Collection of High-Quality Datasets for Product Representation Learning”, Et Al 2019
“ProductNet: a Collection of High-Quality Datasets for Product Representation Learning”, 2019-04-18 ( ; similar)
“Benchmarking Neural Network Robustness to Common Corruptions and Perturbations”, 2019
“Benchmarking Neural Network Robustness to Common Corruptions and Perturbations”, 2019-03-28 ( ; backlinks; similar)
“A Replication Study: Machine Learning Models Are Capable of Predicting Sexual Orientation From Facial Images”, 2019
“A Replication Study: Machine Learning Models Are Capable of Predicting Sexual Orientation From Facial Images”, 2019-02-27 ( ; backlinks; similar)
“Language Models Are Unsupervised Multitask Learners”, Et Al 2019
“Language Models are Unsupervised Multitask Learners”, 2019-02-14 ( ; similar)
“Do We Train on Test Data? Purging CIFAR of Near-Duplicates”, 2019
“Do We Train on Test Data? Purging CIFAR of Near-Duplicates”, 2019-02-01 ( ; similar)
“FIGR: Few-shot Image Generation With Reptile”, 2019
“FIGR: Few-shot Image Generation with Reptile”, 2019-01-08 ( ; backlinks; similar)
“A Style-Based Generator Architecture for Generative Adversarial Networks”, Et Al 2018
“A Style-Based Generator Architecture for Generative Adversarial Networks”, 2018-12-12 ( ; similar)
“ImageNet-trained CNNs Are Biased towards Texture; Increasing Shape Bias Improves Accuracy and Robustness”, Et Al 2018
“ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness”, 2018-11-29 ( ; backlinks; similar)
“The Open Images Dataset V4: Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale”, Et Al 2018
“The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale”, 2018-11-02 ( ; similar)
“CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge”, Et Al 2018
“CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge”, 2018-11-02 ( ; similar)
“Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization”, Et Al 2018
“Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization”, 2018-08-27 ( ; backlinks; similar)
“A Short Note about Kinetics-600”, Et Al 2018
“A Short Note about Kinetics-600”, 2018-08-03 ( ; similar; bibliography)
“CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, Et Al 2018
“CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, 2018-08-03 ( ; backlinks; similar; bibliography)
“Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations”, 2018
“Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations”, 2018-07-04 ( ; backlinks; similar)
“BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”, Et Al 2018
“BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”, 2018-05-12 ( ; backlinks; similar)
“Exploring the Limits of Weakly Supervised Pretraining”, Et Al 2018
“Exploring the Limits of Weakly Supervised Pretraining”, 2018-05-02 ( ; similar; bibliography)
“GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding”, Et Al 2018
“GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding”, 2018-04-20 ( ; backlinks; similar)
“The Sound of Pixels”, Et Al 2018
“The Sound of Pixels”, 2018-04-09 ( ; similar)
“Think You Have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge”, Et Al 2018
“Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge”, 2018-03-14 ( ; similar)
“FEVER: a Large-scale Dataset for Fact Extraction and VERification”, Et Al 2018
“FEVER: a large-scale dataset for Fact Extraction and VERification”, 2018-03-14 ( ; backlinks; similar)
“11K Hands: Gender Recognition and Biometric Identification Using a Large Dataset of Hand Images”, 2017
“11K Hands: Gender recognition and biometric identification using a large dataset of hand images”, 2017-11-12 ( ; backlinks; similar)
“WebVision Database: Visual Learning and Understanding from Web Data”, Et Al 2017
“WebVision Database: Visual Learning and Understanding from Web Data”, 2017-08-09 ( ; backlinks; similar)
“Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, Et Al 2017
“Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, 2017-07-10 ( ; similar)
“Driver Identification Using Automobile Sensor Data from a Single Turn”, Et Al 2017
“Driver Identification Using Automobile Sensor Data from a Single Turn”, 2017-06-09 ( ; backlinks; similar)
“The Kinetics Human Action Video Dataset”, Et Al 2017
“The Kinetics Human Action Video Dataset”, 2017-05-19 ( ; similar)
“Dense-Captioning Events in Videos”, Et Al 2017
“Dense-Captioning Events in Videos”, 2017-05-02 ( ; backlinks; similar)
“RACE: Large-scale ReAding Comprehension Dataset From Examinations”, Et Al 2017
“RACE: Large-scale ReAding Comprehension Dataset From Examinations”, 2017-04-15 ( ; backlinks; similar)
“Pointer Sentinel Mixture Models”, Et Al 2016
“Pointer Sentinel Mixture Models”, 2016-09-26 ( ; backlinks; similar)
“Solving General Arithmetic Word Problems”, 2016
“Solving General Arithmetic Word Problems”, 2016-08-04 ( ; backlinks; similar)
“The LAMBADA Dataset: Word Prediction Requiring a Broad Discourse Context”, Et Al 2016
“The LAMBADA dataset: Word prediction requiring a broad discourse context”, 2016-06-20 ( ; backlinks; similar)
“Convolutional Sketch Inversion”, Et Al 2016
“Convolutional Sketch Inversion”, 2016-06-09 ( ; similar)
“Danbooru2021: A Large-Scale Crowdsourced and Tagged Anime Illustration Dataset”, 2015
“Danbooru2021: A Large-Scale Crowdsourced and Tagged Anime Illustration Dataset”, 2015-12-15 ( ; backlinks; bibliography)
“Neural Module Networks”, Et Al 2015
“Neural Module Networks”, 2015-11-09 ( ; backlinks; similar)
“Sketch-based Manga Retrieval Using Manga109 Dataset”, Et Al 2015
“Sketch-based Manga Retrieval using Manga109 Dataset”, 2015-10-15 ( ; backlinks; similar)
“LSUN: Construction of a Large-scale Image Dataset Using Deep Learning With Humans in the Loop”, Et Al 2015
“LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop”, 2015-06-10 ( ; backlinks; similar)
“YFCC100M: The New Data in Multimedia Research”, Et Al 2015
“YFCC100M: The New Data in Multimedia Research”, 2015-03-05 ( ; similar)
“ImageNet Large Scale Visual Recognition Challenge”, Et Al 2014
“ImageNet Large Scale Visual Recognition Challenge”, 2014-09-01 ( ; backlinks; similar)
“Microsoft COCO: Common Objects in Context”, Et Al 2014
“Microsoft COCO: Common Objects in Context”, 2014-05-01 ( ; similar)
“UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild”, Et Al 2012
“UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild”, 2012-12-03 ( ; backlinks; similar)
“Solving Math Word Problems: We’ve Trained a System That Solves Grade School Math Problems With Nearly Twice the Accuracy of a Fine-tuned GPT-3 Model. It Solves about 90% As Many Problems As Real Kids: a Small Sample of 9-12 Year Olds Scored 60% on a Test from Our Dataset, While Our System Scored 55% on Those Same Problems. This Is Important Because Today’s AI Is Still Quite Weak at Commonsense Multistep Reasoning, Which Is Easy Even for Grade School Kids. We Achieved These Results by Training Our Model to Recognize Its Mistakes, so That It Can Try Repeatedly Until It Finds a Solution That Works”
“Solving Math Word Problems: We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems as real kids: a small sample of 9-12 year olds scored 60% on a test from our dataset, while our system scored 55% on those same problems. This is important because today’s AI is still quite weak at commonsense multistep reasoning, which is easy even for grade school kids. We achieved these results by training our model to recognize its mistakes, so that it can try repeatedly until it finds a solution that works” ( ; backlinks)
Wikipedia
Miscellaneous
Link Bibliography
- https://arxiv.org/abs/2212.13138#google: “Med-PaLM: Large Language Models Encode Clinical Knowledge”
- https://arxiv.org/abs/2211.15533: “The Stack: 3 TB of Permissively Licensed Source Code”
- https://arxiv.org/abs/2211.06679#baai: “AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities”, Zhongzhi Chen, Guang Liu, Bo-Wen Zhang, Fulong Ye, Qinghong Yang, Ledell Wu
- https://arxiv.org/abs/2210.11610#google: “Large Language Models Can Self-Improve”, Jiaxin Huang, Shixiang Shane Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han
- https://arxiv.org/abs/2210.07792#eleutherai: “CARP: Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning”
- https://aclanthology.org/2022.cai-1.2.pdf: “Most Language Models Can Be Poets Too: An AI Writing Assistant and Constrained Text Generation Studio”, Allen Roush, Sanjay Basu, Akshay Moorthy, Dmitry Dubovoy
- https://arxiv.org/abs/2209.00840: “FOLIO: Natural Language Reasoning With First-Order Logic”
- https://www.anthropic.com/red_teaming.pdf: “Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”
- https://arxiv.org/abs/2208.08831#deepmind: “Discovering Bugs in Vision Models Using Off-the-shelf Image Generation and Captioning”, Olivia Wiles, Isabela Albuquerque, Sven Gowal
- https://arxiv.org/abs/2208.05516: “Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP”, Thao Nguyen, Gabriel Ilharco, Mitchell Wortsman, Sewoong Oh, Ludwig Schmidt
- https://arxiv.org/abs/2207.13061: “NewsStories: Illustrating Articles With Visual Summaries”, Reuben Tan, Bryan A. Plummer, Kate Saenko, J. P. Lewis, Avneesh Sud, Thomas Leung
- https://arxiv.org/abs/2205.09665#bair: “Automated Crossword Solving”, Eric Wallace, Nicholas Tomlin, Albert Xu, Kevin Yang, Eshaan Pathak, Matthew Ginsberg, Dan Klein
- https://arxiv.org/abs/2205.09073#google: “Dialog Inpainting: Turning Documents into Dialogues”, Zhuyun Dai, Arun Tejasvi Chaganty, Vincent Zhao, Aida Amini, Qazi Mamunur Rashid, Mike Green, Kelvin Guu
- https://arxiv.org/abs/2205.04596#google: “When Does Dough Become a Bagel? Analyzing the Remaining Mistakes on ImageNet”, Vijay Vasudevan, Benjamin Caine, Raphael Gontijo-Lopes, Sara Fridovich-Keil, Rebecca Roelofs
- https://arxiv.org/abs/2205.03983#google: “Building Machine Translation Systems for the Next Thousand Languages”
- https://arxiv.org/abs/2205.01397: “Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP)”, Alex Fang, Gabriel Ilharco, Mitchell Wortsman, Yuhao Wan, Vaishaal Shankar, Achal Dave, Ludwig Schmidt
- https://arxiv.org/abs/2204.07705: “Tk-Instruct: Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks”
- https://arxiv.org/abs/2204.03067: “ByT5 Model for Massively Multilingual Grapheme-to-phoneme Conversion”, Jian Zhu, Cong Zhang, David Jurgens
- https://arxiv.org/abs/2203.11096: “CLIP Meets GamePhysics: Towards Bug Identification in Gameplay Videos Using Zero-shot Transfer Learning”, Mohammad Reza Taesiri, Finlay Macklon, Cor-Paul Bezemer
- https://arxiv.org/abs/2202.12211#google: “Self-Distilled StyleGAN: Towards Generation from Internet Photos”, Ron Mokady, Michal Yarom, Omer Tov, Oran Lang, Daniel Cohen-Or, Tali Dekel, Michal Irani, Inbar Mosseri
- https://arxiv.org/abs/2202.06767#huawei: “Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework”
- https://arxiv.org/abs/2202.00273: “StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets”, Axel Sauer, Katja Schwarz, Andreas Geiger
- https://arxiv.org/abs/2201.12086#salesforce: “BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation”, Junnan Li, Dongxu Li, Caiming Xiong, Steven Hoi
- https://arxiv.org/abs/2201.08371#facebook: “SWAG: Revisiting Weakly Supervised Pre-Training of Visual Perception Models”
- https://swabhs.com/assets/pdf/wanli.pdf#allen: “WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation”, Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi
- https://arxiv.org/abs/2201.04684: “BigDatasetGAN: Synthesizing ImageNet With Pixel-wise Annotations”, Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Adela Barriuso, Sanja Fidler, Antonio Torralba
- https://arxiv.org/abs/2112.15283#baidu: “ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation”, Han Zhang, Weichong Yin, Yewei Fang, Lanxin Li, Boqiang Duan, Zhihua Wu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
- https://arxiv.org/abs/2112.09332#openai: “WebGPT: Browser-assisted Question-answering With Human Feedback”
- https://arxiv.org/abs/2111.10050#google: “BASIC: Combined Scaling for Open-Vocabulary Image Classification”
- https://arxiv.org/abs/2111.08267: “Solving Probability and Statistics Problems by Program Synthesis”, Leonard Tang, Elizabeth Ke, Nikhil Singh, Nakul Verma, Iddo Drori
- https://arxiv.org/abs/2111.02114#laion: “LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs”
- https://elifesciences.org/articles/66039: “A Connectome of the Drosophila Central Complex Reveals Network Motifs Suitable for Flexible Navigation and Context-dependent Action Selection”
- https://laion.ai/blog/laion-400-open-dataset/: “LAION-400-Million Open Dataset”, Christoph Schuhmann
- https://arxiv.org/abs/2106.04560#google: “Scaling Vision Transformers”, Xiaohua Zhai, Alexander Kolesnikov, Neil Houlsby, Lucas Beyer
- https://arxiv.org/abs/2103.14749: “Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks”, Curtis G. Northcutt, Anish Athalye, Jonas Mueller
- https://arxiv.org/abs/2102.05918#google: “ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision”
- https://openai.com/blog/clip/: “CLIP: Connecting Text and Images: We’re Introducing a Neural Network Called CLIP Which Efficiently Learns Visual Concepts from Natural Language Supervision. CLIP Can Be Applied to Any Visual Classification Benchmark by Simply Providing the Names of the Visual Categories to Be Recognized, Similar to the “Zero-shot” Capabilities of GPT-2 and GPT-3”, Alec Radford, Ilya Sutskever, Jong Wook Kim, Gretchen Krueger, Sandhini Agarwal
- https://cdn.openai.com/papers/Learning_Transferable_Visual_Models_From_Natural_Language_Supervision.pdf: “Learning Transferable Visual Models From Natural Language Supervision”
- crop: “Anime Crop Datasets: Faces, Figures, & Hands”, Gwern Branwen, Arfafax, Shawn Presser, Anonymous, Danbooru Community
- https://arxiv.org/abs/1905.00537: “SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems”
- https://arxiv.org/abs/1808.01340#deepmind: “A Short Note about Kinetics-600”, Joao Carreira, Eric Noland, Andras Banki-Horvath, Chloe Hillier, Andrew Zisserman
- https://arxiv.org/abs/1808.01097: “CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images”, Sheng Guo, Weilin Huang, Haozhi Zhang, Chenfan Zhuang, Dengke Dong, Matthew R. Scott, Dinglong Huang
- 2018-sharma.pdf#google: “Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning”, Piyush Sharma, Nan Ding, Sebastian Goodman, Radu Soricut
- https://arxiv.org/abs/1805.00932#facebook: “Exploring the Limits of Weakly Supervised Pretraining”
- danbooru2021: “Danbooru2021: A Large-Scale Crowdsourced and Tagged Anime Illustration Dataset”, Gwern Branwen