“BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning”, Andreas Kirsch, Joost van Amersfoort, Yarin Gal2019-06-19 (; backlinks)⁠:

We develop BatchBALD, a tractable approximation to the mutual information between a batch of points and model parameters, which we use as an acquisition function to select multiple informative points jointly for the task of deep Bayesian active learning.

BatchBALD is a greedy linear-time 1 − 1⁄e-approximate algorithm amenable to dynamic programming and efficient caching. We compare BatchBALD to the commonly used approach for batch data acquisition and find that the current approach acquires similar and redundant points, sometimes performing worse than randomly acquiring data.

We finish by showing that, using BatchBALD to consider dependencies within an acquisition batch, we achieve new state-of-the-art performance on standard benchmarks, providing substantial data efficiency improvements in batch acquisition.