Bibliography (4):
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
Attention Is All You Need
Wikipedia Bibliography:
Maximum likelihood estimation