Bibliography (6):
https://github.com/probcomp/LLaMPPL
LLaMa-1: Open and Efficient Foundation Language Models
Attention Is All You Need
Wikipedia Bibliography:
Reinforcement learning
Monte Carlo method
Beam search