Bibliography (6):
https://x.com/avisingh599/status/1734603680933192089
ReST: Reinforced Self-Training (ReST) for Language Modeling
Measuring Mathematical Problem Solving With the MATH Dataset
PaLM: Scaling Language Modeling with Pathways
Wikipedia Bibliography:
https://en.wikipedia.org/wiki/Language_model :
https://en.wikipedia.org/wiki/Language_model
https://en.wikipedia.org/wiki/Expectation%E2%80%93maximization_algorithm :
https://en.wikipedia.org/wiki/Expectation%E2%80%93maximization_algorithm