“Hermes 3 Technical Report”, 2024-08-15:
Instruct (or “chat”) tuned models have become the primary way in which most people interact with large language models. As opposed to “base” or “foundation” models, instruct-tuned models are optimized to respond to imperative statements.
We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities.
Its largest version, Hermes-3-405B, achieves state-of-the-art performance among open weight models on several public benchmarks.