Abstract
Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state of the art performance among open weight models on several public benchmarks.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Following Length Constraints in Instructions (2024)
- LIONs: An Empirically Optimized Approach to Align Language Models (2024)
- Prompt Refinement or Fine-tuning? Best Practices for using LLMs in Computational Social Science Tasks (2024)
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models (2024)
- Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Though (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Who is Rishi Sunak?
Hey , may you answer me, what is the difference from the Lama3.1 model?
It would be super cool if you would release the datasets publicly.
Models citing this paper 34
Browse 34 models citing this paperDatasets citing this paper 0
No dataset linking this paper