llm-blender/PairRM-hf

Dongfu Jiang committed on Jan 5

Commit 28afd59 (1 parent: bfd2da5)

Update README.md

Files changed (1)

README.md +1 -0

README.md CHANGED

@@ -81,6 +81,7 @@ print(comparison_results)
 # tensor([ True, False], device='cuda:0'), which means whether candidate A is better than candidate B for each input
 ```
+**We still recommend using the llm-blender wrapper to run PairRM, since it already implements many useful application functions for various scenarios, such as ranking, conversation comparison, and best-of-n sampling.**
 # Pairwise Reward Model for LLMs (PairRM) from LLM-Blender
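
The ranking and best-of-n functions named in the added line can both be derived from a pairwise comparator like the one shown in the README snippet above. The sketch below illustrates that idea only; it is not the llm-blender implementation. `toy_is_better`, `rank_candidates`, and `best_of_n` are hypothetical names, and the toy comparator (which simply prefers the longer answer) is an assumption standing in for a real PairRM call.

```python
from typing import Callable, List

# Hypothetical comparator signature: returns True when candidate `a`
# is judged better than candidate `b` for the given prompt.
Comparator = Callable[[str, str, str], bool]


def toy_is_better(prompt: str, a: str, b: str) -> bool:
    """Toy stand-in for a pairwise reward model (assumption): longer wins."""
    return len(a) > len(b)


def rank_candidates(
    prompt: str, candidates: List[str], is_better: Comparator
) -> List[int]:
    """Return 1-based ranks (1 = best) by counting pairwise wins."""
    wins = [
        sum(is_better(prompt, a, b) for j, b in enumerate(candidates) if j != i)
        for i, a in enumerate(candidates)
    ]
    # Sort candidate indices by descending win count; ties keep input order.
    order = sorted(range(len(candidates)), key=lambda i: -wins[i])
    ranks = [0] * len(candidates)
    for position, idx in enumerate(order, start=1):
        ranks[idx] = position
    return ranks


def best_of_n(prompt: str, candidates: List[str], is_better: Comparator) -> str:
    """Pick the top-ranked candidate out of n sampled responses."""
    ranks = rank_candidates(prompt, candidates, is_better)
    return candidates[ranks.index(1)]


if __name__ == "__main__":
    samples = ["ok", "a longer answer", "medium one"]
    print(rank_candidates("example prompt", samples, toy_is_better))
    print(best_of_n("example prompt", samples, toy_is_better))
```

Counting wins needs one comparison per ordered pair, so the cost grows quadratically with the number of candidates. That is one reason a maintained wrapper with batching is likely more convenient than hand-rolled loops.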