jsgreenawalt commited on
Commit
9763b64
1 Parent(s): 76f2c99

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ This merge is intended as an intermediate merge for further merges. It's useable
16
  The intuition behind this merge is as follows:
17
 
18
  We keep the top 65 percent of weight deltas from the WPO-HB fine tune at a (very near) 1.0 weight
19
- We 'flood fill' the remaining 45 percent of model weights with the SimPO weights. Because normalize is set to true, this results in a 1.0 weight from SimPO's deltas on any non-overlapping weights
20
  In cases of overlap with the top 65 weights from WPO-HB, the relative weight contribution for SimPO is near zero. In cases of non-overlap, each model gets a 1.0 or very near 1.0 weight for the merge.
21
 
22
  Per the mergekit docs:
 
16
  The intuition behind this merge is as follows:
17
 
18
  We keep the top 65 percent of weight deltas from the WPO-HB fine tune at a (very near) 1.0 weight
19
+ We 'flood fill' the remaining 45 percent of model weights with the SimPO weights. Because normalize is set to true, this results in a 1.0 weight from SimPO's deltas on any non-overlapping weights.
20
  In cases of overlap with the top 65 weights from WPO-HB, the relative weight contribution for SimPO is near zero. In cases of non-overlap, each model gets a 1.0 or very near 1.0 weight for the merge.
21
 
22
  Per the mergekit docs: