heegyu commited on
Commit
6217190
โ€ข
1 Parent(s): 3433fd2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -8,6 +8,11 @@ language:
8
  - ko
9
  ---
10
 
 
 
 
 
 
11
  - Base Model: [42dot/42dot_LLM-SFT-1.3B](https://huggingface.co/42dot/42dot_LLM-SFT-1.3B)
12
  - [v0.1](https://huggingface.co/heegyu/ko-reward-model-1.3b-v0.1) ๋ชจ๋ธ์€ helpful + safety๋ฅผ ๊ฐ™์ด ํ•™์Šตํ–ˆ๊ณ  safeํ•œ ๋‹ต๋ณ€์— ์ง€๋‚˜์น˜๊ฒŒ ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” ๊ฒฝํ–ฅ์ด ์žˆ์–ด์„œ ๋ถ„๋ฆฌ ํ›„ ๋”ฐ๋กœ ํ•™์Šตํ–ˆ์Šต๋‹ˆ๋‹ค.
13
  - ์ด ๋ชจ๋ธ์€ ์œค๋ฆฌ์ ์ธ ๋‹ต๋ณ€์— ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” safety ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ์œ ์šฉํ•˜๊ณ  ์ž์„ธํ•œ ๋‹ต๋ณ€์— ๋Œ€ํ•ด ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” helpful ๋ชจ๋ธ์€ [heegyu/ko-reward-model-helpful-1.3b-v0.2](https://huggingface.co/heegyu/ko-reward-model-helpful-1.3b-v0.2) <- ์ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์„ธ์š”
 
8
  - ko
9
  ---
10
 
11
+ <div align="center">
12
+ <div>&nbsp;</div>
13
+ <img src="./llama_judge.jpeg" width="400"/>
14
+ </div>
15
+
16
  - Base Model: [42dot/42dot_LLM-SFT-1.3B](https://huggingface.co/42dot/42dot_LLM-SFT-1.3B)
17
  - [v0.1](https://huggingface.co/heegyu/ko-reward-model-1.3b-v0.1) ๋ชจ๋ธ์€ helpful + safety๋ฅผ ๊ฐ™์ด ํ•™์Šตํ–ˆ๊ณ  safeํ•œ ๋‹ต๋ณ€์— ์ง€๋‚˜์น˜๊ฒŒ ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” ๊ฒฝํ–ฅ์ด ์žˆ์–ด์„œ ๋ถ„๋ฆฌ ํ›„ ๋”ฐ๋กœ ํ•™์Šตํ–ˆ์Šต๋‹ˆ๋‹ค.
18
  - ์ด ๋ชจ๋ธ์€ ์œค๋ฆฌ์ ์ธ ๋‹ต๋ณ€์— ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” safety ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ์œ ์šฉํ•˜๊ณ  ์ž์„ธํ•œ ๋‹ต๋ณ€์— ๋Œ€ํ•ด ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” helpful ๋ชจ๋ธ์€ [heegyu/ko-reward-model-helpful-1.3b-v0.2](https://huggingface.co/heegyu/ko-reward-model-helpful-1.3b-v0.2) <- ์ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์„ธ์š”