QuantFactory/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor-GGUF
This is quantized version of kimhyeongjun/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor created using llama.cpp
Original Model Card
kimhyeongjun/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor
This is my personal toy project for Chuseok(Korean Thanksgiving Day).
This model is a fine-tuned version of NousResearch/Hermes-3-Llama-3.1-8B on the Korean_synthetic_financial_dataset_21K.
Model description
Everything happened automatically without any user intervention.
Based on finance PDF data collected directly from the web, we refined the raw data using the 'meta-llama/Meta-Llama-3.1-70B-Instruct-FP8' model. After generating synthetic data based on the cleaned data, we further evaluated the quality of the generated data using the 'meta-llama/Llama-Guard-3-8B' and 'RLHFlow/ArmoRM-Llama3-8B-v0.1' models. We then used 'Alibaba-NLP/gte-large-en-v1.5' to extract embeddings and applied Faiss to perform Jaccard distance-based nearest neighbor analysis to construct the final dataset of 21k, which is diverse and sophisticated.
๋ชจ๋ ๊ณผ์ ์ ์ฌ์ฉ์์ ๊ฐ์ ์์ด ์๋์ผ๋ก ์งํ๋์์ต๋๋ค.
์น์์ ์ง์ ์์งํ ๊ธ์ต ๊ด๋ จ PDF ๋ฐ์ดํฐ๋ฅผ ๊ธฐ๋ฐ์ผ๋ก, ๋์ด ์์ด์ 'meta-llama/Meta-Llama-3.1-70B-Instruct-FP8' ๋ชจ๋ธ์ ํ์ฉํ์ฌ Raw ๋ฐ์ดํฐ๋ฅผ ์ ์ ํ์์ต๋๋ค. ์ ์ ๋ ๋ฐ์ดํฐ๋ฅผ ๋ฐํ์ผ๋ก ํฉ์ฑ ๋ฐ์ดํฐ๋ฅผ ์์ฑํ ํ, 'meta-llama/Llama-Guard-3-8B' ๋ฐ 'RLHFlow/ArmoRM-Llama3-8B-v0.1' ๋ชจ๋ธ์ ํตํด ์์ฑ๋ ๋ฐ์ดํฐ์ ํ์ง์ ์ฌ์ธต์ ์ผ๋ก ํ๊ฐํ์์ต๋๋ค. ์ด์ด์ 'Alibaba-NLP/gte-large-en-v1.5'๋ฅผ ์ฌ์ฉํ์ฌ ์๋ฒ ๋ฉ์ ์ถ์ถํ๊ณ , Faiss๋ฅผ ์ ์ฉํ์ฌ ์์นด๋ ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ์ ๊ทผ์ ์ด์ ๋ถ์์ ์ํํจ์ผ๋ก์จ ๋ค์ํ๊ณ ์ ๊ตํ ์ต์ข ๋ฐ์ดํฐ์ 21k์ ์ง์ ๊ตฌ์ฑํ์์ต๋๋ค.
Task duration
3days (20240914~20240916)
evaluation
Nothing (I had to take the Thanksgiving holiday off.)
sample
Framework versions
- Transformers 4.44.2
- Pytorch 2.4.0+cu121
- Datasets 2.21.0
- Tokenizers 0.19.1
- Downloads last month
- 233
Model tree for QuantFactory/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor-GGUF
Base model
meta-llama/Llama-3.1-8B