metadata
pipeline_tag: text-generation
license: apache-2.0
language:
  - en
tags:
  - SOLAR-10.7B-v1.0
  - Open-platypus-Commercial
base_model: upstage/SOLAR-10.7B-v1.0
datasets:
  - kyujinpy/Open-platypus-Commercial
model-index:
  - name: T3Q-platypus-SOLAR-10.7B-v1.0
    results: []

Update @ 2024.03.07

T3Q-platypus-SOLAR-10.7B-v1.0

This model is a fine-tuned version of upstage/SOLAR-10.7B-v1.0, trained on the kyujinpy/Open-platypus-Commercial dataset.

Model Developers: Chihoon Lee (chlee10), T3Q

Training hyperparameters

The following hyperparameters were used during training:

  • batch_size = 16
  • num_epochs = 1
  • micro_batch = 1
  • cutoff_len = 4096
  • learning_rate = 4e-4
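The card does not say how batch_size and micro_batch relate. A minimal sketch of one common reading follows, assuming batch_size is the effective batch per optimizer step and micro_batch is the per-forward-pass batch, so the ratio gives the gradient accumulation steps. The names gradient_accumulation_steps, max_tokens_per_step, and optimizer_steps are illustrative and do not come from the original training script.

```python
import math

# Values copied from the hyperparameter list above.
batch_size = 16
micro_batch = 1
num_epochs = 1
cutoff_len = 4096
learning_rate = 4e-4

# Assumption: batch_size is the effective batch per optimizer step and
# micro_batch is the number of samples per forward/backward pass.
gradient_accumulation_steps = batch_size // micro_batch

# Upper bound on tokens per optimizer step if every sample reaches cutoff_len.
max_tokens_per_step = batch_size * cutoff_len


def optimizer_steps(num_examples: int) -> int:
    """Optimizer updates over num_epochs passes, counting a final partial batch."""
    steps_per_epoch = math.ceil(num_examples / batch_size)
    return steps_per_epoch * num_epochs
```

Under this reading, each optimizer update accumulates gradients over 16 micro-batches of one sample each. The dataset size passed to optimizer_steps is up to the caller; the card does not state it.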

Framework versions

  • Transformers 4.34.1
  • PyTorch 2.1.0+cu121
  • Datasets 2.13.0
  • Tokenizers 0.14.1
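To compare a local environment against the versions listed above, a small standard-library check can help. This is a sketch, not part of the original card: the PyPI distribution names in PINNED (for example torch for PyTorch) and the helper names installed_versions and mismatches are assumptions made for illustration.

```python
from importlib import metadata

# Versions copied from the list above; keys are assumed PyPI distribution names.
PINNED = {
    "transformers": "4.34.1",
    "torch": "2.1.0+cu121",
    "datasets": "2.13.0",
    "tokenizers": "0.14.1",
}


def installed_versions(packages):
    """Return {name: installed version, or None if the package is missing}."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def mismatches(pinned, installed):
    """Return {name: (wanted, found)} for every package whose version differs."""
    return {
        name: (wanted, installed.get(name))
        for name, wanted in pinned.items()
        if installed.get(name) != wanted
    }
```

Calling mismatches(PINNED, installed_versions(PINNED)) returns an empty dict when the environment matches the card exactly. A differing version is not necessarily a problem, since the card does not state which versions are required for inference.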