free-solar-dpo-v0.1 / README.md
freewheelin's picture
Update README.md
a58f258 verified
|
raw
history blame contribute delete
No virus
566 Bytes
---
language:
- ko
- en
license: mit
---
# Model Card for free-solar-dpo-v0.1
## Developed by : [Freewheelin](https://freewheelin-recruit.oopy.io/) AI Technical Team
## Hardware and Software
* **Training Factors**: We fine-tuned this model using the [HuggingFace TRL Trainer](https://huggingface.co/docs/trl/trainer)
## Method
- This model was trained using the learning method introduced in the [SOLAR paper](https://arxiv.org/pdf/2312.15166.pdf).
## Base Model
- [freewheelin/free-solar-slerp-v0.2](https://huggingface.co/freewheelin/free-solar-slerp-v0.2)