Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sert121
/
llama3-lora-aligned-orpo
like
0
PEFT
Safetensors
trl
orpo
Generated from Trainer
License:
cc-by-sa-4.0
Model card
Files
Files and versions
Community
Use this model
82e3d36
llama3-lora-aligned-orpo
1 contributor
History:
1 commit
sert121
initial commit
82e3d36
verified
3 months ago
.gitattributes
1.52 kB
initial commit
3 months ago