chargoddard commited on
Commit
bea3712
1 Parent(s): 4415449

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ datasets:
4
+ - pankajmathur/orca_mini_v1_dataset
5
+ - openai/summarize_from_feedback
6
+ - PygmalionAI/PIPPA
7
+ - chargoddard/rpguild
8
+ - lemonilia/LimaRP
9
+ - PKU-Alignment/PKU-SafeRLHF
10
+ - Intel/orca_dpo_pairs
11
+ - argilla/ultrafeedback-binarized-preferences
12
+ ---
13
+
14
+ Trained on a different random sampling of the same datasets used by [loyal-piano-m7](https://huggingface.co/chargoddard/loyal-piano-m7), then with cDPO on a blend of RLHF datasets.
15
+
16
+ Uses the Alpaca prompt format.