Hugging Face
Models
260
Sort: Trending
Active filters:
reward-trainer
santiviquez/reward_modeling_anthropic_hh • Text Classification • Updated Jun 13 • 6
mnoukhov/pythia160m-rm-tldr • Text Classification • Updated Jun 18 • 6
chandrasekhar319/reward_model_tinyllama_sql • Updated Jun 19
mnoukhov/pythia410m-rm-tldr6.9b • Text Classification • Updated Jun 20 • 630
trl-internal-testing/rm_160m • Text Classification • Updated Jun 20 • 6
vwxyzjn/rm_1b • Text Classification • Updated Jun 20 • 4
trl-internal-testing/rm_sentiment_1b • Text Classification • Updated Jun 25 • 5
SiMajid/value_reward_modeling • Text Classification • Updated Jun 21 • 5
SiMajid/deberta_value • Text Classification • Updated Jun 22 • 15
SiMajid/xlm-roberta-base • Text Classification • Updated Jun 21 • 5
SiMajid/opt-350-value • Text Classification • Updated Jun 22 • 35
trl-internal-testing/rm_descriptiveness_1b • Text Classification • Updated Jun 25 • 5
trl-internal-testing/rm_hh_1b • Text Classification • Updated Jun 26 • 5
trl-internal-testing/rm_tldr_1b • Text Classification • Updated Jun 26 • 7
smohammadi/tinyllama_rm_sentiment_1b • Text Classification • Updated Jun 28 • 11
prometheus04/tinystarcoder-rlhf-model • Text Generation • Updated Jun 29 • 2
Baidicoot/reward_modeling • Updated Jul 2 • 1
mnoukhov/pythia160m-rm-tldr6.9b • Text Classification • Updated Jul 4 • 5
mnoukhov/pythia1b-rm-tldr6.9b • Text Classification • Updated Jul 3 • 30
blai88/reward_modeling_anthropic_hh • Updated Jul 6 • 1
mnoukhov/pythia2.8b-rm-tldr6.9b • Text Classification • Updated Jul 7 • 116
steve-sli/0721_185958-google-gemma-2b • Updated Jul 21
steve-sli/0721_201833-google-gemma-2b • Updated Jul 21 • 1
steve-sli/0721_210648-google-gemma-2b • Updated Jul 21 • 1
steve-sli/0721_210856-google-gemma-2b • Updated Jul 21
steve-sli/0721_211205-google-gemma-2b • Updated Jul 21 • 1
steve-sli/0721_222324-google-gemma-2b • Updated Jul 21
SiMajid/value-reward-model-opt-350m-v3 • Text Classification • Updated Jul 23 • 7
SiMajid/value-reward-model-opt-350m-v11 • Text Classification • Updated Jul 25 • 7
SiMajid/value-reward-model-opt-350m-v12 • Text Classification • Updated Jul 25 • 6