Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
trl internal testing
company
Request to join this org
AI & ML interests
Internal testing artifact mangement for trl library
Team members
7
spaces
1
Runtime error
3
🚀
Rlhf Dialog Experiment
models
63
Sort: Recently updated
trl-internal-testing/tiny-random-Mistral-7B-Instruct-v0.3
Updated
7 days ago
trl-internal-testing/tiny-random-Mistral-7B-Instruct-v0.2
Updated
7 days ago
trl-internal-testing/tiny-random-Mistral-7B-Instruct-v0.1
Updated
7 days ago
trl-internal-testing/tiny-random-gemma-2-9b-it
Updated
7 days ago
trl-internal-testing/tiny-random-Phi-3-mini-128k-instruct
Updated
7 days ago
trl-internal-testing/tiny-random-DeepSeek-Coder-V2-Instruct
Updated
7 days ago
trl-internal-testing/tiny-random-Meta-Llama-3-8B-Instruct
Updated
7 days ago
trl-internal-testing/tiny-random-Meta-Llama-3.1-8B-Instruct
Updated
7 days ago
trl-internal-testing/tiny-random-Qwen2-7B-Instruct
Updated
7 days ago
trl-internal-testing/tiny-random-llava-1.5
Updated
Aug 16
•
15.7k
Expand 63 models
datasets
12
Sort: Recently updated
trl-internal-testing/example-images
Viewer
•
Updated
1 day ago
•
3
•
2
trl-internal-testing/zen
Viewer
•
Updated
5 days ago
•
228
•
627
trl-internal-testing/tldr-preference-sft-trl-style
Viewer
•
Updated
30 days ago
•
130k
•
5.91k
•
1
trl-internal-testing/sentiment-trl-style
Viewer
•
Updated
Aug 6
•
5.48k
•
263
trl-internal-testing/descriptiveness-trl-style
Viewer
•
Updated
Aug 6
•
5.42k
•
2
•
1
trl-internal-testing/tldr-preference-trl-style
Viewer
•
Updated
Jun 25
•
179k
•
762
•
2
trl-internal-testing/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
May 2
•
46.2k
•
14.1k
•
6
trl-internal-testing/descriptiveness-sentiment-trl-style
Viewer
•
Updated
Apr 9
•
10.9k
•
7.23k
•
1
trl-internal-testing/hh-rlhf-trl-style
Viewer
•
Updated
Mar 13
•
169k
•
1.06k
•
9
trl-internal-testing/Anthropic-hh-rlhf-processed
Viewer
•
Updated
Mar 13
•
3k
•
46
Expand 12 datasets