FAR AI

non-profit

https://far.ai/

AlignmentResearch

Request to join this org

AI & ML interests

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

spaces 1

Tuned Lens

models 3176

AlignmentResearch/test_pythia-2.8b

Updated about 6 hours ago

AlignmentResearch/test_pythia-14m

Updated about 7 hours ago

AlignmentResearch/robust_llm_test_model_from_bert

Updated about 8 hours ago

AlignmentResearch/robust_llm_example_model

Updated 3 days ago

AlignmentResearch/robust_llm_pythia-14m_clf_imdb_v-ian-079a_s-4

Updated 3 days ago • 4

AlignmentResearch/robust_llm_pythia-14m_clf_imdb_v-ian-079a_s-2

Updated 3 days ago • 4

AlignmentResearch/robust_llm_clf_pm_pythia-12b_s-4_adv_tr_rt_t-4

Updated 3 days ago

AlignmentResearch/robust_llm_clf_spam_pythia-12b_s-0_adv_tr_rt_t-0

Updated 3 days ago

AlignmentResearch/robust_llm_clf_spam_pythia-12b_s-3_adv_tr_rt_t-3

Updated 5 days ago

AlignmentResearch/robust_llm_clf_spam_pythia-12b_s-1_adv_tr_rt_t-1

Updated 5 days ago

datasets 14

AlignmentResearch/WordLength

Viewer • Updated Aug 7 • 100k • 6.53k

AlignmentResearch/Harmless

Viewer • Updated Jul 29 • 86.6k • 3.95k

AlignmentResearch/Helpful

Viewer • Updated Jul 29 • 88.1k • 3.9k

AlignmentResearch/StrongREJECT

Viewer • Updated Jul 29 • 313 • 5.34k

AlignmentResearch/PasswordMatch

Viewer • Updated Jul 29 • 100k • 44.2k

AlignmentResearch/IMDB

Viewer • Updated Jul 29 • 97.5k • 43.1k

AlignmentResearch/EnronSpam

Viewer • Updated Jul 29 • 62.3k • 6.78k

AlignmentResearch/PasswordMatch-test

Viewer • Updated Jul 26 • 50k • 4

AlignmentResearch/WordLength-test

Viewer • Updated Jul 26 • 100k • 4

AlignmentResearch/StrongREJECT-test

Viewer • Updated Jul 26 • 313 • 1.65k