Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 45
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 60
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains Paper • 2402.10373 • Published Feb 15 • 9
Educational Resources for Medical LLMs Collection Curated medical LLM datasets and models for use in curricular content, particularly for medical professionals (e.g. medical students). • 15 items • Updated Dec 1, 2023 • 4
Healthcare Bias Eval Datasets Collection Benchmarks and other datasets that can be used to evaluate bias in healthcare settings. • 5 items • Updated Dec 9, 2023 • 1
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9 • 52