Commit History

fix(examples): remove is_*_derived as it's parsed automatically (#1297)
a7a9a14
unverified

Nanobit commited on

Add seq2seq eval benchmark callback (#1274)
5a5d474
unverified

LeonardoEmili commited on

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci]
782b6a4
unverified

winglian Nanobit commited on

change val size (#992)
93ebec1
unverified

mhenrichsen commited on

new evals_per_epoch and saves_per_epoch to make things cleaner (#944)
5f79b82
unverified

winglian commited on

Feat(wandb): Refactor to be more flexible (#767)
a1da39c
unverified

Nanobit commited on

feature: loss watchdog for terminating training runs that are failing (#899)
58ec8b1
unverified

user735 Karl-Johan Alm commited on

don't compile deepspeed or bitsandbytes from source (#837)
f544ab2
unverified

winglian commited on

fix eval_steps to be a sane default (#797)
8b79ff0
unverified

winglian commited on

disable eval table w sample packing in examples (#778)
9b43e7e
unverified

winglian commited on

simplify by removing duplicate base_model_config (#772)
2d8def6
unverified

winglian commited on

Fix: lowercase `True` values in config (#713)
ace70b3
unverified

atgctg commited on

Get qlora mistral-7b fine tuning working on a single 4090 (#708)
295b266
unverified

lukemarsden commited on

Fix: Higher vram usage for mistral and sample_packing (#691)
669f1d0
unverified

Nanobit commited on

Adding qlora config for Mistral (#675)
d4a88e4
unverified

Abhishek Mishra commited on