Commit History

fix: add lr scheduler kwargs to Trainer (#972)
13e9381
unverified

Nanobit commited on

fix: switch to using the HuggingFace Transformers NEFT implementation (#941)
ef24342
unverified

dg-kalle commited on

support for mamba (#915)
40a6362
unverified

winglian commited on

Feat(wandb): Refactor to be more flexible (#767)
a1da39c
unverified

Nanobit commited on

feature: loss watchdog for terminating training runs that are failing (#899)
58ec8b1
unverified

user735 Karl-Johan Alm commited on

Feat: Add warmup_ratio (#893)
fb12895
unverified

Nanobit commited on

don't train if eval split is too small (#873)
797f3dd
unverified

winglian commited on

various bugfixes (#856)
1470650
unverified

winglian commited on

cleanup the old multipack dataloader (#841)
1a6309c
unverified

winglian commited on

multipack w batch sampler (#795)
641e6f7
unverified

winglian commited on

Threaded MultipackDistributedDataloader with prefetched samples (#759)
05bd6f1
unverified

casperhansen commited on

refactor setup trainer so we can add more hooks (#773)
6c81c61
unverified

winglian commited on