qwerrwe / src / axolotl / core

Commit History

fix: add lr scheduler kwargs to Trainer (#972)

13e9381
unverified

Nanobit committed on Dec 17, 2023

fix: switch to using the HuggingFace Transformers NEFT implementation (#941)

ef24342
unverified

dg-kalle committed on Dec 13, 2023

support for mamba (#915)

40a6362
unverified

winglian committed on Dec 9, 2023

Feat(wandb): Refactor to be more flexible (#767)

a1da39c
unverified

Nanobit committed on Dec 4, 2023

feature: loss watchdog for terminating training runs that are failing (#899)

58ec8b1
unverified

user735 Karl-Johan Alm committed on Dec 4, 2023

Feat: Add warmup_ratio (#893)

fb12895
unverified

Nanobit committed on Nov 25, 2023

don't train if eval split is too small (#873)

797f3dd
unverified

winglian committed on Nov 16, 2023

various bugfixes (#856)

1470650
unverified

winglian committed on Nov 15, 2023

cleanup the old multipack dataloader (#841)

1a6309c
unverified

winglian committed on Nov 12, 2023

multipack w batch sampler (#795)

641e6f7
unverified

winglian committed on Nov 8, 2023

Threaded MultipackDistributedDataloader with prefetched samples (#759)

05bd6f1
unverified

casperhansen committed on Oct 26, 2023

refactor setup trainer so we can add more hooks (#773)

6c81c61
unverified

winglian committed on Oct 23, 2023
