qwerrwe / configs /gpt_neox_20b.yml

Commit History

swap batch size for gradient accumulation steps to decouple from num gpu
c2a0792

winglian commited on

Update wandb_log_model on gpt_neox_20b.yml
84fc217
unverified

Viktorius Suwandi commited on

WIP large refactor to make finetune script a little more manageable (#3)
6045345
unverified

winglian commited on