qwerrwe / configs /pythia_1_2B_alpaca.yml

Commit History

4bit quantized support (wip)
77fca25

winglian commited on

deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
d1aed4c

winglian commited on

more logging, wandb fixes
05fffb5

winglian commited on

improve prepared dataset loading, fix inference
b164725

winglian commited on

more fixes and prep for llama training
949a27b

winglian commited on

make it work with pythia in the cloud
8d959a7

winglian commited on

WIP for axolotl trainer
ce24f5e

winglian commited on