qwerrwe / scripts / finetune.py

Commit History

support for alpaca-like instruction datasets without inputs

e107643

winglian committed on Apr 18, 2023

casts the prepared data to int16 (doesn't help with training memory)

2db9436

winglian committed on Apr 18, 2023

bugfixes

120e7df

winglian committed on Apr 17, 2023

fix lora target module, require explicit flash attention, fix min logging steps, don't use adam8bit for int4, hash prepared datasets, support hf hub datasets

87e073d

winglian committed on Apr 17, 2023

4bit quantized support (wip)

77fca25

winglian committed on Apr 17, 2023

cleanup, prep for 4bit quant support

12de7b7

winglian committed on Apr 16, 2023

deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches

d1aed4c

winglian committed on Apr 16, 2023

fix logging

a459383

winglian committed on Apr 16, 2023

prepare datasets only flag

2393801

winglian committed on Apr 15, 2023

configure log level, add llama 7b config

d33a975

winglian committed on Apr 15, 2023

more logging, wandb fixes

05fffb5

winglian committed on Apr 15, 2023

refactor trainer setup to account for deepspeed integration

2df63ef

winglian committed on Apr 15, 2023

improve prepared dataset loading, fix inference

b164725

winglian committed on Apr 15, 2023

helpful info output

937f44f

winglian committed on Apr 15, 2023

fix issue with completed model being empty

902dd0a

winglian committed on Apr 15, 2023

various bugfixes

80b2ed2

winglian committed on Apr 15, 2023

better handling of llama model import

45f77dd

winglian committed on Apr 14, 2023

more fixes and prep for llama training

949a27b

winglian committed on Apr 14, 2023

config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes

f2a2029

winglian committed on Apr 14, 2023

black formatting

a6028d3

winglian committed on Apr 14, 2023

make it work with pythia in the cloud

8d959a7

winglian committed on Apr 14, 2023

WIP for axolotl trainer

ce24f5e

winglian committed on Apr 14, 2023