Commits · Dovakiins/qwerrwe

fix tokenizer loading, got openllama 3b working

e396654

winglian commited on May 25, 2023

fixes w/ example for super basic lora starter

a5d739b

winglian commited on May 25, 2023

Merge pull request #55 from OpenAccess-AI-Collective/missing-validation-file

de2a733
unverified

winglian commited on May 25, 2023

add missing file

1d7da3b

winglian commited on May 25, 2023

stray s

f523a08

winglian commited on May 25, 2023

cfg.cfg fix, also de-dupe lora module list

676d7da

winglian commited on May 25, 2023

fix tuple add to list

a8771b0

winglian commited on May 25, 2023

Update src/axolotl/utils/models.py

1cf21da
unverified

winglian

Nanobit commited on May 25, 2023

attempt to find linear modules for qlora

ffd1043

winglian commited on May 25, 2023

apply black formatting

ce34d64

winglian commited on May 25, 2023

Merge branch 'main' of github.com:OpenAccess-AI-Collective/axolotl into dev

ce694e2

winglian commited on May 25, 2023

remove un-needed code, add validation

1f5d83e

winglian commited on May 25, 2023

fix: handles AutoTokenizer from untrusted source

88ad05d
unverified

Valentin De Matos commited on May 24, 2023

more qlora support

e8aacfb

winglian commited on May 24, 2023

prepare does all this already for qlora?

b9d07aa

winglian commited on May 23, 2023

integrate qlora? maybe?

3b4d055

winglian commited on May 23, 2023

fix missing fp16 kwarg

2ae936f

winglian commited on May 24, 2023

fix enum pass as value

fb100a9

winglian commited on May 23, 2023

Add qa style data for alpaca instructions, fix one_cycle scheduler

3a50377

winglian commited on May 23, 2023

don't need to set here

de6da13

winglian commited on May 22, 2023

be able to use adam bnb 8bit and one cycle scheduler w fsdp

9493b1b

winglian commited on May 22, 2023

Update src/axolotl/utils/models.py for info msg

1b3e401
unverified

winglian

Nanobit commited on May 22, 2023

Update src/axolotl/utils/data.py for spelling

98a6781
unverified

winglian

Nanobit commited on May 22, 2023

make sure to use train split if loading from hf

607a4d3

winglian commited on May 22, 2023

make one cycle lr div factor configurable

99383f1

winglian commited on May 22, 2023

fix new dataset prompt tokenizers

0f74464

winglian commited on May 21, 2023

add missing init

e0602a9

winglian commited on May 21, 2023

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too

2809f3f

winglian commited on May 21, 2023

tokenization fixes

4ea9a66

winglian commited on May 21, 2023

optionally be able to specify alpaca or chat style prompts

1d5ab84

winglian commited on May 20, 2023

Set `half` using `cfg.fp16` for 4bit

641f801
unverified

Nanobit commited on May 19, 2023

concise multiple choice and tldr summarize

1365073

winglian commited on May 17, 2023

support for replit lm

8c2f3cb

winglian commited on May 17, 2023

add alpaca multiple choice instruct dataset support

b46bc02

winglian commited on May 17, 2023

Add `lora_modules_to_save`

2c73c81
unverified

Nanobit commited on May 16, 2023

fix prompters, especially the sharegpt prompter

5e37144

winglian commited on May 16, 2023

more fixes

bdbca8f

winglian commited on May 15, 2023

more fixes

42410c7

winglian commited on May 14, 2023

fix torch_dtype for model load

aef00b6

winglian commited on May 14, 2023

move filter to before saving so it doesn't happen everytime, update runpod manual script

0d28df0

winglian commited on May 14, 2023

whoops, gt vs lt

84c7bc4

winglian commited on May 12, 2023

optimize dataloading to use cache, fix model token embedding sizes

aa3c3f9

winglian commited on May 12, 2023

Merge branch 'main' into patch-2

89b7f26
unverified

Nanobit commited on May 11, 2023

black formatting

2bc1a5b

winglian commited on May 10, 2023

various fixes

7a490a4

winglian commited on May 10, 2023

Fix Trainer() got multiple values for keyword argument 'callbacks'

813aab3
unverified

Nanobit commited on May 10, 2023

testing mpt triton

e2e68c3

winglian commited on May 10, 2023

fix conditional so alpaca doesn't choke

a27d594

winglian commited on May 10, 2023

Rename variable to use same convention

174b74d

Nanobit commited on May 8, 2023

Add CompletionPrompt type

cf68153

Nanobit commited on May 8, 2023

Commit History

fix tokenizer loading, got openllama 3b working e396654

fixes w/ example for super basic lora starter a5d739b

Merge pull request #55 from OpenAccess-AI-Collective/missing-validation-file de2a733 unverified

add missing file 1d7da3b

stray s f523a08

cfg.cfg fix, also de-dupe lora module list 676d7da

fix tuple add to list a8771b0

Update src/axolotl/utils/models.py 1cf21da unverified

attempt to find linear modules for qlora ffd1043

apply black formatting ce34d64

Merge branch 'main' of github.com:OpenAccess-AI-Collective/axolotl into dev ce694e2

remove un-needed code, add validation 1f5d83e

fix: handles AutoTokenizer from untrusted source 88ad05d unverified

more qlora support e8aacfb

prepare does all this already for qlora? b9d07aa

integrate qlora? maybe? 3b4d055

fix missing fp16 kwarg 2ae936f

fix enum pass as value fb100a9

Add qa style data for alpaca instructions, fix one_cycle scheduler 3a50377

don't need to set here de6da13

be able to use adam bnb 8bit and one cycle scheduler w fsdp 9493b1b

Update src/axolotl/utils/models.py for info msg 1b3e401 unverified

Update src/axolotl/utils/data.py for spelling 98a6781 unverified

make sure to use train split if loading from hf 607a4d3

make one cycle lr div factor configurable 99383f1

fix new dataset prompt tokenizers 0f74464

add missing __init__ e0602a9

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too 2809f3f

tokenization fixes 4ea9a66

optionally be able to specify alpaca or chat style prompts 1d5ab84

Set `half` using `cfg.fp16` for 4bit 641f801 unverified

concise multiple choice and tldr summarize 1365073

support for replit lm 8c2f3cb

add alpaca multiple choice instruct dataset support b46bc02

Add `lora_modules_to_save` 2c73c81 unverified

fix prompters, especially the sharegpt prompter 5e37144

more fixes bdbca8f

more fixes 42410c7

fix torch_dtype for model load aef00b6

move filter to before saving so it doesn't happen everytime, update runpod manual script 0d28df0

whoops, gt vs lt 84c7bc4

optimize dataloading to use cache, fix model token embedding sizes aa3c3f9

Merge branch 'main' into patch-2 89b7f26 unverified

black formatting 2bc1a5b

various fixes 7a490a4

Fix Trainer() got multiple values for keyword argument 'callbacks' 813aab3 unverified

testing mpt triton e2e68c3

fix conditional so alpaca doesn't choke a27d594

Rename variable to use same convention 174b74d

Add CompletionPrompt type cf68153

fix tokenizer loading, got openllama 3b working

e396654

fixes w/ example for super basic lora starter

a5d739b

Merge pull request #55 from OpenAccess-AI-Collective/missing-validation-file

de2a733
unverified

add missing file

1d7da3b

stray s

f523a08

cfg.cfg fix, also de-dupe lora module list

676d7da

fix tuple add to list

a8771b0

Update src/axolotl/utils/models.py

1cf21da
unverified

attempt to find linear modules for qlora

ffd1043

apply black formatting

ce34d64

Merge branch 'main' of github.com:OpenAccess-AI-Collective/axolotl into dev

ce694e2

remove un-needed code, add validation

1f5d83e

fix: handles AutoTokenizer from untrusted source

88ad05d
unverified

more qlora support

e8aacfb

prepare does all this already for qlora?

b9d07aa

integrate qlora? maybe?

3b4d055

fix missing fp16 kwarg

2ae936f

fix enum pass as value

fb100a9

Add qa style data for alpaca instructions, fix one_cycle scheduler

3a50377

don't need to set here

de6da13

be able to use adam bnb 8bit and one cycle scheduler w fsdp

9493b1b

Update src/axolotl/utils/models.py for info msg

1b3e401
unverified

Update src/axolotl/utils/data.py for spelling

98a6781
unverified

make sure to use train split if loading from hf

607a4d3

make one cycle lr div factor configurable

99383f1

fix new dataset prompt tokenizers

0f74464

add missing init

e0602a9

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too

2809f3f

tokenization fixes

4ea9a66

optionally be able to specify alpaca or chat style prompts

1d5ab84

Set `half` using `cfg.fp16` for 4bit

641f801
unverified

concise multiple choice and tldr summarize

1365073

support for replit lm

8c2f3cb

add alpaca multiple choice instruct dataset support

b46bc02

Add `lora_modules_to_save`

2c73c81
unverified

fix prompters, especially the sharegpt prompter

5e37144

more fixes

bdbca8f

more fixes

42410c7

fix torch_dtype for model load

aef00b6

move filter to before saving so it doesn't happen everytime, update runpod manual script

0d28df0

whoops, gt vs lt

84c7bc4

optimize dataloading to use cache, fix model token embedding sizes

aa3c3f9

Merge branch 'main' into patch-2

89b7f26
unverified

black formatting

2bc1a5b

various fixes

7a490a4

Fix Trainer() got multiple values for keyword argument 'callbacks'

813aab3
unverified

testing mpt triton

e2e68c3

fix conditional so alpaca doesn't choke

a27d594

Rename variable to use same convention

174b74d

Add CompletionPrompt type

cf68153