Commits · Dovakiins/qwerrwe

update table for rwkv4 support, fix process count for dataset (#822)

cdc71f7
unverified

winglian commited on Nov 5, 2023

Correct typos in datasets.py (#639)

d1236f2
unverified

felixonmars commited on Sep 27, 2023

split completion text to sequence_len (#616)

97d3776
unverified

winglian commited on Sep 22, 2023

Attention mask and position id fixes for packing (#285)

2bb0b78
unverified

winglian commited on Aug 12, 2023

feat: use multi-core

45ac7c4

Nanobit commited on Jul 19, 2023

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var

b1f4f7a

theobjectivedad commited on Jul 15, 2023

Adding logging enhancement

553a86b

theobjectivedad commited on Jul 14, 2023

pylint for duplicated code for system prompts

7b57ed7

winglian commited on Jun 18, 2023

add new sharegpt, refactor prompt so it can be customized later, add exception if no data is processed

aac4b76

winglian commited on Jun 11, 2023

fix packing so that concatenated sequences reset the attention

9b8585d

winglian commited on May 31, 2023

Apply isort then black

37293dc

Nanobit commited on May 29, 2023

Lint datasets

6abb7f6

Nanobit commited on May 29, 2023

Lint and format

392dfd9

Nanobit commited on May 28, 2023

fix new dataset prompt tokenizers

0f74464

winglian commited on May 21, 2023

black formatting

2bc1a5b

winglian commited on May 10, 2023

various bugfixes

94f5e41

winglian commited on Apr 19, 2023

casts the prepared data to int16 (doesn't help with training memory)

2db9436

winglian commited on Apr 18, 2023

4bit quantized support (wip)

77fca25

winglian commited on Apr 17, 2023

various bugfixes

80b2ed2

winglian commited on Apr 15, 2023

black formatting

a6028d3

winglian commited on Apr 14, 2023

make it work with pythia in the cloud

8d959a7

winglian commited on Apr 14, 2023

WIP for axolotl trainer

ce24f5e

winglian commited on Apr 14, 2023

Spaces:

Dovakiins
/

qwerrwe

Build error

Commit History

update table for rwkv4 support, fix process count for dataset (#822)

cdc71f7
unverified

Correct typos in datasets.py (#639)

d1236f2
unverified

split completion text to sequence_len (#616)

97d3776
unverified

Attention mask and position id fixes for packing (#285)

2bb0b78
unverified

feat: use multi-core

45ac7c4

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var

b1f4f7a

Adding logging enhancement

553a86b

pylint for duplicated code for system prompts

7b57ed7

add new sharegpt, refactor prompt so it can be customized later, add exception if no data is processed

aac4b76

fix packing so that concatenated sequences reset the attention

9b8585d

Apply isort then black

37293dc

Lint datasets

6abb7f6

Lint and format

392dfd9

fix new dataset prompt tokenizers

0f74464

black formatting

2bc1a5b

various bugfixes

94f5e41

casts the prepared data to int16 (doesn't help with training memory)

2db9436

4bit quantized support (wip)

77fca25

various bugfixes

80b2ed2

black formatting

a6028d3

make it work with pythia in the cloud

8d959a7

WIP for axolotl trainer

ce24f5e

Commit History

update table for rwkv4 support, fix process count for dataset (#822) cdc71f7 unverified

Correct typos in datasets.py (#639) d1236f2 unverified

split completion text to sequence_len (#616) 97d3776 unverified

Attention mask and position id fixes for packing (#285) 2bb0b78 unverified

feat: use multi-core 45ac7c4

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var b1f4f7a

Adding logging enhancement 553a86b

pylint for duplicated code for system prompts 7b57ed7

add new sharegpt, refactor prompt so it can be customized later, add exception if no data is processed aac4b76

fix packing so that concatenated sequences reset the attention 9b8585d

Apply isort then black 37293dc

Lint datasets 6abb7f6

Lint and format 392dfd9

fix new dataset prompt tokenizers 0f74464

black formatting 2bc1a5b

various bugfixes 94f5e41

casts the prepared data to int16 (doesn't help with training memory) 2db9436

4bit quantized support (wip) 77fca25

various bugfixes 80b2ed2

black formatting a6028d3

make it work with pythia in the cloud 8d959a7

WIP for axolotl trainer ce24f5e

update table for rwkv4 support, fix process count for dataset (#822)

cdc71f7
unverified

Correct typos in datasets.py (#639)

d1236f2
unverified

split completion text to sequence_len (#616)

97d3776
unverified

Attention mask and position id fixes for packing (#285)

2bb0b78
unverified

feat: use multi-core

45ac7c4

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var

b1f4f7a

Adding logging enhancement

553a86b

pylint for duplicated code for system prompts

7b57ed7

add new sharegpt, refactor prompt so it can be customized later, add exception if no data is processed

aac4b76

fix packing so that concatenated sequences reset the attention

9b8585d

Apply isort then black

37293dc

Lint datasets

6abb7f6

Lint and format

392dfd9

fix new dataset prompt tokenizers

0f74464

black formatting

2bc1a5b

various bugfixes

94f5e41

casts the prepared data to int16 (doesn't help with training memory)

2db9436

4bit quantized support (wip)

77fca25

various bugfixes

80b2ed2

black formatting

a6028d3

make it work with pythia in the cloud

8d959a7

WIP for axolotl trainer

ce24f5e