qwerrwe / README.md

Commit History

let hf trainer handle torch compile (#516)
a4e1bb6
unverified

winglian tmm1 commited on

Add training callback to send predictions to WandB table (#521)
5b67ea9
unverified

Glavin001 commited on

document that packaging needs to be installed before flash-attn (#559)
9845c5e
unverified

winglian commited on

ergonomic update to optimizer config doc (#548)
6d57f2f
unverified

The Objective Dad commited on

update readme to point to direct link to runpod template, cleanup install instrucitons (#532)
34c0a86
unverified

winglian commited on

Adding NCCL Timeout Guide (#536)
5e2d8a4
unverified

The Objective Dad commited on

Fix(doc): Inform Windows users to use WSL/docker (#518)
f51c9c5
unverified

Nanobit commited on

Added advanced DDP args (#515)
396a7a7
unverified

Jan Philipp Harries Jan Philipp Harries commited on

support for datasets with multiple names (#480)
5ac3392
unverified

winglian commited on

Fix(doc): Clarify no amp to full yaml docs (#496)
48c5647
unverified

Nanobit commited on

pad_to_worst_case_seq_len boolean, for testing memory limits (#498)
8e197f6
unverified

Birch-san tmm1 commited on

Feat(doc): Update eval_steps doc (#487)
ad8be43
unverified

Nanobit commited on

ReLoRA implementation (with quantization) (#322)
bde3c5a
unverified

chargoddard winglian commited on

Fix(doc): Clarify config (#466)
55c23c7
unverified

Nanobit commited on

feat: add Metharme prompt strategy (#446)
f474650
unverified

TearGosling Nanobit commited on

feat(docs): improve user customized prompts (#443)
04a42b6
unverified

Nanobit commited on

feat(doc): add pillow to lambda instructions (#445)
919f4ca
unverified

Nanobit commited on

support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348)
d2e7f27
unverified

winglian commited on

docs(readme): add `cd axolotl` (#440)
d21318d
unverified

philpax commited on

use save_strategy from config if available (#434)
b3f5e00
unverified

winglian commited on

flash attn pip install (#426)
cf66547
unverified

mhenrichsen Ubuntu mhenrichsen Mads Henrichsen winglian commited on

Fix(docs): Remove gptq+lora and fix xformer compat list (#423)
3d1f203
unverified

Nanobit commited on

hopefully improve the README (#419)
2495909
unverified

winglian commited on

Merge pull request #413 from mhenrichsen/chore/update-deepseed-config
f806e86
unverified

mhenrichsen commited on

Feat(doc): Add lr_quadratic_warmup to readme (#412)
2b990eb
unverified

Nanobit commited on

update path to align with fsdp example
bd8cab4

mhenrichsen commited on

Fix(config): Update handling of deepspeed config (#404)
c01015f
unverified

Nanobit commited on

Fix(docs): Update flash attn requirements (#409)
72fe3f8
unverified

Nanobit commited on

update docs for tokenizer_legacy (#401)
47961fd
unverified

winglian commited on

add templates, CoC and contributing guide (#126)
31db0ec
unverified

lightningRalf winglian Nanobit commited on

Feat(doc): Add how to save by epochs (#396)
be294fd
unverified

Nanobit commited on

Feat(doc): Add max_steps to readme (#389)
41ecb45
unverified

Nanobit commited on

Feat(config): Add hub_strategy (#386)
73a0b6e
unverified

Nanobit commited on

Feat(doc): Improve sharegpt doc (#378)
729c299
unverified

Nanobit commited on

Attention mask and position id fixes for packing (#285)
2bb0b78
unverified

winglian commited on

Add wandb_entity to wandb options, update example configs, update README (#361)
7019509
unverified

Morgan McGuire Morgan McGuire winglian commited on

Feat: Add rope scaling (#343)
b521206
unverified

Nanobit commited on

Update README.md on pretraining_dataset (#360)
fae6ed8
unverified

Nanobit commited on

Clarify pre-tokenize before multigpu (#359)
94d03c8
unverified

Nanobit commited on

note pattern when using groups
b4d1d22

tmm1 commited on

update comment for group_by_length
9f99104

tmm1 commited on

python 3.10 and 3.11 both work fine, as does pytorch 2.1.0.dev
58d6659

tmm1 commited on

there is no configs folder
cc7e800

tmm1 commited on

update README for updated docker images (#328)
41a4d15
unverified

winglian commited on

Merge pull request #306 from ethanhs/xgen
dcdec44
unverified

winglian commited on

don't resize embeddings to multiples of 32x by default
1066751

winglian commited on

Add XGen info to README and example config
3881143

ethanhs commited on

Fix(readme): Improve wording for push model
165907f
unverified

Nanobit commited on

fix(readme): remove accelerate config
b64f411
unverified

Nanobit commited on

Merge pull request #279 from NanoCode012/feat/multi-gpu-readme
469c08c
unverified

winglian commited on