qwerrwe / src

Commit History

fix DefaultDict.__or__
a13e45d

tmm1 commited on

Attention mask and position id fixes for packing (#285)
2bb0b78
unverified

winglian commited on

Add wandb_entity to wandb options, update example configs, update README (#361)
7019509
unverified

Morgan McGuire Morgan McGuire winglian commited on

Fix(model loading): Warn when model revision is passed to gptq (#364)
96bd6ae
unverified

Nanobit commited on

Fix(message): Improve error message for bad format (#365)
e37d935
unverified

Nanobit commited on

Feat: Add rope scaling (#343)
b521206
unverified

Nanobit commited on

Merge pull request #356 from tmm1/load_model-args
11ddccb
unverified

tmm1 commited on

simplify load_model signature
7181022

tmm1 commited on

log GPU memory usage
e303d64

tmm1 commited on

ensure enable_input_require_grads is called on model before getting the peft model (#345)
176b888
unverified

winglian commited on

experimental llama 2 chat support (#296)
3392270
unverified

Jan Philipp Harries Jan Philipp Harries commited on

Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339)
10405b9
unverified

ssmi153 commited on

Added Orca Mini prompt strategy (#263)
c93655c
unverified

Jan Philipp Harries Jan Philipp Harries commited on

optimize the iteration when tokenizeing large datasets (#332)
fe28543
unverified

winglian commited on

fix typo
2eda9e0

tmm1 commited on

scope flash-attn+qlora fix correctly, scope to llama, add comment
78b9efb

tmm1 commited on

move flash-attn monkey patch alongside the others
312a9fa

tmm1 commited on

ensure flash-attn fixes happen in both adapter/lora modes, and use torch_dtype
248bf90

tmm1 commited on

qlora w flash attention fixes (#333)
77085ea
unverified

winglian commited on

add peft install back since it doesn't get installed by setup.py (#331)
db2a358
unverified

winglian commited on

update prompts for open orca to match the paper (#317)
3d4984b
unverified

winglian commited on

Merge pull request #307 from OpenAccess-AI-Collective/xgen-user-sharegpt-tokens
40a53ff
unverified

winglian commited on

Merge pull request #313 from OpenAccess-AI-Collective/tokenizer-llama2-embeddings
3ffb018
unverified

winglian commited on

don't resize embeddings to multiples of 32x by default
1066751

winglian commited on

better handling since xgen tokenizer breaks with convert_tokens_to_ids
2a428e8

winglian commited on

flash attention 2
9b790d3

winglian commited on

fix sdp attention to use the flash/mem-efficient context manaager
a032c9f

winglian commited on

feat: use multi-core
45ac7c4

Nanobit commited on

fix axolotl training args dataclass annotation
ebaec3c

winglian commited on

misc fixes
d75adb9

winglian commited on

Merge pull request #276 from theobjectivedad/logging_enhancement
6f16c45
unverified

winglian commited on

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var
b1f4f7a

theobjectivedad commited on

Merge branch 'OpenAccess-AI-Collective:main' into logging_enhancement
83237b8
unverified

The Objective Dad commited on

Add ability to pass 'name' argument to load_dataset
88089e8

chargoddard commited on

Merge pull request #274 from OpenAccess-AI-Collective/NanoCode012-patch-2
168a7a0
unverified

Nanobit commited on

Update log message format, IMO this is easier to read.
9234b75

theobjectivedad commited on

Feat: Add save_safetensors
5491278

Nanobit commited on

Set push to hub as private by default
1514739
unverified

Nanobit commited on

support for loading a model by git revision
69a2350

winglian commited on

Merge branch 'main' into quadratic-warmup
c4cf567
unverified

winglian commited on

better configuration for quadratic warmup
c49729d

winglian commited on

params are adam_*, not adamw_*
19cf0bd

winglian commited on

skip explicit model type too if using trust_remote_code
d69da99

winglian commited on

don't use llama if trust_remote_code is set since that needs to use AutoModel path
66afb76

winglian commited on

Merge pull request #221 from utensil/local_dataset
b9b7d4c
unverified

winglian commited on

Fix future deprecation push_to_hub_model_id
e79c8e6

Nanobit commited on

Merge pull request #255 from OpenAccess-AI-Collective/open-orca-prompts
1e5014a
unverified

winglian commited on

Merge pull request #246 from OpenAccess-AI-Collective/sys-prompts-instruct
4066c78
unverified

winglian commited on

open orca support
78a1e1f

winglian commited on