Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
ea00dd0
qwerrwe
/
tests
/
e2e
/
patched
100 contributors
History:
7 commits
winglian
relora: magnitude pruning of the optimizer (#1245)
8c2e05a
unverified
8 months ago
__init__.py
pickle
0 Bytes
attempt to also run e2e tests that needs gpus (#1070)
9 months ago
test_4d_multipack_llama.py
3.78 kB
support for true batches with multipack (#1230)
8 months ago
test_falcon_samplepack.py
3.67 kB
Falcon embeddings (#1149) [skip docker]
8 months ago
test_fused_llama.py
2.25 kB
support for true batches with multipack (#1230)
8 months ago
test_llama_s2_attention.py
3.55 kB
Add shifted sparse attention (#973) [skip-ci]
8 months ago
test_lora_llama_multipack.py
4.18 kB
attempt to also run e2e tests that needs gpus (#1070)
9 months ago
test_mistral_samplepack.py
3.66 kB
relora: magnitude pruning of the optimizer (#1245)
8 months ago
test_mixtral_samplepack.py
3.64 kB
Falcon embeddings (#1149) [skip docker]
8 months ago
test_model_patches.py
3.15 kB
Multipack simplify for Mixtral (#1142)
8 months ago
test_phi_multipack.py
4 kB
Phi2 multipack (#1173)
8 months ago
test_resume.py
2.99 kB
attempt to also run e2e tests that needs gpus (#1070)
9 months ago