qwerrwe / tests /e2e /patched

Commit History

relora: magnitude pruning of the optimizer (#1245)
8c2e05a
unverified

winglian commited on

support for true batches with multipack (#1230)
00568c1
unverified

winglian commited on

Phi2 multipack (#1173)
814aee6
unverified

winglian commited on

Falcon embeddings (#1149) [skip docker]
e799e08
unverified

winglian commited on

Multipack simplify for Mixtral (#1142)
6910e6a
unverified

winglian commited on

Add shifted sparse attention (#973) [skip-ci]
1d70f24
unverified

jrc joecummings winglian commited on

attempt to also run e2e tests that needs gpus (#1070)
788649f
unverified

winglian commited on