Request: 2.75bpw

#2
by gtkunit - opened

Thanks for these quants. Unfortunately, they don't fit in 48 GB of VRAM. I'm bandwidth-limited, so it isn't feasible for me to quant and share them myself.
If the size matches the original Mistral Large Instruct, a 2.75bpw quant would be perfect for 48 GB of VRAM with about 12k context. The quality of the original version at that quant has been really good for me despite being so low, and I'm curious how this one compares.
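As a rough sanity check on why 2.75bpw lands nicely in 48 GB, here's a back-of-the-envelope sketch. The ~123B parameter count for Mistral Large is an assumption, and KV cache / activation overhead is ignored, so treat the numbers as ballpark only:

```python
# Rough VRAM estimate for a ~123B-parameter model (assumed size of
# Mistral Large) quantized to 2.75 bits per weight. Parameter count
# and bpw are illustrative assumptions; KV cache and activation
# overhead are not included.

PARAMS = 123e9   # assumed parameter count
BPW = 2.75       # bits per weight for the requested quant

weight_bytes = PARAMS * BPW / 8          # total bytes for weights
weight_gib = weight_bytes / 2**30        # convert to GiB

headroom_gib = 48 - weight_gib           # left over for context on 48 GB

print(f"weights: {weight_gib:.1f} GiB, headroom: {headroom_gib:.1f} GiB")
```

That leaves roughly 8-9 GiB free for the KV cache, which is in the ballpark of ~12k context for a model this size.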

Thanks for considering it :)

Sign up or log in to comment