Re-quantize and re-upload model

#1
by mtasic85 - opened

@bartowski llama.cpp fixed issue with BOS/EOS tokens, and all models need to be re-quantized and re-uploaded.

Please check https://github.com/ggerganov/llama.cpp/issues/9315

Additionally, please quantize RWKV 6 1b6, 3b and 14b models :)

Will do thanks for the info!

@mtasic85 it's up!

bartowski changed discussion status to closed

Sign up or log in to comment