bullerwins commited on
Commit
c6e5a43
1 Parent(s): df24c4e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -4
README.md CHANGED
@@ -18,12 +18,11 @@ license: llama3.1
18
 
19
  ---
20
 
21
- GGUF quantized version using llama.cpp
22
 
23
- While it works, it still needs proper [RoPE support](https://github.com/ggerganov/llama.cpp/issues/8650)
24
- I will requant once merged
25
 
26
- Update 24/07 - requanted with [fixed tokenizer ](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct/discussions/28/files)
27
 
28
  ## Model Information
29
 
 
18
 
19
  ---
20
 
21
+ GGUF quantized version using [llama.cpp 5e2727f](https://github.com/ggerganov/llama.cpp/commit/5e2727fe0321c38d1664d26173c654fa1801dc5f)
22
 
23
+ Update 24/07 - requantized with [fixed tokenizer ](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct/discussions/28/files)
 
24
 
25
+ Update 28/07 - requantized with the [RoPE fix](https://github.com/ggerganov/llama.cpp/pull/8676), it should now be fully supported. You need to run [llama.cpp 5e2727f](https://github.com/ggerganov/llama.cpp/commit/5e2727fe0321c38d1664d26173c654fa1801dc5f) or higher
26
 
27
  ## Model Information
28