SpiridonSunRotator commited on
Commit
9df2164
1 Parent(s): 75d8ed3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -8,13 +8,13 @@ tags:
8
  - conversational
9
  - text-generation-inference
10
  ---
11
- Official [AQLM](https://arxiv.org/abs/2401.06118) quantization of [meta-llama/Meta-Llama-3-70B
12
- ](https://huggingface.co/meta-llama/Meta-Llama-3-70B).
13
 
14
  For this quantization, we used 1 codebook of 16 bits.
15
 
16
  Results (in progress):
17
  | Model | Quantization | Model size, Gb |
18
  |------|------|------|
19
- |meta-llama/Meta-Llama-3-70B | - | 141.2 |
20
  | | 1x16 | 21.9 |
 
8
  - conversational
9
  - text-generation-inference
10
  ---
11
+ Official [AQLM](https://arxiv.org/abs/2401.06118) quantization of [meta-llama/Meta-Llama-3-70B-Instruct
12
+ ](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct).
13
 
14
  For this quantization, we used 1 codebook of 16 bits.
15
 
16
  Results (in progress):
17
  | Model | Quantization | Model size, Gb |
18
  |------|------|------|
19
+ |meta-llama/Meta-Llama-3-70B-Instruct | - | 141.2 |
20
  | | 1x16 | 21.9 |