luow-amd commited on
Commit
3aa66e3
1 Parent(s): 6a6e7f7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -6,7 +6,7 @@ license: llama3.1
6
  This model was created by applying [Quark](https://quark.docs.amd.com/latest/index.html) with calibration samples from Pile dataset.
7
  - ## Quantization Stragegy
8
  - ***Quantized Layers***:All linear layers excluding "lm_head"
9
- - ***Weight***:FP8 symmetric per-tensor
10
  - ***Activation***: FP8 symmetric per-tensor
11
  - ***KV Cache***: FP8 symmetric per-tensor
12
  - ## Quick Start
 
6
  This model was created by applying [Quark](https://quark.docs.amd.com/latest/index.html) with calibration samples from Pile dataset.
7
  - ## Quantization Stragegy
8
  - ***Quantized Layers***:All linear layers excluding "lm_head"
9
+ - ***Weight***: FP8 symmetric per-tensor
10
  - ***Activation***: FP8 symmetric per-tensor
11
  - ***KV Cache***: FP8 symmetric per-tensor
12
  - ## Quick Start