tsunemoto commited on
Commit
d30b578
1 Parent(s): eff300e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -50
README.md CHANGED
@@ -1,58 +1,13 @@
1
  ![](https://cdn-uploads.huggingface.co/production/uploads/6468ce47e134d050a58aa89c/ddzjZ1irvtLcDRCWei9vQ.png)
2
 
3
- ##GGUF's of Seraph-7B from Weyaxi
4
 
5
- https://huggingface.co/Weyaxi/Seraph-7B
6
 
7
- ##Original Model Card:
8
 
9
  Seraph-7B
10
  This is the model for Seraph-7B. I used mergekit to merge models.
11
 
12
- Prompt Templates
13
- You can use these prompt templates, but I recommend using ChatML.
14
-
15
- ChatML:
16
- <|im_start|>system
17
- {system}<|im_end|>
18
- <|im_start|>user
19
- {user}<|im_end|>
20
- <|im_start|>assistant
21
- {asistant}<|im_end|>
22
-
23
- System, User, Asistant Alpaca Style:
24
- ### System:
25
- {system}
26
- ### User:
27
- {user}
28
- ### Assistant:
29
-
30
- Yaml Config
31
- slices:
32
- - sources:
33
- - model: Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp
34
- layer_range: [0, 32]
35
- - model: Q-bert/MetaMath-Cybertron-Starling
36
- layer_range: [0, 32]
37
- merge_method: slerp
38
- base_model: mistralai/Mistral-7B-v0.1
39
- parameters:
40
- t:
41
- - filter: self_attn
42
- value: [0, 0.5, 0.3, 0.7, 1]
43
- - filter: mlp
44
- value: [1, 0.5, 0.7, 0.3, 0]
45
- - value: 0.5 # fallback for rest of tensors
46
- dtype: bfloat16
47
-
48
- Open LLM Leaderboard Evaluation Results
49
- Detailed results can be found here
50
-
51
- Metric Value
52
- Avg. 71.86
53
- ARC (25-shot) 67.83
54
- HellaSwag (10-shot) 86.22
55
- MMLU (5-shot) 65.07
56
- TruthfulQA (0-shot) 59.49
57
- Winogrande (5-shot) 80.66
58
- GSM8K (5-shot) 71.87
 
1
  ![](https://cdn-uploads.huggingface.co/production/uploads/6468ce47e134d050a58aa89c/ddzjZ1irvtLcDRCWei9vQ.png)
2
 
3
+ ## GGUF's of Seraph-7B from Weyaxi
4
 
5
+ Please see https://huggingface.co/Weyaxi/Seraph-7B for Full model card
6
 
7
+ ## Original Model Card:
8
 
9
  Seraph-7B
10
  This is the model for Seraph-7B. I used mergekit to merge models.
11
 
12
+ ## Prompt Templates
13
+ ChatML Recommended