arnocandel commited on
Commit
24c6af7
1 Parent(s): 959320a

commit files to HF hub

Browse files
Files changed (1) hide show
  1. README.md +16 -2
README.md CHANGED
@@ -144,9 +144,23 @@ RWConfig {
144
  Model validation results using [EleutherAI lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness).
145
 
146
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
147
 
148
- TBD
149
-
150
 
151
  ## Disclaimer
152
 
 
144
  Model validation results using [EleutherAI lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness).
145
 
146
 
147
+ [eval source code](https://github.com/h2oai/h2ogpt/issues/216#issuecomment-1579573101)
148
+
149
+ | Task |Version| Metric |Value | |Stderr|
150
+ |-------------|------:|--------|-----:|---|-----:|
151
+ |arc_challenge| 0|acc |0.4957|± |0.0146|
152
+ | | |acc_norm|0.5324|± |0.0146|
153
+ |arc_easy | 0|acc |0.8140|± |0.0080|
154
+ | | |acc_norm|0.7837|± |0.0084|
155
+ |boolq | 1|acc |0.8297|± |0.0066|
156
+ |hellaswag | 0|acc |0.6490|± |0.0048|
157
+ | | |acc_norm|0.8293|± |0.0038|
158
+ |openbookqa | 0|acc |0.3780|± |0.0217|
159
+ | | |acc_norm|0.4740|± |0.0224|
160
+ |piqa | 0|acc |0.8248|± |0.0089|
161
+ | | |acc_norm|0.8362|± |0.0086|
162
+ |winogrande | 0|acc |0.7837|± |0.0116|
163
 
 
 
164
 
165
  ## Disclaimer
166