---
base_model:
- davidkim205/nox-solar-10.7b-v2
- chihoonlee10/T3Q-ko-solar-dpo-v6.0
library_name: transformers
tags:
- mergekit
- merge
---

# model_storage

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the SLERP (spherical linear interpolation) merge method. SLERP interpolates along the arc between the two models' weight tensors rather than averaging them linearly, which better preserves weight magnitudes; the `t` parameter controls how far the result moves from the base model toward the other model (see the sketch at the end of this card).

### Models Merged

The following models were included in the merge:

* [davidkim205/nox-solar-10.7b-v2](https://huggingface.co/davidkim205/nox-solar-10.7b-v2)
* [chihoonlee10/T3Q-ko-solar-dpo-v6.0](https://huggingface.co/chihoonlee10/T3Q-ko-solar-dpo-v6.0)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
base_model:
  model:
    path: chihoonlee10/T3Q-ko-solar-dpo-v6.0
dtype: float16
merge_method: slerp
parameters:
  t:
  - filter: self_attn
    value: [0.0, 0.5, 0.3, 0.7, 1.0]
  - filter: mlp
    value: [1.0, 0.5, 0.7, 0.3, 0.0]
  - value: 0.5
slices:
- sources:
  - layer_range: [0, 47]
    model:
      model:
        path: chihoonlee10/T3Q-ko-solar-dpo-v6.0
  - layer_range: [0, 47]
    model:
      model:
        path: davidkim205/nox-solar-10.7b-v2
```

### Evaluation Results

| Model            | **Average** | HellaSwag | COPA     | BoolQ    |
|------------------|-------------|-----------|----------|----------|
| KoGPT            | 58.2        | 55.9      | 73.5     | 45.1     |
| Polyglot-ko-13B  | 62.4        | **59.5**  | **79.4** | 48.2     |
| LLaMA 2-13B      | 45.2        | 41.3      | 59.3     | 34.9     |
| Baichuan 2-13B   | 52.7        | 39.2      | 60.6     | 58.4     |
| QWEN-14B         | 47.8        | 45.3      | 64.9     | 33.4     |
| Orion-14B-Chat   | 68.8        | 47.0      | 77.7     | 81.6     |
| Ocelot-ko-10.8B  | **72.5**    | 50.0      | 75.8     | **91.7** |
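
### SLERP Sketch

For readers unfamiliar with SLERP, below is a minimal PyTorch sketch of the interpolation that a `slerp` merge performs on each pair of weight tensors. It is illustrative only: mergekit's actual implementation differs in detail (normalization, dtype handling, and the per-filter `t` schedules from the configuration above).

```python
import torch

def slerp(t: float, a: torch.Tensor, b: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Spherical linear interpolation between two weight tensors.

    t=0 returns `a`, t=1 returns `b`; intermediate values move along the
    great-circle arc between the two (flattened) weight vectors.
    """
    a_flat = a.flatten().float()
    b_flat = b.flatten().float()
    # Angle between the two weight vectors.
    dot = torch.clamp(
        (a_flat @ b_flat) / (a_flat.norm() * b_flat.norm() + eps), -1.0, 1.0
    )
    omega = torch.arccos(dot)
    if omega.abs() < eps:
        # Nearly colinear vectors: SLERP degenerates to plain linear interpolation.
        merged = (1.0 - t) * a_flat + t * b_flat
    else:
        sin_omega = torch.sin(omega)
        merged = (
            torch.sin((1.0 - t) * omega) / sin_omega * a_flat
            + torch.sin(t * omega) / sin_omega * b_flat
        )
    return merged.reshape(a.shape).to(a.dtype)
```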
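
### Usage

A minimal sketch of loading and prompting the merged model with 🤗 Transformers. The repository id below is hypothetical (taken from this card's title); substitute the actual Hub path of this merge.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "model_storage"  # hypothetical id from the card title; replace with the real Hub path

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",  # keep the float16 weights produced by the merge
    device_map="auto",   # requires `accelerate`
)

prompt = "한국어로 자기소개를 해 주세요."  # "Please introduce yourself in Korean."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```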