tyzhu/lmind_nq_train6000_eval6489_v1_qa_3e-5_lora2

tyzhu committed on Jun 8

Commit 9cf6849 (1 parent: 6a00ffe)

Model save


Files changed (1)

README.md +61 -75


@@ -1,26 +1,13 @@
 ---
-license: other
-base_model: Qwen/Qwen1.5-4B
 tags:
 - generated_from_trainer
-datasets:
-- tyzhu/lmind_nq_train6000_eval6489_v1_qa
 metrics:
 - accuracy
 model-index:
 - name: lmind_nq_train6000_eval6489_v1_qa_3e-5_lora2
-  results:
-  - task:
-      name: Causal Language Modeling
-      type: text-generation
-    dataset:
-      name: tyzhu/lmind_nq_train6000_eval6489_v1_qa
-      type: tyzhu/lmind_nq_train6000_eval6489_v1_qa
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.5511794871794872
-library_name: peft
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -28,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # lmind_nq_train6000_eval6489_v1_qa_3e-5_lora2
-This model is a fine-tuned version of [Qwen/Qwen1.5-4B](https://huggingface.co/Qwen/Qwen1.5-4B) on the tyzhu/lmind_nq_train6000_eval6489_v1_qa dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.8024
-- Accuracy: 0.5512
 ## Model description
@@ -66,64 +53,63 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Accuracy |
-|:-------------:|:-------:|:----:|:---------------:|:--------:|
-| 1.963         | 0.9973  | 187  | 1.6439          | 0.5695   |
-| 1.6099        | 2.0     | 375  | 1.6183          | 0.5733   |
-| 1.524         | 2.9973  | 562  | 1.6164          | 0.5744   |
-| 1.3938        | 4.0     | 750  | 1.6376          | 0.5729   |
-| 1.2685        | 4.9973  | 937  | 1.6846          | 0.5699   |
-| 1.1591        | 6.0     | 1125 | 1.7547          | 0.5673   |
-| 1.0444        | 6.9973  | 1312 | 1.8395          | 0.5643   |
-| 0.9535        | 8.0     | 1500 | 1.9008          | 0.5613   |
-| 0.8235        | 8.9973  | 1687 | 2.0268          | 0.5592   |
-| 0.7635        | 10.0    | 1875 | 2.0937          | 0.5568   |
-| 0.6978        | 10.9973 | 2062 | 2.1558          | 0.5570   |
-| 0.6615        | 12.0    | 2250 | 2.2400          | 0.5552   |
-| 0.6262        | 12.9973 | 2437 | 2.2687          | 0.5556   |
-| 0.5958        | 14.0    | 2625 | 2.3582          | 0.5537   |
-| 0.5778        | 14.9973 | 2812 | 2.3960          | 0.5534   |
-| 0.5661        | 16.0    | 3000 | 2.4322          | 0.5534   |
-| 0.5277        | 16.9973 | 3187 | 2.4828          | 0.5515   |
-| 0.5211        | 18.0    | 3375 | 2.5106          | 0.5516   |
-| 0.5189        | 18.9973 | 3562 | 2.5706          | 0.5515   |
-| 0.5166        | 20.0    | 3750 | 2.5422          | 0.5526   |
-| 0.5132        | 20.9973 | 3937 | 2.5948          | 0.5509   |
-| 0.5115        | 22.0    | 4125 | 2.6048          | 0.5512   |
-| 0.5083        | 22.9973 | 4312 | 2.5811          | 0.5521   |
-| 0.5081        | 24.0    | 4500 | 2.5662          | 0.5513   |
-| 0.4862        | 24.9973 | 4687 | 2.6429          | 0.5522   |
-| 0.4845        | 26.0    | 4875 | 2.6020          | 0.5534   |
-| 0.4869        | 26.9973 | 5062 | 2.6339          | 0.5522   |
-| 0.4862        | 28.0    | 5250 | 2.6162          | 0.5524   |
-| 0.4856        | 28.9973 | 5437 | 2.6764          | 0.5526   |
-| 0.4871        | 30.0    | 5625 | 2.6703          | 0.5526   |
-| 0.4863        | 30.9973 | 5812 | 2.6787          | 0.5533   |
-| 0.4884        | 32.0    | 6000 | 2.6848          | 0.5528   |
-| 0.467         | 32.9973 | 6187 | 2.6689          | 0.5531   |
-| 0.4694        | 34.0    | 6375 | 2.7013          | 0.5525   |
-| 0.4712        | 34.9973 | 6562 | 2.7065          | 0.5521   |
-| 0.4733        | 36.0    | 6750 | 2.6707          | 0.5523   |
-| 0.4752        | 36.9973 | 6937 | 2.6757          | 0.5532   |
-| 0.4744        | 38.0    | 7125 | 2.7016          | 0.5534   |
-| 0.4759        | 38.9973 | 7312 | 2.7263          | 0.5526   |
-| 0.4759        | 40.0    | 7500 | 2.7360          | 0.5525   |
-| 0.4569        | 40.9973 | 7687 | 2.7580          | 0.5524   |
-| 0.4585        | 42.0    | 7875 | 2.7459          | 0.5521   |
-| 0.4602        | 42.9973 | 8062 | 2.7965          | 0.5522   |
-| 0.4631        | 44.0    | 8250 | 2.7995          | 0.5516   |
-| 0.4615        | 44.9973 | 8437 | 2.7972          | 0.5519   |
-| 0.4647        | 46.0    | 8625 | 2.8381          | 0.5519   |
-| 0.4663        | 46.9973 | 8812 | 2.7762          | 0.5535   |
-| 0.4672        | 48.0    | 9000 | 2.8142          | 0.5526   |
-| 0.4505        | 48.9973 | 9187 | 2.7870          | 0.5528   |
-| 0.4537        | 49.8667 | 9350 | 2.8024          | 0.5512   |
 ### Framework versions
-- PEFT 0.5.0
-- Transformers 4.41.1
 - Pytorch 2.1.0+cu121
-- Datasets 2.19.1
-- Tokenizers 0.19.1

 ---
+license: llama2
+base_model: meta-llama/Llama-2-7b-hf
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: lmind_nq_train6000_eval6489_v1_qa_3e-5_lora2
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # lmind_nq_train6000_eval6489_v1_qa_3e-5_lora2
+This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4443
+- Accuracy: 0.5966
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 2.0369        | 1.0   | 187  | 1.2953          | 0.6128   |
+| 1.2821        | 2.0   | 375  | 1.2741          | 0.6146   |
+| 1.1987        | 3.0   | 562  | 1.2715          | 0.6162   |
+| 1.066         | 4.0   | 750  | 1.3011          | 0.6151   |
+| 0.9381        | 5.0   | 937  | 1.3728          | 0.6126   |
+| 0.8238        | 6.0   | 1125 | 1.4599          | 0.6091   |
+| 0.7289        | 7.0   | 1312 | 1.5455          | 0.6064   |
+| 0.6559        | 8.0   | 1500 | 1.6359          | 0.6026   |
+| 0.5733        | 9.0   | 1687 | 1.7149          | 0.6006   |
+| 0.5336        | 10.0  | 1875 | 1.8006          | 0.5989   |
+| 0.5116        | 11.0  | 2062 | 1.8851          | 0.5982   |
+| 0.4934        | 12.0  | 2250 | 1.9262          | 0.5982   |
+| 0.4823        | 13.0  | 2437 | 1.9413          | 0.5974   |
+| 0.47          | 14.0  | 2625 | 2.0121          | 0.5967   |
+| 0.4661        | 15.0  | 2812 | 2.0250          | 0.5968   |
+| 0.462         | 16.0  | 3000 | 1.9805          | 0.5990   |
+| 0.4357        | 17.0  | 3187 | 2.0656          | 0.5976   |
+| 0.4348        | 18.0  | 3375 | 2.0308          | 0.5979   |
+| 0.4331        | 19.0  | 3562 | 2.0629          | 0.5990   |
+| 0.4341        | 20.0  | 3750 | 2.0815          | 0.5983   |
+| 0.434         | 21.0  | 3937 | 2.1253          | 0.5968   |
+| 0.4335        | 22.0  | 4125 | 2.1789          | 0.5971   |
+| 0.4346        | 23.0  | 4312 | 2.1455          | 0.5952   |
+| 0.4326        | 24.0  | 4500 | 2.1990          | 0.5971   |
+| 0.4139        | 25.0  | 4687 | 2.1890          | 0.5976   |
+| 0.4139        | 26.0  | 4875 | 2.1939          | 0.5968   |
+| 0.4162        | 27.0  | 5062 | 2.2190          | 0.5965   |
+| 0.4177        | 28.0  | 5250 | 2.2781          | 0.5955   |
+| 0.4173        | 29.0  | 5437 | 2.2681          | 0.5976   |
+| 0.4187        | 30.0  | 5625 | 2.2996          | 0.5959   |
+| 0.4199        | 31.0  | 5812 | 2.2395          | 0.5981   |
+| 0.4213        | 32.0  | 6000 | 2.2991          | 0.5957   |
+| 0.4015        | 33.0  | 6187 | 2.3223          | 0.5952   |
+| 0.4058        | 34.0  | 6375 | 2.3266          | 0.5957   |
+| 0.4056        | 35.0  | 6562 | 2.3779          | 0.5946   |
+| 0.4078        | 36.0  | 6750 | 2.3453          | 0.5951   |
+| 0.4097        | 37.0  | 6937 | 2.3379          | 0.5965   |
+| 0.4105        | 38.0  | 7125 | 2.3624          | 0.5969   |
+| 0.4116        | 39.0  | 7312 | 2.3846          | 0.5962   |
+| 0.4121        | 40.0  | 7500 | 2.3748          | 0.5945   |
+| 0.3973        | 41.0  | 7687 | 2.3797          | 0.5956   |
+| 0.3985        | 42.0  | 7875 | 2.3599          | 0.5967   |
+| 0.4014        | 43.0  | 8062 | 2.3475          | 0.5971   |
+| 0.4032        | 44.0  | 8250 | 2.3937          | 0.5987   |
+| 0.4028        | 45.0  | 8437 | 2.3863          | 0.5967   |
+| 0.4027        | 46.0  | 8625 | 2.4195          | 0.5956   |
+| 0.4046        | 47.0  | 8812 | 2.3832          | 0.5970   |
+| 0.4067        | 48.0  | 9000 | 2.3805          | 0.5973   |
+| 0.3923        | 49.0  | 9187 | 2.4460          | 0.5957   |
+| 0.3949        | 49.87 | 9350 | 2.4443          | 0.5966   |
 ### Framework versions
+- Transformers 4.34.0
 - Pytorch 2.1.0+cu121
+- Datasets 2.18.0
+- Tokenizers 0.14.1
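
The training-results table added in this commit can be inspected programmatically. The following is a minimal sketch and not part of the commit: it parses a handful of rows copied from the added table and reports the row with the lowest validation loss. The helper names `parse_rows` and `best_by_val_loss` are hypothetical, and only a subset of the rows is included for brevity.

```python
# Rows copied verbatim from the added (+) table in README.md (subset only).
TABLE = """
| 2.0369        | 1.0   | 187  | 1.2953          | 0.6128   |
| 1.2821        | 2.0   | 375  | 1.2741          | 0.6146   |
| 1.1987        | 3.0   | 562  | 1.2715          | 0.6162   |
| 1.066         | 4.0   | 750  | 1.3011          | 0.6151   |
| 0.9381        | 5.0   | 937  | 1.3728          | 0.6126   |
| 0.3949        | 49.87 | 9350 | 2.4443          | 0.5966   |
"""


def parse_rows(table):
    """Turn markdown table body rows into a list of dicts with numeric fields."""
    rows = []
    for line in table.strip().splitlines():
        # Drop the outer pipes, then split the remaining cells on '|'.
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        train_loss, epoch, step, val_loss, accuracy = cells
        rows.append(
            {
                "train_loss": float(train_loss),
                "epoch": float(epoch),
                "step": int(step),
                "val_loss": float(val_loss),
                "accuracy": float(accuracy),
            }
        )
    return rows


def best_by_val_loss(rows):
    """Return the row with the smallest validation loss."""
    return min(rows, key=lambda r: r["val_loss"])


rows = parse_rows(TABLE)
best = best_by_val_loss(rows)
print(f"lowest validation loss {best['val_loss']} at step {best['step']}")
```

In the full added table, the lowest validation loss (1.2715) and the highest accuracy (0.6162) both occur at step 562, while the final row at step 9350 reports 2.4443 and 0.5966, the values quoted in the README summary.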