dtorber commited on
Commit
afcca25
1 Parent(s): ac278fa

Model save

Browse files
Files changed (2) hide show
  1. README.md +7 -12
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,6 +1,5 @@
1
  ---
2
  tags:
3
- - summarization
4
  - generated_from_trainer
5
  model-index:
6
  - name: BioNLP-intro-disc-eLife
@@ -32,22 +31,18 @@ More information needed
32
 
33
  The following hyperparameters were used during training:
34
  - learning_rate: 1.3739167643078955e-06
35
- - train_batch_size: 8
36
- - eval_batch_size: 8
37
  - seed: 42
38
  - distributed_type: multi-GPU
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
- - num_epochs: 15
42
  - mixed_precision_training: Native AMP
43
 
44
- ### Training results
45
-
46
-
47
-
48
  ### Framework versions
49
 
50
- - Transformers 4.35.2
51
- - Pytorch 1.13.1+cu117
52
- - Datasets 2.16.1
53
- - Tokenizers 0.15.2
 
1
  ---
2
  tags:
 
3
  - generated_from_trainer
4
  model-index:
5
  - name: BioNLP-intro-disc-eLife
 
31
 
32
  The following hyperparameters were used during training:
33
  - learning_rate: 1.3739167643078955e-06
34
+ - train_batch_size: 4
35
+ - eval_batch_size: 4
36
  - seed: 42
37
  - distributed_type: multi-GPU
38
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
39
  - lr_scheduler_type: linear
40
+ - num_epochs: 10
41
  - mixed_precision_training: Native AMP
42
 
 
 
 
 
43
  ### Framework versions
44
 
45
+ - Transformers 4.40.1
46
+ - Pytorch 2.3.0+cu121
47
+ - Datasets 2.18.0
48
+ - Tokenizers 0.19.1
generation_config.json CHANGED
@@ -4,5 +4,5 @@
4
  "decoder_start_token_id": 2,
5
  "eos_token_id": 2,
6
  "pad_token_id": 1,
7
- "transformers_version": "4.35.2"
8
  }
 
4
  "decoder_start_token_id": 2,
5
  "eos_token_id": 2,
6
  "pad_token_id": 1,
7
+ "transformers_version": "4.40.1"
8
  }