julien-c HF staff commited on
Commit
ad86706
1 Parent(s): bd6d8cb

Migrate model card from transformers-repo

Browse files

Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/deepset/gelectra-large-generator/README.md

Files changed (1) hide show
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: de
3
+ license: mit
4
+ datasets:
5
+ - wikipedia
6
+ - OPUS
7
+ - OpenLegalData
8
+ - oscar
9
+ ---
10
+
11
+ # German ELECTRA large generator
12
+
13
+ Released, Oct 2020, this is the generator component of the German ELECTRA language model trained collaboratively by the makers of the original German BERT (aka "bert-base-german-cased") and the dbmdz BERT (aka bert-base-german-dbmdz-cased). In our [paper](https://arxiv.org/pdf/2010.10906.pdf), we outline the steps taken to train our model.
14
+
15
+ The generator is useful for performing masking experiments. If you are looking for a regular language model for embedding extraction, or downstream tasks like NER, classification or QA, please use deepset/gelectra-large.
16
+
17
+ ## Overview
18
+ **Paper:** [here](https://arxiv.org/pdf/2010.10906.pdf)
19
+ **Architecture:** ELECTRA large (generator)
20
+ **Language:** German
21
+
22
+ ## Performance
23
+ ```
24
+ GermEval18 Coarse: 80.70
25
+ GermEval18 Fine: 55.16
26
+ GermEval14: 88.95
27
+ ```
28
+
29
+ See also:
30
+ deepset/gbert-base
31
+ deepset/gbert-large
32
+ deepset/gelectra-base
33
+ deepset/gelectra-large
34
+ deepset/gelectra-base-generator
35
+ deepset/gelectra-large-generator
36
+
37
+ ## Authors
38
+ Branden Chan: `branden.chan [at] deepset.ai`
39
+ Stefan Schweter: `stefan [at] schweter.eu`
40
+ Timo Möller: `timo.moeller [at] deepset.ai`
41
+
42
+ ## About us
43
+ ![deepset logo](https://raw.githubusercontent.com/deepset-ai/FARM/master/docs/img/deepset_logo.png)
44
+
45
+ We bring NLP to the industry via open source!
46
+ Our focus: Industry specific language models & large scale QA systems.
47
+
48
+ Some of our work:
49
+ - [German BERT (aka "bert-base-german-cased")](https://deepset.ai/german-bert)
50
+ - [FARM](https://github.com/deepset-ai/FARM)
51
+ - [Haystack](https://github.com/deepset-ai/haystack/)
52
+
53
+ Get in touch:
54
+ [Twitter](https://twitter.com/deepset_ai) | [LinkedIn](https://www.linkedin.com/company/deepset-ai/) | [Website](https://deepset.ai)
55
+
56
+