rccmsu committed on
Commit
082dbdf
1 Parent(s): 946002a

Update README.md

  1. README.md +2 -0
README.md CHANGED
@@ -11,6 +11,8 @@ Up to 60% faster generation and 35% training (on identical russian text sequence
 
 Colab: https://colab.research.google.com/drive/109ZhEB6STy-0jO-Z_4ttkWr1jg_FCTRW?usp=sharing
 
+Paper: Tikhomirov M., Chernyshev D. Impact of Tokenization on LLaMa Russian Adaptation //arXiv preprint arXiv:2312.02598. – 2023.
+
 ## Model description
 
 Instruction version (Saiga datasets) of Russian adaptation of LLaMa-2-7B by replacing the tokenizer.