OrionStarAI
/

Orion-14B-Chat-RAG

Text Generation

Model card Files Files and versions Community

Du Chen commited on Jan 20

Commit

cbd5ffb

•

1 Parent(s): 2e76651

update readme

Files changed (2) hide show

README.md +2 -0
README_cn.md +2 -0

README.md CHANGED Viewed

@@ -45,6 +45,8 @@ pipeline_tag: text-generation
 # Model Introduction
 - Orion-14B series models are open-source multilingual large language models trained from scratch by OrionStarAI.  The base model is trained on 2.5T multilingual corpus, including Chinese, English, Japanese, Korean, etc, and it exhibits superior performance in these languages.
 - The Orion-14B series models exhibit the following features:

 # Model Introduction
+- **Orion-14B-Chat-RAG:**  A chat-model fine-tuned on a custom retrieval augmented generation dataset, achieving superior performance in retrieval augmented generation tasks.
 - Orion-14B series models are open-source multilingual large language models trained from scratch by OrionStarAI.  The base model is trained on 2.5T multilingual corpus, including Chinese, English, Japanese, Korean, etc, and it exhibits superior performance in these languages.
 - The Orion-14B series models exhibit the following features:

README_cn.md CHANGED Viewed

@@ -45,6 +45,8 @@ pipeline_tag: text-generation
 # 模型介绍
 - Orion-14B-Base是一个具有140亿参数的多语种大模型，该模型在一个包含2.5万亿token的多样化数据集上进行了训练，涵盖了中文、英语、日语、韩语等多种语言。在多语言环境下的一系列任务中展现出卓越的性能。在主流的公开基准评测中，Orion-14B系列模型表现优异，多项指标显著超越同等参数基本的其他模型。
 - Orion-14B系列大模型有以下几个特点：

 # 模型介绍
+- **Orion-14B-Chat-RAG:**  在一个定制的检索增强生成数据集上进行微调的聊天模型，在检索增强生成任务中取得了卓越的性能。
 - Orion-14B-Base是一个具有140亿参数的多语种大模型，该模型在一个包含2.5万亿token的多样化数据集上进行了训练，涵盖了中文、英语、日语、韩语等多种语言。在多语言环境下的一系列任务中展现出卓越的性能。在主流的公开基准评测中，Orion-14B系列模型表现优异，多项指标显著超越同等参数基本的其他模型。
 - Orion-14B系列大模型有以下几个特点：