alokabhishek committed
Commit 4af1377
Parent(s): 9b66a98

update readme

README.md CHANGED
@@ -3,18 +3,34 @@ library_name: transformers
 tags: []
 ---
 
-#
+# Mistral-7B-Instruct-v0.2-bnb-4bit
 
 <!-- Provide a quick summary of what the model is/does. -->
+This repo contains the 4-bit quantized (using bitsandbytes) version of Mistral AI_'s Mistral-7B-Instruct-v0.2.
 
 
 
 ## Model Details
 
+Model creator: [Mistral AI_](https://huggingface.co/mistralai)
+Original model: [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)
+
+
+### About 4-bit quantization using bitsandbytes
+
+QLoRA: Efficient Finetuning of Quantized LLMs: [arXiv - QLoRA: Efficient Finetuning of Quantized LLMs](https://arxiv.org/abs/2305.14314)
+
+Hugging Face blog post on 4-bit quantization using bitsandbytes: [Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA](https://huggingface.co/blog/4bit-transformers-bitsandbytes)
+
+bitsandbytes GitHub repo: [bitsandbytes github repo](https://github.com/TimDettmers/bitsandbytes)
+
+
 ### Model Description
 
 <!-- Provide a longer summary of what this model is. -->
 
+
+
 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
 
 - **Developed by:** [More Information Needed]