AntonV
/

mamba2-130m-hf

Inference Endpoints

Model card Files Files and versions Community

AntonV commited on 7 days ago

Commit

97d2a95

•

1 Parent(s): ee4eabb

Update README.md

Files changed (1) hide show

README.md +39 -3

README.md CHANGED Viewed

@@ -1,3 +1,39 @@
----
-license: mit
----

+---
+tags:
+- mamba2
+license: mit
+library_name: transformers
+---
+# mamba2-130m-hf
+Converted files of the original model at [mamba2-130m](https://huggingface.co/state-spaces/mamba2-130m) to HF transformers compatible formats.
+Not affiliated with both the original authors or hf.
+## Usage
+```python
+from transformers import AutoModelForCausalLM
+tokenizer = AutoTokenizer.from_pretrained("AntonV/mamba2-130m-hf")
+model = AutoModelForCausalLM.from_pretrained("AntonV/mamba2-130m-hf")
+input_ids = tokenizer("Hey how are you doing?", return_tensors="pt")["input_ids"]
+out = model.generate(input_ids, max_new_tokens=10)
+print(tokenizer.batch_decode(out))
+```
+## Citation
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+```bibtex
+@inproceedings{mamba2,
+ title={Transformers are {SSM}s: Generalized Models and Efficient Algorithms Through Structured State Space Duality},
+ author={Dao, Tri and Gu, Albert},
+ booktitle={International Conference on Machine Learning (ICML)},
+ year={2024}
+}
+```