Edit model card

This is a Llama 2 architecture model series trained on the FineWeb dataset. This is ~500 Million model and uses tiktoken cl100k_base model as tokenizer

Downloads last month
11
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Dataset used to train sabareesh88/fw14k