Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Neko-Institute-of-Science
/
LLaMA-65B-4bit-32g
like
10
Text Generation
Transformers
llama
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
6132c4b
LLaMA-65B-4bit-32g
1 contributor
History:
5 commits
Neko-Institute-of-Science
Update README.md
6132c4b
over 1 year ago
.gitattributes
1.48 kB
initial commit
over 1 year ago
README.md
588 Bytes
Update README.md
over 1 year ago
config.json
507 Bytes
GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074
over 1 year ago
generation_config.json
137 Bytes
GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074
over 1 year ago
llama-65b-4bit-32g.safetensors
38.5 GB
LFS
GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074
over 1 year ago
special_tokens_map.json
2 Bytes
GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074
over 1 year ago
tokenizer.model
500 kB
LFS
GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074
over 1 year ago
tokenizer_config.json
141 Bytes
GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074
over 1 year ago