Neko-Institute-of-Science/LLaMA-65B-4bit-32g

Text Generation

text-generation-inference

Inference Endpoints

1 contributor

History: 5 commits

Latest commit: Update README.md (6132c4b), Neko-Institute-of-Science, over 1 year ago

.gitattributes, 1.48 kB: initial commit (over 1 year ago)
README.md, 588 Bytes: Update README.md (over 1 year ago)
config.json, 507 Bytes: GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074 (over 1 year ago)
generation_config.json, 137 Bytes: GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074 (over 1 year ago)
llama-65b-4bit-32g.safetensors, 38.5 GB, LFS: GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074 (over 1 year ago)
special_tokens_map.json, 2 Bytes: GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074 (over 1 year ago)
tokenizer.model, 500 kB, LFS: GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074 (over 1 year ago)
tokenizer_config.json, 141 Bytes: GPTQ triton forgot head but still works on 49efe0b67db4b40eac2ae963819ebc055da64074 (over 1 year ago)