Original Model: https://huggingface.co/MarinaraSpaghetti/Nemomix-v4.0-12B
made with https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
Models Q2_K_L, Q4_K_L, Q5_K_L, Q6_K_L, are using Q_8 output tensors and token embeddings
using bartowski's imatrix dataset
- Downloads last month
- 36
Inference API (serverless) is not available, repository is disabled.
Model tree for Reiterate3680/Nemomix-v4.0-12B-GGUF
Base model
MarinaraSpaghetti/Nemomix-v4.0-12B
Quantized
this model