Edit model card

This model is the DeciLM-6b-Instruct model, trained specifically for medicine

Galen uses the

### User: {prompt}

### Response:

or

{prompt} 

Prompt templates

Galen Training Recipe:

  • target_modules = ["q_proj", "v_proj", "gate_proj", "down_proj", "up_proj", "k_proj", "o_proj"]
  • Learning Rate: 4e-4
  • LR Scheduler: constant
  • 250 StepsLoss

T3: 1 Hour

Downloads last month
110
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Dataset used to train NewstaR/StableGalen-6b