---
license: apache-2.0
datasets:
  - McGill-NLP/FaithDial
language:
  - en
metrics:
  - bleu
  - bertscore
  - accuracy
pipeline_tag: conversational
---

T3 stands for Terribly Tiny Transformers, an efficient way of creating tiny distilled (student) models that aim for hallucination-free generation in parameter-constrained environments such as edge devices. The base model is a T3 adaptation of the T5 model. The T3 paradigm can be extended to all types of models (encoder-only, decoder-only, and seq2seq).
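
This README does not describe the training objective used for T3, so the following is only a minimal sketch of how a student model is commonly distilled from a teacher: a KL-divergence loss between temperature-softened output distributions. The function names, the temperature value, and the example logits are all illustrative assumptions and are not taken from the T3 code.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities after dividing by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    This is a generic distillation loss (an assumption), not necessarily
    the objective used to train T3.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2


# Hypothetical logits over a three-token vocabulary.
teacher = [2.0, 0.5, -1.0]
student = [0.1, 0.2, 0.3]

print(distillation_loss(teacher, teacher))  # identical logits give 0.0
print(distillation_loss(teacher, student))  # a positive value
```

In practice such a term is usually combined with the ordinary cross-entropy loss on the ground-truth responses, with a weighting that has to be tuned per task.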