blue2959's picture
Update README.md
d2cb987 verified
metadata
language:
  - ko
metrics:
  - accuracy
  - f1
pipeline_tag: text-classification

XLM-Roberta-base --> 8emotions!

Label Dictionry

  • label_dictionary
  • emo2int = { "๊ธฐ์จ": 0, "๋‹นํ™ฉ": 1, "๋ถ„๋…ธ": 2, "๋ถˆ์•ˆ": 3, "์ƒ์ฒ˜": 4, "์Šฌํ””": 5, "์ค‘๋ฆฝ": 6 }
  • kore2en = { "๊ธฐ์จ": "joy", "๋‹นํ™ฉ": "surprise", "๋ถ„๋…ธ": "anger", "๋ถˆ์•ˆ": "fear", "์ƒ์ฒ˜": "hurt", "์Šฌํ””": "sadness", "์ค‘๋ฆฝ": "neutral" }

Dataset

๊ฐ์„ฑ๋Œ€ํ™”๋ง๋ญ‰์น˜(AI Hub)

ํ•œ๊ตญ์–ด ๊ฐ์ • ์ •๋ณด๊ฐ€ ํฌํ•จ๋œ ์—ฐ์†์  ๋Œ€ํ™” ๋ฐ์ดํ„ฐ์…‹(AIHub)

Input Format(Please Use Special Tokens [USR], [BOT] to use model API!)

  • (example) [USR] ์•ˆ๋…•. [BOT] ์•ˆ๋…•ํ•˜์„ธ์š”! ๋ฌด์—‡์„ ๋„์™€๋“œ๋ฆด๊นŒ์š”? [USR] ๋ณ„์ผ ์—†์–ด.

  • ์ด ๋‘๊ฐœ์˜ ํŠน์ˆ˜ ํ† ํฐ์€ ๋ฐ˜๋“œ์‹œ ์‚ฌ์šฉํ•ด์ฃผ์‹œ๊ธธ ๋ถ€ํƒ๋“œ๋ฆฝ๋‹ˆ๋‹ค.

  • And these are a part of real data. input_format

Metrics(F1, Accuracy, and Confusion Matrix!)

  • and confusion matrix like this.. ConfusionMatrix

  • and so on.. F1, Accuracy

Training_Steps