File size: 1,057 Bytes

d92f956


---
language: en
license: mit
tags:
- VIT
- image-classification
- drowsiness-detection
---

# VIT_Drowsiness

This model is a fine-tuned version of [google/vit-base-patch16-224](https://huggingface.co/google/vit-base-patch16-224) for drowsiness detection.

## Model description

This model is a Vision Transformer (ViT) fine-tuned for drowsiness detection. It classifies images into two categories: drowsy and not drowsy.

## Intended uses & limitations

This model is intended for drowsiness detection in images. It should be used on facial images similar to those in the training dataset.

## Training data

The model was trained on a custom dataset located at /kaggle/input/nthuddd2/train_data. The dataset was split into 70% training, 15% validation, and 15% test sets.

## Training procedure

The model was trained for 10 epochs using the Lion optimizer with a learning rate of 0.0001 and weight decay of 0.01. A cosine learning rate scheduler with 0.1 warmup ratio was used.

## Evaluation results

[Add your evaluation results here after training]