Edit model card

This model was derived from the bert-base-uncased checkpoint by replacing the GELU with ReLU activation function and continued pre-training to adapt it to the change of the activation function.

Downloads last month
19
Safetensors
Model size
110M params
Tensor type
I64
·
F32
·
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Datasets used to train mpiorczynski/relu-bert-base-uncased