NullGPT2 / README.md
Inoob's picture
Update README.md
31a4385 verified
|
raw
history blame
No virus
726 Bytes
---
license: mit
base_model: openai-community/gpt2
---
---
license: mit
base_model: openai-community/gpt2-large
---
# Null-GPT2
## Description
This is a GPT2 Model, but only with the architecture, no pre-trained weights, biases, attention, etc.
This is useful for researchers who want to play with training the model (not tuning).
Generated via the github repo [Model Architecture Generator](https://github.com/ivanhe123/Model-Architecture-Generator)
## Use
First go into the directory of the model,
```
git clone https://github.com/ivanhe123/Model-Architecture-Generator
```
```
python -m randomnize_params -in "./NullGPT2" -out path_model_out
```
path_model_out is just the output path of the newly randomnized model.