I am using gpt-2-simple to do the training on google colab, especially for the 355M one would require more than 8GB of VRAM to train.
![](https://i0.wp.com/blog.thetobysiu.com/wp-content/uploads/2020/05/Screenshot-from-2020-04-15-15-30-48.png)
![](https://i1.wp.com/blog.thetobysiu.com/wp-content/uploads/2020/05/Screenshot-from-2020-04-15-15-32-26.png)
![](https://i2.wp.com/blog.thetobysiu.com/wp-content/uploads/2020/05/Screenshot-from-2020-04-16-00-30-56.png)
SIU KING WAI
I am using gpt-2-simple to do the training on google colab, especially for the 355M one would require more than 8GB of VRAM to train.