If you want to learn how to train bigger LLMs (I realize that's a pleonasm,
meaning roughly 1B+ param models), here are a couple of amazing resources: