Misha Laskin
Staff Research Scientist @DeepMind. Previously @berkeley_ai. YC alum.
GPT has been a core part of the unsupervised learning revolution that's been happening in NLP. In part 2 of the transformer series, we'll build GPT from the ground up. This thread...
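The mechanism that distinguishes GPT from a plain transformer encoder is causal self-attention: each position may only attend to itself and earlier positions. Below is a minimal numpy sketch of that masking step, not the thread's actual implementation; the variable names are illustrative.

```python
import numpy as np

T = 4  # sequence length

# upper-triangular mask: True where position j is in the future of position i
mask = np.triu(np.ones((T, T)), k=1).astype(bool)

# toy attention scores; masked entries are set to -inf before the softmax,
# so they contribute exp(-inf) = 0 probability mass
scores = np.zeros((T, T))
scores[mask] = -np.inf

weights = np.exp(scores) / np.exp(scores).sum(-1, keepdims=True)
# row i is a distribution over positions 0..i only, e.g. row 0 is [1, 0, 0, 0]
```

Because the mask zeroes out future positions before normalization, each row of `weights` still sums to 1 while attending only to the past.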
Transformers are arguably the most impactful deep learning architecture from the last 5 yrs. In the next few threads, we'll cover multi-head attention, GPT and BERT, Vision Transf...
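At the core of all of these models is multi-head attention: project the input into queries, keys, and values, split the model dimension across heads, compute scaled dot-product attention per head, then concatenate and project back. A minimal numpy sketch of that computation (single sequence, no mask; weight names are illustrative, not from the thread):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))  # subtract max for stability
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(x, Wq, Wk, Wv, Wo, num_heads):
    # x: (seq_len, d_model); each W: (d_model, d_model)
    T, d = x.shape
    dh = d // num_heads  # per-head dimension
    # project, then split the feature axis into heads: (heads, seq_len, dh)
    q = (x @ Wq).reshape(T, num_heads, dh).transpose(1, 0, 2)
    k = (x @ Wk).reshape(T, num_heads, dh).transpose(1, 0, 2)
    v = (x @ Wv).reshape(T, num_heads, dh).transpose(1, 0, 2)
    # scaled dot-product attention per head: (heads, T, T)
    attn = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(dh), axis=-1)
    out = attn @ v                              # (heads, T, dh)
    out = out.transpose(1, 0, 2).reshape(T, d)  # concatenate heads
    return out @ Wo                             # final output projection

rng = np.random.default_rng(0)
d_model, seq_len, heads = 8, 5, 2
x = rng.standard_normal((seq_len, d_model))
Wq, Wk, Wv, Wo = (rng.standard_normal((d_model, d_model)) for _ in range(4))
y = multi_head_attention(x, Wq, Wk, Wv, Wo, heads)
print(y.shape)  # (5, 8)
```

The reshape/transpose trick lets all heads run as one batched matrix multiply instead of a Python loop over heads.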
Patch extraction is a fundamental operation in deep learning, especially for computer vision. By the end of this thread, you'll know how to implement an efficient vectorized patch...
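For non-overlapping patches (as in a Vision Transformer front end), the vectorized version is a reshape/transpose with no Python loops. A sketch of that approach in numpy, assuming the image height and width are divisible by the patch size; this is one common recipe, not necessarily the thread's exact implementation:

```python
import numpy as np

def extract_patches(img, p):
    # img: (H, W, C) with H and W divisible by patch size p
    H, W, C = img.shape
    x = img.reshape(H // p, p, W // p, p, C)
    x = x.transpose(0, 2, 1, 3, 4)     # (H//p, W//p, p, p, C): group by patch
    return x.reshape(-1, p * p * C)    # (num_patches, patch_dim), row-major

img = np.arange(4 * 4 * 3, dtype=np.float32).reshape(4, 4, 3)
patches = extract_patches(img, 2)
print(patches.shape)  # (4, 12): four 2x2 patches, each flattened to 12 values
```

Each output row is one patch flattened in row-major pixel order, which is exactly the layout a linear patch-embedding matrix expects.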