Decoder-only transformers are just the decoder portion of the transformer architecture! However, the cross attention portion of the decoder is removed
Decoder-only transformers are just the decoder portion of the transformer architecture! However, the cross attention portion of the decoder is removed