An Argument for Modeling Consolidation
Or What Happens if Attention (and Transformer) Is All We Need?
✋ This post uses images, so if you are reading this on an email client, enable the “display images from sender” to get the most out of the post. If you are a new subscriber, welcome! If you find these writings valuable, consider sharing them with a colleague or giving a shoutout on Twitter.
What if a Transformer is all we need?
Even if you don’t believe…