the.com/transformer
the machine that learned to read everything by paying attention to nothing in particular
the paperNamed after one line: Attention Is All You Need
powers chatgptThe T in GPT means transformer
no recurrenceReads whole sequences at once, not word by word
born 2017Google researchers reshaped AI in eight pages
attention mathEvery word secretly weighs every other word