albertcardona,
@albertcardona@mathstodon.xyz avatar

The as the public record and version control system of a scientific manuscript: 7 versions spanning 6 years.

"[Submitted on 12 Jun 2017 (v1), last revised 2 Aug 2023 (this version, v7)]"

"Attention Is All You Need" by Vaswani et al.
https://arxiv.org/abs/1706.03762

That's the paper that introduced the architecture, dispensing with recurrence and convolutions to achieve much faster training times and higher performance in a language task.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • machinelearning
  • mdbf
  • rosin
  • Youngstown
  • everett
  • khanakhh
  • slotface
  • InstantRegret
  • Durango
  • ngwrru68w68
  • kavyap
  • modclub
  • DreamBathrooms
  • GTA5RPClips
  • magazineikmin
  • provamag3
  • thenastyranch
  • cubers
  • cisconetworking
  • osvaldo12
  • ethstaker
  • normalnudes
  • Leos
  • tester
  • megavids
  • tacticalgear
  • anitta
  • JUstTest
  • lostlight
  • All magazines