haxor@derp.fooMB to Hacker News@derp.fooEnglish · 1 year agoThink Before You Speak: Training Language Models with Pause Tokensarxiv.orgexternal-linkmessage-square0fedilinkarrow-up14arrow-down12file-textcross-posted to: machinelearning@kbin.socialsingularity@lemmit.onlinetechnews@radiation.party
arrow-up12arrow-down1external-linkThink Before You Speak: Training Language Models with Pause Tokensarxiv.orghaxor@derp.fooMB to Hacker News@derp.fooEnglish · 1 year agomessage-square0fedilinkfile-textcross-posted to: machinelearning@kbin.socialsingularity@lemmit.onlinetechnews@radiation.party