Shi W. x 2

References: 2

In-Context Pretraining: Language Modeling Beyond Document Boundaries

… Recursive deep models for semantic compositionality over a sentiment treebank. In Proce...

Li M., Shi W., Min S., Lomeli M., Zhou C., Lin V.

Scaling Expert Language Models with Unsupervised Domain Discovery

Large language models are typically trained densely: all parameters are updated with respect to all ...

Li M., Shi W., Gururangan S., Lewis M., Althoff T.

Related keywords