Don't Sweep your Learning Rate under the Rug: A Closer Look at Cross-modal Transfer of Pretrained Transformers
… Subsequently, the model is fine-tuned on a given task of interest, e.g., sentiment analysi...
Li, M., Rothermel, D., Rocktäschel, T., Foerster, J.