In a sense the model is universal. It's just a 100GB (give or take) neural network.
And apparently (or so I've heard) feeding transformer models training data in Language A can improve their ability to understand Language B, the cross-lingual transfer effect. So maybe there's something truly universal there in some sense.