Why BERT is Not GPT
Towards AI
JUNE 12, 2024
It all started in 2013, when Word2Vec and n-grams were the state of the art in language modelling. Word2Vec, introduced by Mikolov et al. at Google in 2013, is a shallow neural network that learns word embeddings by training on context windows of words: it predicts a word from its neighbours, or the neighbours from the word. Mahajan, Patil, and Sankar later extended the idea to sub-word units in their paper titled ‘Word2Vec Using Character N-Grams’.
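To make "training on context windows" concrete, here is a minimal sketch of how training examples might be extracted from a sentence before any neural network is involved. The function names `skipgram_pairs` and `char_ngrams`, the default window size of 2, and the `<`/`>` word-boundary markers are assumptions chosen for illustration; they are not taken from the papers mentioned above.

```python
def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for every word within `window` positions."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is not its own context
                pairs.append((center, tokens[j]))
    return pairs


def char_ngrams(word, n=3):
    """Split a word into character n-grams, padding with assumed boundary markers."""
    padded = f"<{word}>"
    return [padded[k:k + n] for k in range(len(padded) - n + 1)]


if __name__ == "__main__":
    sentence = "the cat sat on the mat".split()
    for center, context in skipgram_pairs(sentence, window=1):
        print(center, "->", context)
    print(char_ngrams("cat"))
```

A skip-gram style model would then be trained to predict the context word from the center word in each pair, while a character n-gram variant would represent each word through pieces like those returned by `char_ngrams`.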