Quick Learnology

Statistical Approaches :

Statistical analysis is a group of analytical methods within natural language processing. What these methods have in common is the probabilistic linking of grammar rules. Grammar rules have traditionally been viewed in computational linguistics as defining valid sentences in a language.

Word 2 Vector Concepts:

  • A word vector is simply a vector of numbers representing the meaning of a word.
  • More simply, a word vector is a set of real values.

Where do word vectors come from?

A good question at this point is where do these weights and measurements come from! There are two common ways word vectors are generated.

  • Counts of word/context co-occurrences
  • Predictions of context-specific words (skip-gram neural network model, i.e. word2vec)

Word2 Vec : Similar words are mapped together in vector space. Notice how “cats” and “dogs” resemble “pets,” how “elephants,” “lions,” “tigers,” are grouped into zoos, wilds, and so on.