Hierarchical softmax Hierarchical Softmax in neural network language model word2vec References Hierarchical Probabilistic Neural Network Language Model Distributed Representations of Words and Phrases and their Compositionality