How to : Extract all (word, vector) pairs from spacy Vocab ?
iteration like:
sort([ w.text for w in nlp.vocab ])
array(['\t', '\n', ' ', '"', "'", "''", "'Cause", "'Cos", "'Coz", "'Cuz", "'S", "'bout", "'cause", "'cos", "'coz", "'cuz", "'d", "'em", "'ll", "'m", "'nuff", "'re", "'s",
"'ve", "'y", '(', '(*_*)', '(-8', '(-:', '(-;', '(-_-)', ...
returns only ~800 weird items
nlp.vocab has 764 words. Only radicals:
output: 764
nlp.vocab has no vector:
output: []
You may find information map at: https://spacy.io/api