The .vec files contain only the aggregated word vectors, in plain text. The .bin files additionally contain the model parameters and, crucially, the vectors for all the n-grams.
So if you want to encode words that were not in the training vocabulary using those n-grams (FastText's famous "subword information"), you need an API that can read FastText .bin files (most only support the .vec files, however...).
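
To make the difference concrete, here is a minimal sketch in plain Python; it is not FastText's own code. It assumes the usual .vec layout (a "count dim" header line, then one word and its numbers per line) and uses made-up toy values. The helper names `read_vec` and `char_ngrams` are hypothetical, and the n-gram lengths 3 to 6 are assumed to match FastText's default `minn`/`maxn` settings. As far as I know, real FastText also hashes n-grams into a fixed number of buckets, which this sketch leaves out.

```python
import io


def read_vec(stream):
    """Parse .vec-style text: a 'count dim' header, then 'word v1 v2 ...' per line."""
    _count, dim = map(int, stream.readline().split())
    vectors = {}
    for line in stream:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def char_ngrams(word, minn=3, maxn=6):
    """Character n-grams of '<word>'; a .bin file stores vectors for units like these."""
    wrapped = f"<{word}>"  # '<' and '>' mark the word boundaries
    return [
        wrapped[i:i + n]
        for n in range(minn, maxn + 1)
        for i in range(len(wrapped) - n + 1)
    ]


# Toy .vec content with made-up numbers (not taken from any real model).
toy_vec = io.StringIO("2 3\ncat 0.1 0.2 0.3\ndog 0.4 0.5 0.6\n")
vectors = read_vec(toy_vec)

# A .vec file is just a lookup table, so an unseen word has no entry.
print("cats" in vectors)

# With the n-gram vectors from a .bin file, these pieces could still be combined.
print(char_ngrams("cats", minn=3, maxn=4))
```

A word like "cats" is missing from the lookup table, but its n-grams overlap with those of "cat", which is what lets a .bin-aware loader build a vector for it. To my knowledge, the official `fasttext` Python package (`fasttext.load_model` followed by `get_word_vector`) and gensim's `load_facebook_vectors` can both read .bin files, but check their documentation for your version.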