public interface Vectorizer
All Known Implementing Classes: BinaryVectorizer, TextFormatVectorizer
Modifier and Type | Method | Description |
---|---|---|
boolean | contains(String token) | True if the token is known by the vectorizer. |
int | dimensions() | The dimensionality of the embeddings. |
boolean | isCaseless() | |
int | size() | The total number of known tokens. |
float[] | unknownVector() | The vector for unknown tokens. |
float[] | vectorize(String token) | Get the vector for a token. |

float[] vectorize(String token) throws IOException
See also unknownVector().
Parameters: token - a token String
Throws: IOException - if there was an error accessing the vector file.

boolean contains(String token)
Parameters: token - a token String

float[] unknownVector()

int dimensions()

int size()

boolean isCaseless()
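
The following is a minimal sketch of how the methods above might be used together; it is not taken from the library. The `Vectorizer` interface is redeclared locally with the signatures listed on this page so that the example compiles on its own. `MapVectorizer`, `VectorizerDemo`, and `average` are hypothetical names introduced for illustration. Two behaviours are assumptions rather than documented facts: that a caseless vectorizer lower-cases tokens before lookup, and that `vectorize` falls back to `unknownVector()` for unknown tokens.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

class VectorizerDemo {
    // Averages the vectors of all tokens; the checked IOException from
    // vectorize() is wrapped so callers need no try/catch.
    static float[] average(Vectorizer vectorizer, List<String> tokens) {
        float[] sum = new float[vectorizer.dimensions()];
        if (tokens.isEmpty()) {
            return sum;
        }
        try {
            for (String token : tokens) {
                float[] v = vectorizer.vectorize(token);
                for (int i = 0; i < sum.length; i++) {
                    sum[i] += v[i];
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        for (int i = 0; i < sum.length; i++) {
            sum[i] /= tokens.size();
        }
        return sum;
    }

    public static void main(String[] args) {
        Map<String, float[]> source = new HashMap<>();
        source.put("cat", new float[] {1f, 0f});
        source.put("dog", new float[] {0f, 1f});
        Vectorizer vectorizer = new MapVectorizer(source, 2, true);

        System.out.println(vectorizer.contains("Cat")); // true (caseless lookup)
        System.out.println(vectorizer.size());          // 2
        float[] avg = average(vectorizer, Arrays.asList("cat", "dog"));
        System.out.println(avg[0] + " " + avg[1]);      // 0.5 0.5
    }
}

// Local copy of the signatures listed on this page.
interface Vectorizer {
    float[] vectorize(String token) throws IOException;
    boolean contains(String token);
    float[] unknownVector();
    int dimensions();
    int size();
    boolean isCaseless();
}

// Hypothetical in-memory implementation; not part of the library.
class MapVectorizer implements Vectorizer {
    private final Map<String, float[]> vectors = new HashMap<>();
    private final int dimensions;
    private final boolean caseless;

    MapVectorizer(Map<String, float[]> source, int dimensions, boolean caseless) {
        this.dimensions = dimensions;
        this.caseless = caseless;
        for (Map.Entry<String, float[]> e : source.entrySet()) {
            vectors.put(normalize(e.getKey()), e.getValue().clone());
        }
    }

    // Assumption: a caseless vectorizer lower-cases tokens before lookup.
    private String normalize(String token) {
        return caseless ? token.toLowerCase(Locale.ROOT) : token;
    }

    @Override
    public float[] vectorize(String token) {
        float[] v = vectors.get(normalize(token));
        // Assumption: unknown tokens fall back to unknownVector().
        return v != null ? v.clone() : unknownVector();
    }

    @Override
    public boolean contains(String token) {
        return vectors.containsKey(normalize(token));
    }

    @Override
    public float[] unknownVector() {
        return new float[dimensions]; // all zeros in this sketch
    }

    @Override
    public int dimensions() { return dimensions; }

    @Override
    public int size() { return vectors.size(); }

    @Override
    public boolean isCaseless() { return caseless; }
}
```

Because `vectorize` declares `IOException`, any caller working against the interface type has to handle or propagate it, even when a particular implementation (like the in-memory one here) never throws. Checking `contains` first is a way to distinguish a known token from one that would otherwise be mapped to the unknown vector.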
Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.