title | description | author | ms.author | ms.date | ms.service | ms.subservice | ms.topic | keywords | ms.devlang | monikerRange | ||
---|---|---|---|---|---|---|---|---|---|---|---|---|
n_gram_hash: n_gram_hash |
Extracts NGrams from text and convert them to vector using hashing trick. |
VanMSFT |
vanto |
07/15/2019 |
sql |
machine-learning-services |
reference |
|
python |
>=sql-server-2017||>=sql-server-linux-ver15 |
microsoftml.n_gram_hash(hash_bits: numbers.Real = 16,
ngram_length: numbers.Real = 1, skip_length: numbers.Real = 0,
all_lengths: bool = True, seed: numbers.Real = 314489979,
ordered: bool = True, invert_hash: numbers.Real = 0)
Extracts NGrams from text and convert them to vector using hashing trick.
Number of bits to hash into. Must be between 1 and 30, inclusive. (settings).
Ngram length (settings).
Maximum number of tokens to skip when constructing an ngram (settings).
Whether to include all ngram lengths up to ngramLength or only ngramLength (settings).
Hashing seed (settings).
Whether the position of each source column should be included in the hash (when there are multiple source columns). (settings).
Limit the number of keys used to generate the slot name to this many. 0 means no invert hashing, -1 means no limit. (settings).