merlin.models.utils.schema_utils.get_embedding_size_from_cardinality#
- merlin.models.utils.schema_utils.get_embedding_size_from_cardinality(cardinality: int, multiplier: float = 2.0, ensure_multiple_of_8: bool = False) int [source]#
Provides a heuristic (from Google) that suggests the embedding dimension as a function (forth root) of the feature cardinality.
- Parameters
cardinality (int) – The number of unique values of a categorical feature
multiplier (float, optional) – Multiplier to be applied on the forth root of the cardinality. Google recommends multiplier in the [2.0,10.0] range, by default 2.0
ensure_multiple_of_8 (bool, optional) – If enabled, adjusts the embedding dim to the smallest greater number multiple of 8, to ensure best performance with GPU ops, by default False
- Returns
The suggested embedding dimension based on the feature cardinality
- Return type