Finds frequent items for columns, possibly with false positives, using the
frequent element count algorithm proposed by Karp, Schenker, and Papadimitriou.
The support should be greater than 1e-4.
For internal use only.
The input DataFrame
The names of the columns in which to search for frequent items
The minimum frequency for an item to be considered frequent. Should be greater
than 1e-4.
A local DataFrame with an array of frequent items for each column.
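
The counting idea behind the description above can be sketched for a single column as follows. This is a minimal illustration of a bounded-counter frequent-items scheme in the spirit of Karp, Schenker, and Papadimitriou, not the implementation this documentation describes. The function name `frequent_items_sketch`, its parameters, and the choice of `ceil(1 / support)` counters are assumptions made for the example.

```python
import math
from typing import Dict, Hashable, Iterable, List


def frequent_items_sketch(values: Iterable[Hashable], support: float) -> List[Hashable]:
    """Return candidate frequent items for one column (hypothetical helper).

    Every item whose relative frequency exceeds `support` is returned.
    Items below that threshold may also appear (false positives).
    """
    if not 0.0 < support < 1.0:
        raise ValueError("support must be strictly between 0 and 1")

    # Assumption for this sketch: keep at most ceil(1 / support) counters.
    max_counters = math.ceil(1.0 / support)
    counters: Dict[Hashable, int] = {}

    for value in values:
        if value in counters:
            # Known candidate: count one more occurrence.
            counters[value] += 1
        elif len(counters) < max_counters:
            # Free slot: start tracking the new candidate.
            counters[value] = 1
        else:
            # No free slot: decrement every counter and drop those that reach zero.
            # The incoming value itself is not stored.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]

    return list(counters)
```

For example, `frequent_items_sketch(["a"] * 6 + ["b"] * 3 + ["c"], 0.5)` keeps `"a"`, which is truly frequent, and also `"b"`, which is a false positive. A second pass over the data with exact counts would be needed to remove such false positives.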