Shim to allow us to implement scala.Iterator in Java.
Shim to allow us to implement scala.Iterator in Java. Scala 2.11+ has an AbstractIterator
class for this, but that class is private[scala]
in 2.10. We need to explicitly fix this to
Row
in order to work around a spurious IntelliJ compiler error. This cannot be an abstract
class because that leads to compilation errors under Scala 2.11.
Builds a map in which keys are case insensitive.
Builds a map in which keys are case insensitive. Input map can be accessed for cases where case-sensitive information is required. The primary constructor is marked private to avoid nested case-insensitive map creation, otherwise the keys in the original map will become case-insensitive in this scenario.
This is an internal data representation for map type in Spark SQL.
This is an internal data representation for map type in Spark SQL. This should not implement
equals
and hashCode
because the type cannot be used as join keys, grouping keys, or
in equality tests. See SPARK-9415 and PR#13847 for the discussions.
Helper class to compute approximate quantile summary.
Helper class to compute approximate quantile summary. This implementation is based on the algorithm proposed in the paper: "Space-efficient Online Computation of Quantile Summaries" by Greenwald, Michael and Khanna, Sanjeev. (http://dx.doi.org/10.1145/375663.375670)
In order to optimize for speed, it maintains an internal buffer of the last seen samples, and only inserts them after crossing a certain size threshold. This guarantees a near-constant runtime complexity compared to the original algorithm.
An implementation of ArrayData serialized like a SerializedRow
that pays attention to the platform endianness and uses a consistent
endian format (little-endian) so is suitable for storage.
An implementation of MapData serialized like SerializedRow
elements
that pays attention to the platform endianness and uses a consistent endian
format (little-endian) so is suitable for storage.
Like UnsafeRow uses raw memory bytes to encode the data in a row instead of objects.
Like UnsafeRow uses raw memory bytes to encode the data in a row instead of objects. The difference is that it pays attention to endianness to use a consistent endian format (little-endian) so is suitable for storage.
Contains the main code for SerializedRow
and SerializedArray
.
Helper functions for converting between internal and external date and time representations.
Helper functions for converting between internal and external date and time representations. Dates are exposed externally as java.sql.Date and are represented internally as the number of dates since the Unix epoch (1970-01-01). Timestamps are exposed externally as java.sql.Timestamp and are stored internally as longs, which are capable of storing timestamps with 100 nanosecond precision.
Build a map with String type of key, and it also supports either key case sensitive or insensitive.
Helper functions to check for valid data types.
Silences output to stderr or stdout for the duration of f