Caches the data produced by the logical representation of the given Dataset.
Caches the data produced by the logical representation of the given Dataset.
Unlike RDD.cache()
, the default storage level is set to be MEMORY_AND_DISK
because
recomputing the in-memory columnar representation of the underlying table is expensive.
Clears all cached tables.
Checks if the cache is empty.
Optionally returns cached data for the given LogicalPlan.
Optionally returns cached data for the given Dataset
Tries to re-cache all the cache entries that contain resourcePath
in one or more
HadoopFsRelation
node(s) as part of its logical plan.
Tries to re-cache all the cache entries that refer to the given plan.
Un-cache all the cache entries that refer to the given plan.
Un-cache all the cache entries that refer to the given plan.
Replaces segments of the given logical plan with cached versions where possible.
Provides support in a SQLContext for caching query results and automatically using these cached results when subsequent queries are executed. Data is cached using byte buffers stored in an InMemoryRelation. This relation is automatically substituted query plans that return the
sameResult
as the originally cached query.Internal to Spark SQL.