expressions

Type Members

case class Abs(child: Expression) extends UnaryExpression with ExpectsInputTypes with NullIntolerant with Product with Serializable

A function that gets the absolute value of a numeric value.

Annotations
@ExpressionDescription()
case class Acos(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Add(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class AddMonths(startDate: Expression, numMonths: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the date that is num_months after start_date.

Annotations
@ExpressionDescription()
abstract class AggregateWindowFunction extends DeclarativeAggregate with WindowFunction
case class Alias(child: Expression, name: String)(exprId: ExprId = NamedExpression.newExprId, qualifier: Option[String] = None, explicitMetadata: Option[Metadata] = None, isGenerated: Boolean = false) extends UnaryExpression with NamedExpression with Product with Serializable

Used to assign a new name to a computation. For example, the SQL expression "1 + 1 AS a" could be represented as follows: Alias(Add(Literal(1), Literal(1)), "a")()
Note that exprId and qualifier are in a separate parameter list because we only pattern match on child and name.
child
The computation being performed
name
The name to be associated with the result of computing child.
exprId
A globally unique id used to check if an AttributeReference refers to this alias. Auto-assigned if left blank.
qualifier
An optional string that can be used to refer to this attribute in a fully qualified way. Consider the examples tableName.name, subQueryAlias.name. tableName and subQueryAlias are possible qualifiers.
explicitMetadata
Explicit metadata associated with this alias that overwrites child's.
isGenerated
A flag to indicate if this alias is generated by Catalyst
case class And(left: Expression, right: Expression) extends BinaryOperator with Predicate with Product with Serializable

Annotations
@ExpressionDescription()
case class ArrayContains(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Checks if the array (left) has the element (right)

Annotations
@ExpressionDescription()
case class Ascii(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the numeric value of the first character of str.

Annotations
@ExpressionDescription()
case class Asin(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class AssertTrue(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that throws an exception if 'condition' is not true.

Annotations
@ExpressionDescription()
case class AtLeastNNonNulls(n: Int, children: Seq[Expression]) extends Expression with Predicate with Product with Serializable

A predicate that is evaluated to be true if there are at least n non-null and non-NaN values.
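The counting rule above can be sketched in plain Scala without Catalyst. The object and method names here are hypothetical, and the sketch works on already-evaluated values rather than on Expression children.

```scala
object AtLeastNNonNullsSketch {
  // Hypothetical helper: counts the values that are neither null nor NaN
  // and checks whether that count reaches n.
  def atLeastNNonNulls(n: Int, values: Seq[Any]): Boolean = {
    val valid = values.count {
      case null      => false
      case d: Double => !d.isNaN
      case f: Float  => !f.isNaN
      case _         => true
    }
    valid >= n
  }
}
```

With the input Seq(1, null, Double.NaN, "a"), only two values count as valid, so the predicate holds for n = 2 but not for n = 3.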
case class Atan(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Atan2(left: Expression, right: Expression) extends BinaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
abstract class Attribute extends LeafExpression with NamedExpression with NullIntolerant
class AttributeEquals extends AnyRef

Attributes
protected
class AttributeMap[A] extends Map[Attribute, A] with Serializable
case class AttributeReference(name: String, dataType: DataType, nullable: Boolean = true, metadata: Metadata = Metadata.empty)(exprId: ExprId = NamedExpression.newExprId, qualifier: Option[String] = None, isGenerated: Boolean = false) extends Attribute with Unevaluable with Product with Serializable

A reference to an attribute produced by another operator in the tree.
name
The name of this attribute, should only be used during analysis or for debugging.
dataType
The DataType of this attribute.
nullable
True if null is a valid value for this attribute.
metadata
The metadata of this attribute.
exprId
A globally unique id used to check if different AttributeReferences refer to the same attribute.
qualifier
An optional string that can be used to refer to this attribute in a fully qualified way. Consider the examples tableName.name, subQueryAlias.name. tableName and subQueryAlias are possible qualifiers.
isGenerated
A flag to indicate if this reference is generated by Catalyst
implicit class AttributeSeq extends Serializable

Helper functions for working with Seq[Attribute].
class AttributeSet extends Traversable[Attribute] with Serializable

A Set designed to hold AttributeReference objects, that performs equality checking using expression id instead of standard Java equality. Using expression id means that these sets will correctly test for membership, even when the AttributeReferences in question differ cosmetically (e.g., the names have different capitalizations).
Note that we do not override equality for Attribute references as it is really weird when AttributeReference("a"...) == AttributeReference("b", ...). This tactic leads to broken tests, and also makes doing transformations hard (we always try to keep older trees instead of new ones when the transformation was a no-op).
case class BRound(child: Expression, scale: Expression) extends RoundBase with Serializable with ImplicitCastInputTypes with Product

Round an expression to d decimal places using HALF_EVEN rounding mode, also known as Gaussian rounding or bankers' rounding. For example, bround(2.5) = 2.0 and bround(3.5) = 4.0.
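HALF_EVEN rounding can be tried out directly with java.math.BigDecimal. This is a minimal sketch of the rounding mode only, not of the Catalyst expression; the object and method names are hypothetical, and the value is passed as a string to avoid binary floating-point surprises.

```scala
import java.math.{BigDecimal => JBigDecimal, RoundingMode}

object BRoundSketch {
  // Rounds to `scale` decimal places; ties go to the neighbour whose last
  // digit is even (HALF_EVEN).
  def bround(value: String, scale: Int): JBigDecimal =
    new JBigDecimal(value).setScale(scale, RoundingMode.HALF_EVEN)
}
```

Under this mode 2.5 rounds down to 2 and 3.5 rounds up to 4, whereas HALF_UP would give 3 and 4.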

Annotations
@ExpressionDescription()
case class Base64(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Converts the argument from binary to a base 64 string.

Annotations
@ExpressionDescription()
trait BaseGenericInternalRow extends InternalRow

An extended version of InternalRow that implements all special getters, toString and equals/hashCode by genericGet.
case class Bin(child: Expression) extends UnaryExpression with Serializable with ImplicitCastInputTypes with Product

Annotations
@ExpressionDescription()
abstract class BinaryArithmetic extends BinaryOperator
abstract class BinaryComparison extends BinaryOperator with Predicate
abstract class BinaryExpression extends Expression

An expression with two inputs and one output. The output is by default evaluated to null if any input is evaluated to null.
abstract class BinaryMathExpression extends BinaryExpression with Serializable with ImplicitCastInputTypes

A binary expression specifically for math functions that take two Doubles as input and return a Double.
abstract class BinaryOperator extends BinaryExpression with ExpectsInputTypes

A BinaryExpression that is an operator, with two properties:
1. The string representation is "x symbol y", rather than "funcName(x, y)".
2. The two inputs are expected to be of the same type. If the two inputs have different types, the analyzer will find the tightest common type and do the proper type casting.
case class BitwiseAnd(left: Expression, right: Expression) extends BinaryArithmetic with Product with Serializable

A function that calculates bitwise and(&) of two numbers.
Code generation inherited from BinaryArithmetic.

Annotations
@ExpressionDescription()
case class BitwiseNot(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

A function that calculates bitwise not(~) of a number.

Annotations
@ExpressionDescription()
case class BitwiseOr(left: Expression, right: Expression) extends BinaryArithmetic with Product with Serializable

A function that calculates bitwise or(|) of two numbers.
Code generation inherited from BinaryArithmetic.

Annotations
@ExpressionDescription()
case class BitwiseXor(left: Expression, right: Expression) extends BinaryArithmetic with Product with Serializable

A function that calculates bitwise xor of two numbers.
Code generation inherited from BinaryArithmetic.

Annotations
@ExpressionDescription()
case class BoundReference(ordinal: Int, dataType: DataType, nullable: Boolean) extends LeafExpression with Product with Serializable

A bound reference points to a specific slot in the input tuple, allowing the actual value to be retrieved more efficiently. However, since operations like column pruning can change the layout of intermediate tuples, BindReferences should be run after all such transformations.
case class CallMethodViaReflection(children: Seq[Expression]) extends Expression with CodegenFallback with Product with Serializable

An expression that invokes a method on a class via reflection.
For now, only types defined in Reflect.typeMapping are supported (basically primitives and string) as input types, and the output is turned automatically to a string.
Note that unlike Hive's reflect function, this expression calls only static methods (i.e. does not support calling non-static methods).
We should also look into how to consolidate this expression with org.apache.spark.sql.catalyst.expressions.objects.StaticInvoke in the future.
children
the first element should be a literal string for the class name, and the second element should be a literal string for the method name, and the remaining are input arguments to the Java method.

Annotations
@ExpressionDescription()
case class CaseWhen(branches: Seq[(Expression, Expression)], elseValue: Option[Expression] = None) extends CaseWhenBase with CodegenFallback with Serializable with Product

Case statements of the form "CASE WHEN a THEN b [WHEN c THEN d]* [ELSE e] END". When a = true, returns b; when c = true, returns d; else returns e.
branches
seq of (branch condition, branch value)
elseValue
optional value for the else branch
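The branch selection described above can be modelled on already-evaluated values. This is only a sketch: the names are hypothetical, conditions and values are plain Scala values instead of Expressions, and returning None when nothing matches and no else value is given is an assumption that stands in for SQL null.

```scala
object CaseWhenSketch {
  // Returns the value of the first branch whose condition is true,
  // otherwise the else value, otherwise None.
  def caseWhen[A](branches: Seq[(Boolean, A)], elseValue: Option[A] = None): Option[A] =
    branches.collectFirst { case (true, value) => value }.orElse(elseValue)
}
```

For example, caseWhen(Seq((false, "b"), (true, "d")), Some("e")) yields Some("d").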

Annotations
@ExpressionDescription()
abstract class CaseWhenBase extends Expression with Serializable

Abstract parent class for common logic in CaseWhen and CaseWhenCodegen.
case class CaseWhenCodegen(branches: Seq[(Expression, Expression)], elseValue: Option[Expression] = None) extends CaseWhenBase with Serializable with Product

CaseWhen expression used when the code generation condition is satisfied. The OptimizeCodegen optimizer rule replaces CaseWhen with CaseWhenCodegen.
branches
seq of (branch condition, branch value)
elseValue
optional value for the else branch
case class Cast(child: Expression, dataType: DataType) extends UnaryExpression with NullIntolerant with Product with Serializable

Cast the child expression to the target data type.

Annotations
@ExpressionDescription()
case class Cbrt(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Ceil(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class CheckOverflow(child: Expression, dataType: DecimalType) extends UnaryExpression with Product with Serializable

Rounds the decimal to the given scale and checks whether the decimal fits in the provided precision; returns null if it does not.
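A rough model of this check with java.math.BigDecimal follows. The names are hypothetical, None stands in for null, and the HALF_UP rounding mode is an assumption of this sketch rather than something stated here.

```scala
import java.math.{BigDecimal => JBigDecimal, RoundingMode}

object CheckOverflowSketch {
  // Rounds to `scale` fractional digits, then checks that the total number
  // of digits does not exceed `precision`. HALF_UP is assumed.
  def checkOverflow(value: JBigDecimal, precision: Int, scale: Int): Option[JBigDecimal] = {
    val rounded = value.setScale(scale, RoundingMode.HALF_UP)
    if (rounded.precision <= precision) Some(rounded) else None
  }
}
```

Note that rounding itself can push a value over the limit: 999.995 at scale 2 becomes 1000.00, which needs six digits of precision.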
case class Coalesce(children: Seq[Expression]) extends Expression with Product with Serializable

An expression that is evaluated to the first non-null input.
```
coalesce(1, 2) => 1
coalesce(null, 1, 2) => 1
coalesce(null, null, 2) => 2
coalesce(null, null, null) => null
```
Annotations
@ExpressionDescription()
case class Concat(children: Seq[Expression]) extends Expression with ImplicitCastInputTypes with Product with Serializable

An expression that concatenates multiple input strings into a single string. If any input is null, concat returns null.

Annotations
@ExpressionDescription()
case class ConcatWs(children: Seq[Expression]) extends Expression with ImplicitCastInputTypes with Product with Serializable

An expression that concatenates multiple input strings or arrays of strings into a single string, using a given separator (the first child).
Returns null if the separator is null. Otherwise, concat_ws skips all null values.
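The difference in null handling between concat and concat_ws can be sketched with Option, where None stands in for SQL null. The names are hypothetical and array inputs are not modelled.

```scala
object ConcatSketch {
  // concat: any null input makes the whole result null.
  def concat(inputs: Seq[Option[String]]): Option[String] =
    if (inputs.exists(_.isEmpty)) None else Some(inputs.flatten.mkString)

  // concat_ws: a null separator gives null; null inputs are skipped.
  def concatWs(sep: Option[String], inputs: Seq[Option[String]]): Option[String] =
    sep.map(s => inputs.flatten.mkString(s))
}
```

For the inputs "a", null, "b", concat yields None while concatWs with separator "-" yields Some("a-b").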

Annotations
@ExpressionDescription()
case class Contains(left: Expression, right: Expression) extends BinaryExpression with StringPredicate with Product with Serializable

A function that returns true if the string left contains the string right.
case class Conv(numExpr: Expression, fromBaseExpr: Expression, toBaseExpr: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

Converts a number from one base to another.
numExpr
the number to be converted
fromBaseExpr
from which base
toBaseExpr
to which base
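For non-negative inputs that fit in a Long, the conversion can be approximated with the JDK radix helpers. This sketch uses hypothetical names and does not model negative bases, overflow, or invalid digits; upper-casing the result is an assumption.

```scala
object ConvSketch {
  // Parses `num` in `fromBase` and renders it in `toBase`.
  def conv(num: String, fromBase: Int, toBase: Int): String =
    java.lang.Long.toString(java.lang.Long.parseLong(num, fromBase), toBase).toUpperCase
}
```

For example, conv("100", 2, 10) gives "4" and conv("255", 10, 16) gives "FF".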

Annotations
@ExpressionDescription()
case class Cos(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Cosh(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Crc32(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that computes a cyclic redundancy check value and returns it as a bigint, for input of type BinaryType.
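The same kind of checksum is available in the JDK as java.util.zip.CRC32. The wrapper below is a minimal sketch with hypothetical names; it takes raw bytes, matching the BinaryType input, and returns a Long.

```scala
import java.util.zip.CRC32

object Crc32Sketch {
  // Computes the CRC-32 checksum of the given bytes.
  def crc32(bytes: Array[Byte]): Long = {
    val checksum = new CRC32
    checksum.update(bytes, 0, bytes.length)
    checksum.getValue
  }
}
```

The standard check value for the ASCII bytes of "123456789" is 0xCBF43926.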

Annotations
@ExpressionDescription()
case class CreateArray(children: Seq[Expression]) extends Expression with Product with Serializable

Returns an Array containing the evaluation of all children expressions.

Annotations
@ExpressionDescription()
case class CreateMap(children: Seq[Expression]) extends Expression with Product with Serializable

Returns a catalyst Map containing the evaluation of all children expressions as keys and values. The children are a flattened sequence of key-value pairs, e.g. (key1, value1, key2, value2, ...)

Annotations
@ExpressionDescription()
case class CreateNamedStruct(children: Seq[Expression]) extends Expression with CreateNamedStructLike with Product with Serializable

Creates a struct with the given field names and values
children
Seq(name1, val1, name2, val2, ...)

Annotations
@ExpressionDescription()
trait CreateNamedStructLike extends Expression

Common base class for both CreateNamedStruct and CreateNamedStructUnsafe.
case class CreateNamedStructUnsafe(children: Seq[Expression]) extends Expression with CreateNamedStructLike with Product with Serializable

Creates a struct with the given field names and values. This is a variant that returns UnsafeRow directly. The unsafe projection operator replaces CreateStruct with this expression automatically at runtime.
children
Seq(name1, val1, name2, val2, ...)
case class Cube(groupByExprs: Seq[Expression]) extends Expression with GroupingSet with Product with Serializable
case class CumeDist() extends RowNumberLike with SizeBasedWindowFunction with Product with Serializable

The CumeDist function computes the position of a value relative to all values in the partition. The result is the number of rows preceding or equal to the current row in the ordering of the partition divided by the total number of rows in the window partition. Any tie values in the ordering will evaluate to the same position.
This documentation has been based upon similar documentation for the Hive and Presto projects.
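The formula above (rows preceding or equal to the current row, divided by the partition size) can be sketched over the ordering values of a single partition. The names are hypothetical and the ordering values are assumed to be integers.

```scala
object CumeDistSketch {
  // For each value, the fraction of rows in the partition whose ordering
  // value is less than or equal to it. Ties get the same result.
  def cumeDist(values: Seq[Int]): Seq[Double] = {
    val total = values.size.toDouble
    values.map(v => values.count(_ <= v) / total)
  }
}
```

For the partition 10, 20, 20, 30 this gives 0.25, 0.75, 0.75, 1.0.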

Annotations
@ExpressionDescription()
case class CurrentBatchTimestamp(timestampMs: Long, dataType: DataType) extends LeafExpression with Nondeterministic with CodegenFallback with Product with Serializable

Expression representing the current batch time, which is used by StreamExecution to:
1. prevent the optimizer from pushing this expression below a stateful operator
2. allow IncrementalExecution to substitute this expression with a Literal(timestamp)
There is no code generation since this expression should be replaced with a literal.
case class CurrentDatabase() extends LeafExpression with Unevaluable with Product with Serializable

Returns the current database of the SessionCatalog.

Annotations
@ExpressionDescription()
case class CurrentDate() extends LeafExpression with CodegenFallback with Product with Serializable

Returns the current date at the start of query evaluation. All calls of current_date within the same query return the same value.
There is no code generation since this expression should get constant folded by the optimizer.

Annotations
@ExpressionDescription()
case class CurrentTimestamp() extends LeafExpression with CodegenFallback with Product with Serializable

Returns the current timestamp at the start of query evaluation. All calls of current_timestamp within the same query return the same value.
There is no code generation since this expression should get constant folded by the optimizer.

Annotations
@ExpressionDescription()
case class DateAdd(startDate: Expression, days: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Adds a number of days to startDate.

Annotations
@ExpressionDescription()
case class DateDiff(endDate: Expression, startDate: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the number of days from startDate to endDate.

Annotations
@ExpressionDescription()
case class DateFormatClass(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class DateSub(startDate: Expression, days: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Subtracts a number of days from startDate.
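The day arithmetic of DateAdd, DateDiff and DateSub can be illustrated with java.time.LocalDate. This is a sketch with hypothetical names that mirrors only the argument order of the expressions; it does not model implicit casts or null handling.

```scala
import java.time.LocalDate
import java.time.temporal.ChronoUnit

object DateArithmeticSketch {
  def dateAdd(startDate: LocalDate, days: Int): LocalDate = startDate.plusDays(days.toLong)

  def dateSub(startDate: LocalDate, days: Int): LocalDate = startDate.minusDays(days.toLong)

  // Number of days from startDate to endDate; negative if endDate is earlier.
  def dateDiff(endDate: LocalDate, startDate: LocalDate): Long =
    ChronoUnit.DAYS.between(startDate, endDate)
}
```

Note that dateDiff takes the end date first, as in the DateDiff signature.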

Annotations
@ExpressionDescription()
case class DayOfMonth(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class DayOfYear(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class Decode(bin: Expression, charset: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Decodes the first argument into a String using the provided character set (one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16'). If either argument is null, the result will also be null.

Annotations
@ExpressionDescription()
case class DenseRank(children: Seq[Expression]) extends RankLike with Product with Serializable

The DenseRank function computes the rank of a value in a group of values. The result is one plus the previously assigned rank value. Unlike Rank, DenseRank will not produce gaps in the ranking sequence.
This documentation has been based upon similar documentation for the Hive and Presto projects.
children
expressions to base the rank on; a change in the value of one of the children will trigger a change in rank. This is an internal parameter and will be assigned by the Analyzer.
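The difference between a dense rank and a rank with gaps can be sketched over an already-sorted partition. The names are hypothetical, and the input is assumed to be in partition order.

```scala
object RankSketch {
  // Dense rank: one plus the previous rank whenever the value changes.
  def denseRank[A](sorted: Seq[A]): Seq[Int] =
    if (sorted.isEmpty) Seq.empty
    else sorted.zip(sorted.tail).scanLeft(1) {
      case (rank, (prev, cur)) => if (cur == prev) rank else rank + 1
    }

  // Rank with gaps: position of the first row holding the same value.
  def rank[A](sorted: Seq[A]): Seq[Int] =
    sorted.map(v => sorted.indexOf(v) + 1)
}
```

For the sorted values 10, 20, 20, 30 the dense rank is 1, 2, 2, 3, while the rank with gaps is 1, 2, 2, 4.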

Annotations
@ExpressionDescription()
final class DirectStringConsumer extends MemoryConsumer
case class Divide(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class DynamicFoldableExpression(expr: Expression) extends UnaryExpression with DynamicReplacableConstant with KryoSerializable with Product with Serializable

Wrap any TokenizedLiteral expression with this so that we can invoke literal initialization code within the .init() method of the generated class.

This pushes itself as reference object and uses a call to eval() on itself for actual evaluation and avoids embedding any generated code. This allows it to keep the generated code identical regardless of the constant expression (and in addition DynamicReplacableConstant trait casts to itself rather than actual object type).

We try to locate the first foldable expression in a query tree such that all of its children are foldable but its parent is not. That way we locate the exact point where an expression is safe to evaluate once instead of once per row.

Queries such as select c from tab where case col2 when 1 then col3 else 'y' end = 22 do not convert literal evaluation into the init method.
expr
minimal expression tree that can be evaluated only once and turned into a constant.
case class DynamicInSet(child: Expression, hset: IndexedSeq[Expression]) extends UnaryExpression with Predicate with Product with Serializable

Unlike Spark's InSet expression, this allows for TokenizedLiterals that can change dynamically in executions.
trait DynamicReplacableConstant extends Expression
case class Elt(children: Seq[Expression]) extends Expression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class Encode(value: Expression, charset: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Encodes the first argument into a BINARY using the provided character set (one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16'). If either argument is null, the result will also be null.
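Encode and its counterpart Decode map onto the JDK charset conversions. The sketch below uses hypothetical names and models null arguments with Option, so a missing value or a missing charset yields None.

```scala
import java.nio.charset.Charset

object EncodeDecodeSketch {
  // String -> bytes in the named character set.
  def encode(value: Option[String], charset: Option[String]): Option[Array[Byte]] =
    for (v <- value; cs <- charset) yield v.getBytes(Charset.forName(cs))

  // Bytes -> String in the named character set.
  def decode(bin: Option[Array[Byte]], charset: Option[String]): Option[String] =
    for (b <- bin; cs <- charset) yield new String(b, Charset.forName(cs))
}
```

A round trip through the same character set returns the original string, and "abc" encoded as UTF-16BE occupies six bytes.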

Annotations
@ExpressionDescription()
case class EndsWith(left: Expression, right: Expression) extends BinaryExpression with StringPredicate with Product with Serializable

A function that returns true if the string left ends with the string right.
case class EqualNullSafe(left: Expression, right: Expression) extends BinaryComparison with Product with Serializable

Annotations
@ExpressionDescription()
case class EqualTo(left: Expression, right: Expression) extends BinaryComparison with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
class EquivalentExpressions extends AnyRef

This class is used to compute equality of (sub)expression trees. Expressions can be added to this class and subsequently queried for expression equality. Expression trees are considered equal if for the same input(s), the same result is produced.
case class EulerNumber() extends LeafMathExpression with Product with Serializable

Euler's number. Note that there is no code generation because this is only evaluated by the optimizer during constant folding.

Annotations
@ExpressionDescription()
case class Exists(plan: LogicalPlan, exprId: ExprId = NamedExpression.newExprId) extends SubqueryExpression with Predicate with Unevaluable with Product with Serializable

The Exists expression checks if a row exists in a subquery given some correlated condition. For example (SQL):
```
SELECT  *
FROM    a
WHERE   EXISTS (SELECT  *
                FROM    b
                WHERE   b.id = a.id)
```
case class Exp(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
trait ExpectsInputTypes extends Expression

A trait that gets mixed in to define the expected input types of an expression.
This trait is typically used by operator expressions (e.g. Add, Subtract) to define expected input types without any implicit casting.
Most function expressions (e.g. Substring) should extend ImplicitCastInputTypes instead.
case class Explode(child: Expression) extends ExplodeBase with Product with Serializable

Given an input array, produces a sequence of rows, one for each value in the array.
```
SELECT explode(array(10,20)) ->
10
20
```
Annotations
@ExpressionDescription()
abstract class ExplodeBase extends UnaryExpression with Generator with CodegenFallback with Serializable

A base class for Explode and PosExplode
case class Expm1(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class ExprId(id: Long, jvmId: UUID) extends Product with Serializable

A globally unique id for a given named expression. Used to identify which attribute output by a relation is being referenced in a subsequent computation.
The id field is unique within a given JVM, while the jvmId UUID is used to uniquely identify JVMs.
abstract class Expression extends TreeNode[Expression]

An expression in Catalyst.
If an expression wants to be exposed in the function registry (so users can call it with "name(arguments...)"), the concrete implementation must be a case class whose constructor arguments are all of type Expression. See Substring for an example.
There are a few important traits:
- Nondeterministic: an expression that is not deterministic.
- Unevaluable: an expression that is not supposed to be evaluated.
- CodegenFallback: an expression that does not have code gen implemented and falls back to interpreted mode.
- LeafExpression: an expression that has no child.
- UnaryExpression: an expression that has one child.
- BinaryExpression: an expression that has two children.
- TernaryExpression: an expression that has three children.
- BinaryOperator: a special case of BinaryExpression that requires two children to have the same output data type.
class ExpressionDescription extends Annotation with Annotation with ClassfileAnnotation
class ExpressionInfo extends AnyRef
class ExpressionSet extends Set[Expression]

A Set where membership is determined based on a canonical representation of an Expression (i.e. one that attempts to ignore cosmetic differences). See Canonicalize for more details.
Internally this set uses the canonical representation, but also keeps track of the original expressions to ease debugging. Since different expressions can share the same canonical representation, this means that operations that extract expressions from this set are only guaranteed to see at least one such expression. For example:
```
val set = ExpressionSet(Seq(a + 1, 1 + a))

set.iterator => Iterator(a + 1)
set.contains(a + 1) => true
set.contains(1 + a) => true
set.contains(a + 2) => false
```
trait ExtractValue extends Expression
case class Factorial(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class FindInSet(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that returns the index (1-based) of the given string (left) in the comma-delimited list (right). Returns 0 if the string wasn't found or if the given string (left) contains a comma.
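These rules can be sketched in a few lines of plain Scala. The names are hypothetical and null handling is not modelled.

```scala
object FindInSetSketch {
  // 1-based index of `left` among the comma-separated items of `right`;
  // 0 if it is absent or if `left` itself contains a comma.
  def findInSet(left: String, right: String): Int =
    if (left.contains(",")) 0
    else right.split(",", -1).indexOf(left) + 1
}
```

For example, findInSet("ab", "abc,b,ab,c,def") returns 3.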

Annotations
@ExpressionDescription()
final class FixedLengthRowBasedKeyValueBatch extends RowBasedKeyValueBatch
case class Floor(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class FormatNumber(x: Expression, d: Expression) extends BinaryExpression with ExpectsInputTypes with Product with Serializable

Formats the number X to a format like '#,###,###.##', rounded to D decimal places, and returns the result as a string. If D is 0, the result has no decimal point or fractional part.
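A comparable result can be produced with java.text.DecimalFormat. This is a sketch under assumptions: the names are hypothetical, the US locale is fixed so that the grouping and decimal separators are predictable, and DecimalFormat's default rounding mode (HALF_EVEN) is used.

```scala
import java.text.{DecimalFormat, DecimalFormatSymbols}
import java.util.Locale

object FormatNumberSketch {
  // Groups digits in threes and keeps exactly d fractional digits.
  def formatNumber(x: Double, d: Int): String = {
    require(d >= 0, "d must be non-negative")
    val pattern = if (d == 0) "#,##0" else "#,##0." + ("0" * d)
    new DecimalFormat(pattern, DecimalFormatSymbols.getInstance(Locale.US)).format(x)
  }
}
```

For example, formatNumber(12332.123456, 4) returns "12,332.1235", and with d = 0 the result has no decimal point.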

Annotations
@ExpressionDescription()
case class FormatString(children: Expression*) extends Expression with ImplicitCastInputTypes with Product with Serializable

Returns the input formatted according to printf-style format strings.

Annotations
@ExpressionDescription()
sealed trait FrameBoundary extends AnyRef

The trait used to represent the type of a Window Frame Boundary.
sealed trait FrameType extends AnyRef

The trait used to represent the type of a Window Frame.
case class FromUTCTimestamp(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Given a timestamp, which corresponds to a certain time of day in UTC, returns another timestamp that corresponds to the same time of day in the given timezone.
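The shift can be illustrated with java.time by treating a zone-less timestamp as UTC and reading the same instant in the target zone. This is a sketch with hypothetical names; it does not model null handling or invalid zone names.

```scala
import java.time.{LocalDateTime, ZoneId, ZoneOffset}

object FromUtcTimestampSketch {
  // Interprets `utc` as a UTC wall-clock time and returns the wall-clock
  // time of the same instant in `zone`.
  def fromUtcTimestamp(utc: LocalDateTime, zone: String): LocalDateTime =
    utc.atOffset(ZoneOffset.UTC).atZoneSameInstant(ZoneId.of(zone)).toLocalDateTime
}
```

Midnight UTC on 2016-08-31 corresponds to 09:00 on the same day in Asia/Seoul.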

Annotations
@ExpressionDescription()
case class FromUnixTime(sec: Expression, format: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string representing the timestamp of that moment in the current system time zone in the given format. If the format is missing, a format like "1970-01-01 00:00:00" is used. Note that the Hive Language Manual says it returns 0 on failure, but in fact it returns null.
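The conversion can be sketched with `java.time`. This is only an approximation under stated assumptions: `FromUnixTimeSketch` is a hypothetical name, the pattern syntax is that of `DateTimeFormatter` (which may differ from what the expression accepts), and the time zone is passed explicitly so the result does not depend on the machine.

```scala
import java.time.{Instant, ZoneId}
import java.time.format.DateTimeFormatter

// Hypothetical helper, not the Catalyst implementation.
object FromUnixTimeSketch {
  def fromUnixTime(
      sec: Long,
      format: String = "yyyy-MM-dd HH:mm:ss",
      zone: ZoneId = ZoneId.systemDefault()): String =
    // Interpret sec as seconds since the epoch and render it in the given zone.
    DateTimeFormatter.ofPattern(format).withZone(zone).format(Instant.ofEpochSecond(sec))
}
```

Calling `fromUnixTime(0L, zone = java.time.ZoneOffset.UTC)` renders the epoch itself in the default format.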

Annotations
@ExpressionDescription()
trait Generator extends Expression

An expression that produces zero or more rows given a single input row.
Generators produce multiple output rows instead of a single value like other expressions, and thus they must have a schema to associate with the rows that are output.
However, unlike row-producing relational operators, which are either leaves or determine their output schema functionally from their input, generators can contain other expressions that might result in their modification by rules. This structure means that they might be copied multiple times after first determining their output schema. If a new output schema were created for each copy, references up the tree might be rendered invalid. As a result, generators must instead define a function makeOutput, which is called only once when the schema is first requested. The attributes produced by this function will be automatically copied anytime rules result in changes to the Generator or its children.
class GenericInternalRow extends InternalRow with BaseGenericInternalRow

An internal row implementation that uses an array of objects as the underlying storage. Note that, while the array is not copied, and thus could technically be mutated after creation, this is not allowed.
class GenericRow extends Row

A row implementation that uses an array of objects as the underlying storage. Note that, while the array is not copied, and thus could technically be mutated after creation, this is not allowed.
class GenericRowWithSchema extends GenericRow
case class GetArrayItem(child: Expression, ordinal: Expression) extends BinaryExpression with ExpectsInputTypes with ExtractValue with Product with Serializable

Returns the field at ordinal in the Array child.
We need to do type checking here as the ordinal expression may be unresolved.
case class GetArrayStructFields(child: Expression, field: StructField, ordinal: Int, numFields: Int, containsNull: Boolean) extends UnaryExpression with ExtractValue with Product with Serializable

For a child whose data type is an array of structs, extracts the ordinal-th fields of all array elements, and returns them as a new array.
No need to do type checking since it is handled by ExtractValue.
case class GetJsonObject(json: Expression, path: Expression) extends BinaryExpression with ExpectsInputTypes with CodegenFallback with Product with Serializable

Extracts a JSON object from a JSON string based on the specified JSON path, and returns the extracted object as a JSON string. It will return null if the input JSON string is invalid.

Annotations
@ExpressionDescription()
case class GetMapValue(child: Expression, key: Expression) extends BinaryExpression with ImplicitCastInputTypes with ExtractValue with Product with Serializable

Returns the value of key key in Map child.
We need to do type checking here as the key expression may be unresolved.
case class GetStructField(child: Expression, ordinal: Int, name: Option[String] = None) extends UnaryExpression with ExtractValue with Product with Serializable

Returns the value of fields in the Struct child.
No need to do type checking since it is handled by ExtractValue.
Note that we can pass in the field name directly to preserve its case in toString. For example, when getting field yEAr from <year: int, month: int>, we should pass in yEAr.
case class GreaterThan(left: Expression, right: Expression) extends BinaryComparison with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class GreaterThanOrEqual(left: Expression, right: Expression) extends BinaryComparison with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class Greatest(children: Seq[Expression]) extends Expression with Product with Serializable

A function that returns the greatest value of all parameters, skipping null values. It takes at least 2 parameters, and returns null iff all parameters are null.
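The null-skipping behaviour can be modelled in plain Scala, with `None` standing in for SQL null. `GreatestSketch` is a hypothetical name used only for illustration; the real expression works on Catalyst values rather than `Option[Int]`.

```scala
// Hypothetical model of the semantics, not the Catalyst implementation.
object GreatestSketch {
  def greatest(values: Option[Int]*): Option[Int] = {
    require(values.length >= 2, "greatest takes at least 2 parameters")
    // flatten drops the nulls (None); reduceOption yields None when nothing is left.
    values.flatten.reduceOption(_ max _)
  }
}
```

The same shape with `_ min _` would model Least.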

Annotations
@ExpressionDescription()
case class Grouping(child: Expression) extends Expression with Unevaluable with Product with Serializable

Indicates whether a specified column expression in a GROUP BY list is aggregated or not. GROUPING returns 1 for aggregated or 0 for not aggregated in the result set.
case class GroupingID(groupByExprs: Seq[Expression]) extends Expression with Unevaluable with Product with Serializable

GroupingID is a function that computes the level of grouping.
If groupByExprs is empty, it means all grouping expressions in GroupingSets.
trait GroupingSet extends Expression with CodegenFallback

A placeholder expression for cube/rollup, which will be replaced by the analyzer.
abstract class HashExpression[E] extends Expression

A function that calculates a hash value for a group of expressions. Note that the seed argument is not exposed to users and should only be set inside Spark SQL.
The hash value for an expression depends on its type and seed:
- null: seed
- boolean: turn boolean into int, 1 for true, 0 for false, and then use murmur3 to hash this int with seed.
- byte, short, int: use murmur3 to hash the input as int with seed.
- long: use murmur3 to hash the long input with seed.
- float: turn it into int: java.lang.Float.floatToIntBits(input), and hash it.
- double: turn it into long: java.lang.Double.doubleToLongBits(input), and hash it.
- decimal: if it's a small decimal, i.e. precision <= 18, turn it into long and hash it. Else, turn it into bytes and hash it.
- calendar interval: hash microseconds first, and use the result as seed to hash months.
- binary: use murmur3 to hash the bytes with seed.
- string: get the bytes of string and hash it.
- array: The result starts with seed, then use result as seed, recursively calculate hash value for each element, and assign the element hash value to result.
- map: The result starts with seed, then use result as seed, recursively calculate hash value for each key-value, and assign the key-value hash value to result.
- struct: The result starts with seed, then use result as seed, recursively calculate hash value for each field, and assign the field hash value to result.
Finally, we aggregate the hash values for each expression in the same way as for a struct.
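The seed-chaining scheme in the list above can be sketched with the Scala standard library. This is an illustration of the structure only: `HashSketch` is a hypothetical name, the mixing functions come from `scala.util.hashing.MurmurHash3`, and no claim is made that the resulting values match those produced by Spark. Decimal, calendar interval, map and struct cases are omitted for brevity.

```scala
import java.nio.charset.StandardCharsets
import scala.util.hashing.MurmurHash3

// Hypothetical sketch of the type-dependent, seed-chained hashing scheme.
object HashSketch {
  def hashInt(i: Int, seed: Int): Int =
    MurmurHash3.finalizeHash(MurmurHash3.mix(seed, i), 4)

  def hashLong(l: Long, seed: Int): Int = {
    val low = l.toInt
    val high = (l >>> 32).toInt
    MurmurHash3.finalizeHash(MurmurHash3.mix(MurmurHash3.mix(seed, low), high), 8)
  }

  def hash(value: Any, seed: Int): Int = value match {
    case null       => seed                                   // null: the seed itself
    case b: Boolean => hashInt(if (b) 1 else 0, seed)         // boolean -> int
    case b: Byte    => hashInt(b.toInt, seed)
    case s: Short   => hashInt(s.toInt, seed)
    case i: Int     => hashInt(i, seed)
    case l: Long    => hashLong(l, seed)
    case f: Float   => hashInt(java.lang.Float.floatToIntBits(f), seed)
    case d: Double  => hashLong(java.lang.Double.doubleToLongBits(d), seed)
    case a: Array[Byte] => MurmurHash3.bytesHash(a, seed)     // binary
    case s: String  => MurmurHash3.bytesHash(s.getBytes(StandardCharsets.UTF_8), seed)
    // array: each element is hashed with the running result as its seed
    case xs: Seq[_] => xs.foldLeft(seed)((result, x) => hash(x, result))
    case other      => throw new IllegalArgumentException(s"unsupported: $other")
  }
}
```

The `foldLeft` in the `Seq` case is the part worth noticing: the result of hashing one element becomes the seed for the next, which is also how the values of several expressions are combined.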
case class Hex(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

If the argument is an INT or binary, hex returns the number as a STRING in hexadecimal format. Otherwise, if the argument is a STRING, it converts each character into its hex representation and returns the resulting STRING. Negative numbers are treated as two's complement.
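A small model of the three cases, using only the JDK. `HexSketch` is a hypothetical name, and the choice of uppercase digits and UTF-8 string encoding are assumptions of this sketch.

```scala
import java.nio.charset.StandardCharsets

// Hypothetical helper, not the Catalyst implementation.
object HexSketch {
  // Integral input: two's-complement hexadecimal digits.
  def hex(n: Long): String = java.lang.Long.toHexString(n).toUpperCase

  // Binary input: two hex digits per byte.
  def hex(bytes: Array[Byte]): String = bytes.map(b => f"${b & 0xFF}%02X").mkString

  // String input: hex of each encoded byte.
  def hex(s: String): String = hex(s.getBytes(StandardCharsets.UTF_8))
}
```

Under these assumptions `HexSketch.hex(-1L)` produces sixteen `F` digits, which is the two's-complement reading mentioned above.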

Annotations
@ExpressionDescription()
case class HiveHash(children: Seq[Expression]) extends HashExpression[Int] with Product with Serializable

Simulates Hive's hashing function at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorUtils#hashcode() in Hive.
We should use this hash function for both shuffle and bucket of Hive tables, so that we can guarantee shuffle and bucketing have the same data distribution.
TODO: Support Decimal and date related types

Annotations
@ExpressionDescription()
class HiveHasher extends AnyRef
case class Hour(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class Hypot(left: Expression, right: Expression) extends BinaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class If(predicate: Expression, trueValue: Expression, falseValue: Expression) extends Expression with Product with Serializable

Annotations
@ExpressionDescription()
case class IfNull(left: Expression, right: Expression, child: Expression) extends UnaryExpression with RuntimeReplaceable with Product with Serializable

Annotations
@ExpressionDescription()
trait ImplicitCastInputTypes extends Expression with ExpectsInputTypes

A mixin for the analyzer to perform implicit type casting using org.apache.spark.sql.catalyst.analysis.TypeCoercion.ImplicitTypeCasts.
case class In(value: Expression, list: Seq[Expression]) extends Expression with Predicate with ImplicitCastInputTypes with Product with Serializable

Evaluates to true if list contains value.

Annotations
@ExpressionDescription()
case class InSet(child: Expression, hset: Set[Any]) extends UnaryExpression with Predicate with Product with Serializable

Optimized version of In clause, when all filter values of In clause are static.
case class InitCap(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns string, with the first letter of each word in uppercase, all other letters in lowercase. Words are delimited by whitespace.

Annotations
@ExpressionDescription()
case class Inline(child: Expression) extends UnaryExpression with Generator with CodegenFallback with Product with Serializable

Explodes an array of structs into a table.

Annotations
@ExpressionDescription()
case class InputFileName() extends LeafExpression with Nondeterministic with Product with Serializable

Expression that returns the name of the current file being read.

Annotations
@ExpressionDescription()
abstract class InterpretedHashFunction extends AnyRef

Base class for interpreted hash functions.
case class InterpretedMutableProjection(expressions: Seq[Expression]) extends MutableProjection with Product with Serializable

A MutableProjection that is calculated by calling eval on each of the specified expressions.
expressions
a sequence of expressions that determine the value of each column of the output row.
class InterpretedOrdering extends Ordering[InternalRow]

An interpreted row ordering comparator.
class InterpretedProjection extends Projection

A Projection that is calculated by calling the eval of each of the specified expressions.
case class IntervalExpression(children: Seq[Expression], units: Seq[Long]) extends Expression with ImplicitCastInputTypes with Product with Serializable
case class IsNaN(child: Expression) extends UnaryExpression with Predicate with ImplicitCastInputTypes with Product with Serializable

Evaluates to true iff the input is NaN.

Annotations
@ExpressionDescription()
case class IsNotNull(child: Expression) extends UnaryExpression with Predicate with Product with Serializable

An expression that is evaluated to true if the input is not null.

Annotations
@ExpressionDescription()
case class IsNull(child: Expression) extends UnaryExpression with Predicate with Product with Serializable

An expression that is evaluated to true if the input is null.

Annotations
@ExpressionDescription()
class JoinedRow extends InternalRow

A mutable wrapper that makes two rows appear as a single concatenated row. Designed to be instantiated once per thread and reused.
case class JsonToStruct(schema: StructType, options: Map[String, String], child: Expression) extends UnaryExpression with CodegenFallback with ExpectsInputTypes with Product with Serializable

Converts a JSON input string to a StructType with the specified schema.
case class JsonTuple(children: Seq[Expression]) extends Expression with Generator with CodegenFallback with Product with Serializable

Annotations
@ExpressionDescription()
case class Lag(input: Expression, offset: Expression, default: Expression) extends OffsetWindowFunction with Product with Serializable

The Lag function returns the value of input at the offset-th row before the current row in the window. Offsets start at 0, which is the current row. The offset must be a constant integer value. The default offset is 1. When the value of input is null at the offset-th row, null is returned. If there is no such offset row, the default expression is evaluated.
input
expression to evaluate offset rows before the current row.
offset
rows to jump back in the partition.
default
to use when the offset row does not exist.

Annotations
@ExpressionDescription()
case class LastDay(startDate: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the last day of the month which the date belongs to.

Annotations
@ExpressionDescription()
case class Lead(input: Expression, offset: Expression, default: Expression) extends OffsetWindowFunction with Product with Serializable

The Lead function returns the value of input at the offset-th row after the current row in the window. Offsets start at 0, which is the current row. The offset must be a constant integer value. The default offset is 1. When the value of input is null at the offset-th row, null is returned. If there is no such offset row, the default expression is evaluated.
input
expression to evaluate offset rows after the current row.
offset
rows to jump ahead in the partition.
default
to use when the offset is larger than the window. The default value is null.
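The Lead semantics, and the mirror-image Lag semantics, can be modelled over a single ordered partition. In this sketch `OffsetSketch` is a hypothetical name, `None` stands in for SQL null, and `current` is the zero-based index of the current row; none of this reflects how the window operator actually evaluates these functions.

```scala
// Hypothetical model of the semantics, not the Catalyst implementation.
object OffsetSketch {
  def lead[A](
      rows: IndexedSeq[Option[A]],
      current: Int,
      offset: Int = 1,
      default: Option[A] = None): Option[A] = {
    val target = current + offset
    // A null at the target row is returned as null; the default is used
    // only when the target row does not exist.
    if (rows.indices.contains(target)) rows(target) else default
  }

  def lag[A](
      rows: IndexedSeq[Option[A]],
      current: Int,
      offset: Int = 1,
      default: Option[A] = None): Option[A] =
    lead(rows, current, -offset, default)
}
```

Note the distinction the sketch preserves: an existing row holding null yields null, whereas a missing row yields the default.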

Annotations
@ExpressionDescription()
abstract class LeafExpression extends Expression

A leaf expression, i.e. one without any child expressions.
abstract class LeafMathExpression extends LeafExpression with CodegenFallback with Serializable

A leaf expression specifically for math constants. Math constants expect no input.
There is no code generation because they should get constant folded by the optimizer.
case class Least(children: Seq[Expression]) extends Expression with Product with Serializable

A function that returns the least value of all parameters, skipping null values. It takes at least 2 parameters, and returns null iff all parameters are null.

Annotations
@ExpressionDescription()
case class Length(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that returns the length of the given string or binary expression.

Annotations
@ExpressionDescription()
case class LessThan(left: Expression, right: Expression) extends BinaryComparison with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class LessThanOrEqual(left: Expression, right: Expression) extends BinaryComparison with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class Levenshtein(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that returns the Levenshtein distance between the two given strings.

Annotations
@ExpressionDescription()
case class Like(left: Expression, right: Expression) extends BinaryExpression with StringRegexExpression with Product with Serializable

Simple RegEx pattern matching function.

Annotations
@ExpressionDescription()
case class ListQuery(plan: LogicalPlan, exprId: ExprId = NamedExpression.newExprId) extends SubqueryExpression with Unevaluable with Product with Serializable

A ListQuery expression defines the query which we want to search in an IN subquery expression. It should and can only be used in conjunction with an IN expression.
For example (SQL):
```
SELECT  *
FROM    a
WHERE   a.id IN (SELECT  id
                 FROM    b)
```
case class Literal(value: Any, dataType: DataType) extends LeafExpression with CodegenFallback with Product with Serializable

In order to do type checking, use Literal.create() instead of the constructor.
case class Log(child: Expression) extends UnaryLogExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Log10(child: Expression) extends UnaryLogExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Log1p(child: Expression) extends UnaryLogExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Log2(child: Expression) extends UnaryLogExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Logarithm(left: Expression, right: Expression) extends BinaryMathExpression with Product with Serializable

Computes the logarithm of a number.
left
the logarithm base, defaults to e.
right
the number to compute the logarithm of.

Annotations
@ExpressionDescription()
case class Lower(child: Expression) extends UnaryExpression with String2StringExpression with Product with Serializable

A function that converts the characters of a string to lowercase.

Annotations
@ExpressionDescription()
case class MakeDecimal(child: Expression, precision: Int, scale: Int) extends UnaryExpression with Product with Serializable

Create a Decimal from an unscaled Long value. Note: this expression is internal and created only by the optimizer, so we don't need to do type checking for it.
case class MapKeys(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

Returns an unordered array containing the keys of the map.

Annotations
@ExpressionDescription()
case class MapValues(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

Returns an unordered array containing the values of the map.

Annotations
@ExpressionDescription()
case class Md5(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that calculates an MD5 128-bit checksum and returns it as a hex string, for input of type BinaryType.

Annotations
@ExpressionDescription()
case class Minute(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class MonotonicallyIncreasingID() extends LeafExpression with Nondeterministic with Product with Serializable

Returns monotonically increasing 64-bit integers.
The generated ID is guaranteed to be monotonically increasing and unique, but not consecutive. The current implementation puts the partition ID in the upper 31 bits, and the lower 33 bits represent the record number within each partition. The assumption is that the data frame has less than 1 billion partitions, and each partition has less than 8 billion records.
Since this expression is stateful, it cannot be a case object.
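The bit layout described above (partition ID in the upper 31 bits, record number in the lower 33 bits) can be sketched as plain bit arithmetic. `MonotonicIdSketch` and its method names are hypothetical; only the 31/33 split is taken from the text.

```scala
// Hypothetical sketch of the 31-bit / 33-bit layout described above.
object MonotonicIdSketch {
  private val RecordBits = 33

  def makeId(partitionId: Int, recordNumber: Long): Long = {
    require(partitionId >= 0, "partition ID must be non-negative")
    require(recordNumber >= 0 && recordNumber < (1L << RecordBits), "record number out of range")
    // Partition ID goes in the high bits, record number in the low 33 bits.
    (partitionId.toLong << RecordBits) | recordNumber
  }

  def partitionOf(id: Long): Int = (id >>> RecordBits).toInt

  def recordOf(id: Long): Long = id & ((1L << RecordBits) - 1)
}
```

Because the partition ID occupies the high bits, IDs increase within a partition and never collide across partitions, but there are large gaps between the last ID of one partition and the first ID of the next.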

Annotations
@ExpressionDescription()
case class Month(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class MonthsBetween(date1: Expression, date2: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the number of months between dates date1 and date2.

Annotations
@ExpressionDescription()
case class Multiply(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class Murmur3Hash(children: Seq[Expression], seed: Int) extends HashExpression[Int] with Product with Serializable

A MurMur3 Hash expression.
We should use this hash function for both shuffle and bucket, so that we can guarantee shuffle and bucketing have same data distribution.

Annotations
@ExpressionDescription()
final class MutableAny extends MutableValue
final class MutableBoolean extends MutableValue
final class MutableByte extends MutableValue
final class MutableDouble extends MutableValue
final class MutableFloat extends MutableValue
final class MutableInt extends MutableValue
final class MutableLong extends MutableValue
abstract class MutableProjection extends Projection

Converts an InternalRow to another Row given a sequence of expressions that define each column of the new row. If the schema of the input row is specified, then the given expressions will be bound to that schema.
In contrast to a normal projection, a MutableProjection reuses the same underlying row object each time an input row is added. This significantly reduces the cost of calculating the projection, but means that it is not safe to hold on to a reference to an InternalRow after next() has been called on the Iterator that produced it. Instead, the user must call InternalRow.copy() and hold on to the returned InternalRow before calling next().
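The hazard of holding on to a reused row can be shown with a toy stand-in. In this sketch `ReuseSketch` and `project` are hypothetical, a one-element `Array[Int]` plays the role of the reused row, and `clone()` plays the role of `InternalRow.copy()`.

```scala
// Hypothetical stand-in: one output buffer is reused for every input.
object ReuseSketch {
  def project(inputs: Iterator[Int]): Iterator[Array[Int]] = {
    val buffer = new Array[Int](1)
    inputs.map { x =>
      buffer(0) = x * 2 // overwrite the shared buffer in place
      buffer
    }
  }

  // Holding references without copying: every entry aliases the same buffer,
  // so all of them show the last value written.
  val withoutCopy: List[Int] = project(Iterator(1, 2, 3)).toList.map(_(0))

  // Copying before advancing the iterator keeps each row's own values.
  val withCopy: List[Int] = project(Iterator(1, 2, 3)).map(_.clone()).toList.map(_(0))
}
```

The copy has to happen before the iterator advances, which is why the `clone()` call sits inside the lazy `map` rather than after `toList`.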
final class MutableShort extends MutableValue

abstract class MutableValue extends Serializable

A parent class for mutable container objects that are reused when the values are changed, resulting in less garbage. These values are held by a SpecificInternalRow.

The following code was roughly used to generate these objects:

```
val types = "Int,Float,Boolean,Double,Short,Long,Byte,Any".split(",")
types.map { tpe =>
s"""
final class Mutable$tpe extends MutableValue {
  var value: $tpe = 0
  def boxed = if (isNull) null else value
  def update(v: Any) = value = {
    isNull = false
    v.asInstanceOf[$tpe]
  }
  def copy() = {
    val newCopy = new Mutable$tpe
    newCopy.isNull = isNull
    newCopy.value = value
    newCopy
  }
}"""
}.foreach(println)

types.map { tpe =>
s"""
  override def set$tpe(ordinal: Int, value: $tpe): Unit = {
    val currentValue = values(ordinal).asInstanceOf[Mutable$tpe]
    currentValue.isNull = false
    currentValue.value = value
  }

  override def get$tpe(i: Int): $tpe = {
    values(i).asInstanceOf[Mutable$tpe].value
  }"""
}.foreach(println)
```

case class NTile(buckets: Expression) extends RowNumberLike with SizeBasedWindowFunction with Product with Serializable

The NTile function divides the rows for each window partition into n buckets ranging from 1 to at most n. Bucket values will differ by at most 1. If the number of rows in the partition does not divide evenly into the number of buckets, then the remainder values are distributed one per bucket, starting with the first bucket.
The NTile function is particularly useful for the calculation of tertiles, quartiles, deciles and other common summary statistics.
The function calculates two variables during initialization: the size of a regular bucket, and the number of buckets that will have one extra row added to them (when the rows do not evenly fit into the number of buckets); both variables are based on the size of the current partition. During the calculation process the function keeps track of the current row number, the current bucket number, and the row number at which the bucket will change (bucketThreshold). When the current row number reaches the bucket threshold, the bucket value is increased by one and the threshold is increased by the bucket size (plus one extra if the current bucket is padded).
This documentation has been based upon similar documentation for the Hive and Presto projects.
buckets
number of buckets to divide the rows in. Default value is 1.
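The bucket-threshold bookkeeping described above can be sketched as a loop over row numbers. `NTileSketch` is a hypothetical name and the variable names merely echo the description; the real function is a declarative aggregate evaluated by the window operator, not a standalone method.

```scala
// Hypothetical sketch of the bucket assignment, not the Catalyst implementation.
object NTileSketch {
  def ntile(numRows: Int, buckets: Int): Seq[Int] = {
    require(buckets > 0, "buckets must be positive")
    val bucketSize = numRows / buckets         // size of a regular bucket
    val bucketsWithPadding = numRows % buckets // buckets that get one extra row
    var bucket = 1
    var bucketThreshold = bucketSize + (if (bucketsWithPadding > 0) 1 else 0)
    (1 to numRows).map { rowNumber =>
      if (rowNumber > bucketThreshold) {
        // Move to the next bucket and push the threshold forward by its size.
        bucket += 1
        bucketThreshold += bucketSize + (if (bucket <= bucketsWithPadding) 1 else 0)
      }
      bucket
    }
  }
}
```

For ten rows and three buckets, the single leftover row lands in the first bucket, so the bucket sizes are 4, 3 and 3.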

Annotations
@ExpressionDescription()
case class NaNvl(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

An Expression that evaluates to left iff left is not NaN, and to right otherwise. This Expression is useful for mapping NaN values to null.

Annotations
@ExpressionDescription()
trait NamedExpression extends Expression

An Expression that is named.
case class NextDay(startDate: Expression, dayOfWeek: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the first date which is later than startDate and named as dayOfWeek. For example, NextDay(2015-07-27, Sunday) would return 2015-08-02, which is the first Sunday later than 2015-07-27.
Allowed values of dayOfWeek are defined in DateTimeUtils.getDayOfWeekFromString.
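The example above can be reproduced with `java.time`, whose `TemporalAdjusters.next` also returns a strictly later date. `NextDaySketch` is a hypothetical name, and the sketch takes a `DayOfWeek` directly instead of parsing a string as the expression does.

```scala
import java.time.{DayOfWeek, LocalDate}
import java.time.temporal.TemporalAdjusters

// Hypothetical helper, not the Catalyst implementation.
object NextDaySketch {
  def nextDay(startDate: LocalDate, dayOfWeek: DayOfWeek): LocalDate =
    // `with` is a Scala keyword, hence the backticks.
    startDate.`with`(TemporalAdjusters.next(dayOfWeek))
}
```

If startDate already falls on the requested day, the result is one week later, consistent with "later than startDate".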

Annotations
@ExpressionDescription()
trait NonSQLExpression extends Expression

Expressions that don't have SQL representation should extend this trait. Examples are ScalaUDF, ScalaUDAF, and object expressions like MapObjects and Invoke.
trait Nondeterministic extends Expression

An expression that is nondeterministic.
case class Not(child: Expression) extends UnaryExpression with Predicate with ImplicitCastInputTypes with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class NullIf(left: Expression, right: Expression, child: Expression) extends UnaryExpression with RuntimeReplaceable with Product with Serializable

Annotations
@ExpressionDescription()
trait NullIntolerant extends AnyRef

When an expression inherits this trait, the expression is null intolerant (i.e. any null input will result in null output). We use this information when constructing IsNotNull constraints.
sealed abstract class NullOrdering extends AnyRef
case class Nvl(left: Expression, right: Expression, child: Expression) extends UnaryExpression with RuntimeReplaceable with Product with Serializable

Annotations
@ExpressionDescription()
case class Nvl2(expr1: Expression, expr2: Expression, expr3: Expression, child: Expression) extends UnaryExpression with RuntimeReplaceable with Product with Serializable

Annotations
@ExpressionDescription()
abstract class OffsetWindowFunction extends Expression with WindowFunction with Unevaluable with ImplicitCastInputTypes

An offset window function is a window function that returns the value of the input column offset by a number of rows within the partition. For instance, an OffsetWindowFunction for value x with offset -2 will get the value of x 2 rows back in the partition.
case class Or(left: Expression, right: Expression) extends BinaryOperator with Predicate with Product with Serializable

Annotations
@ExpressionDescription()
case class OuterReference(e: NamedExpression) extends LeafExpression with NamedExpression with Unevaluable with Product with Serializable

A placeholder used to hold a reference that has been resolved to a field outside of the current plan. This is used for correlated subqueries.
case class ParamLiteral(value: Any, dataType: DataType, pos: Int, execId: Int, tokenized: Boolean = false, positionIndependent: Boolean = false, valueEquals: Boolean = false) extends LeafExpression with TokenizedLiteral with KryoSerializable with Product with Serializable

In addition to TokenLiteral, this class can also be used in plan caching, so it allows the internal value to be updated in subsequent runs when the plan is re-used with different constants. For that reason it does not extend Literal (to avoid the Analyzer, Optimizer, etc. doing constant propagation, for example), and its hash/equals ignore the value and use only the position of the literal in the plan together with the data type.
Wherever ParamLiteral case matching is required, it must match for DynamicReplacableConstant and use .eval(..) for code generation; see SNAP-1597 for more details. For cases of common-subexpression elimination that depend on constant values being equal in different parts of the tree, a new RefParamLiteral has been added that points to a ParamLiteral and is always equal to it; see SNAP-2462 for more details.
trait ParamLiteralHolder extends AnyRef
case class ParseUrl(children: Seq[Expression]) extends Expression with ExpectsInputTypes with CodegenFallback with Product with Serializable

Extracts a part from a URL.

Annotations
@ExpressionDescription()
case class PercentRank(children: Seq[Expression]) extends RankLike with SizeBasedWindowFunction with Product with Serializable

The PercentRank function computes the percentage ranking of a value in a group of values. The result is the rank minus one, divided by the total number of rows in the partition minus one: (r - 1) / (n - 1). If a partition only contains one row, the function will return 0.
The PercentRank function is similar to the CumeDist function, but it uses rank values instead of row counts in its numerator.
This documentation has been based upon similar documentation for the Hive and Presto projects.
children
to base the rank on; a change in the value of one of the children will trigger a change in rank. This is an internal parameter and will be assigned by the Analyzer.
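The formula (r - 1) / (n - 1) can be checked on a small, already ordered partition. In this sketch `PercentRankSketch` is a hypothetical name, and the rank of a value is taken to be the one-based position of its first occurrence, so tied values share a rank; that reading of "rank" is an assumption of the sketch.

```scala
// Hypothetical model of the formula, not the Catalyst implementation.
object PercentRankSketch {
  def percentRank(sorted: Seq[Int]): Seq[Double] = {
    val n = sorted.length
    sorted.map { v =>
      val rank = sorted.indexOf(v) + 1 // ties share the rank of their first occurrence
      if (n > 1) (rank - 1).toDouble / (n - 1) else 0.0 // single-row partition: 0
    }
  }
}
```

For the partition 10, 20, 20, 30 the ranks are 1, 2, 2, 4, so the two tied rows both get 1/3 and the last row gets 1.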

Annotations
@ExpressionDescription()
case class Pi() extends LeafMathExpression with Product with Serializable

Pi. Note that there is no code generation because this is only evaluated by the optimizer during constant folding.

Annotations
@ExpressionDescription()
abstract class PlanExpression[T <: QueryPlan[_]] extends Expression

An interface for expressions that contain a QueryPlan.
case class Pmod(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class PosExplode(child: Expression) extends ExplodeBase with Product with Serializable

Given an input array, produces a sequence of rows for each position and value in the array.
```
SELECT posexplode(array(10,20)) ->
0  10
1  20
```
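The same position/value pairing can be sketched in plain Scala. `PosExplodeSketch` and `posExplode` are hypothetical names used only for illustration; this is not the generator's actual code path.
```scala
object PosExplodeSketch {
  // Hypothetical helper: pairs each element with its zero-based position,
  // mirroring the (pos, value) rows shown in the SQL example.
  def posExplode[A](xs: Seq[A]): Seq[(Int, A)] =
    xs.zipWithIndex.map { case (value, pos) => (pos, value) }
}
```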
Annotations
@ExpressionDescription()
case class Pow(left: Expression, right: Expression) extends BinaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class PreciseTimestamp(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

Expression used internally to convert the TimestampType to Long without losing precision, i.e. in microseconds. Used in time windowing.
trait Predicate extends Expression

An Expression that returns a boolean value.
trait PredicateHelper extends AnyRef
case class PredicateSubquery(plan: LogicalPlan, children: Seq[Expression] = Seq.empty, nullAware: Boolean = false, exprId: ExprId = NamedExpression.newExprId) extends SubqueryExpression with Predicate with Unevaluable with Product with Serializable

A predicate subquery checks the existence of a value in a sub-query. We currently only allow PredicateSubquery expressions within a Filter plan (i.e. a WHERE or HAVING clause). This will be rewritten into a left semi/anti join during analysis.
case class PrettyAttribute(name: String, dataType: DataType = NullType) extends Attribute with Unevaluable with Product with Serializable

A placeholder used when printing expressions without debugging information such as the expression id or the unresolved indicator.
case class PrintToStderr(child: Expression) extends UnaryExpression with Product with Serializable

Print the result of an expression to stderr (used for debugging codegen).
abstract class Projection extends (InternalRow) ⇒ InternalRow

Converts an InternalRow to another Row given a sequence of expressions that define each column of the new row. If the schema of the input row is specified, then the given expressions will be bound to that schema.
case class PromotePrecision(child: Expression) extends UnaryExpression with Product with Serializable

An expression used to wrap the children when promoting the precision of DecimalType, to avoid promoting multiple times.
case class Quarter(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
abstract class RDG extends UnaryExpression with ExpectsInputTypes with Nondeterministic

A Random distribution generating expression. TODO: This can be made generic to generate any type of random distribution, or any type of StructType.
Since this expression is stateful, it cannot be a case object.
case class RLike(left: Expression, right: Expression) extends BinaryExpression with StringRegexExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Rand(child: Expression) extends RDG with Product with Serializable

Generate a random column with i.i.d. uniformly distributed values in [0, 1).

Annotations
@ExpressionDescription()
case class Randn(child: Expression) extends RDG with Product with Serializable

Generate a random column with i.i.d. values drawn from the standard normal distribution.

Annotations
@ExpressionDescription()
case class Rank(children: Seq[Expression]) extends RankLike with Product with Serializable

The Rank function computes the rank of a value in a group of values. The result is one plus the number of rows that precede the current row's value in the ordering of the partition, so tied rows share a rank and ties produce gaps in the sequence.
This documentation has been based upon similar documentation for the Hive and Presto projects.
children
to base the rank on; a change in the value of one of the children will trigger a change in rank. This is an internal parameter and will be assigned by the Analyzer.
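How ties lead to gaps can be seen in a small plain-Scala sketch. `RankSketch` and `rank` are hypothetical names, the input is assumed to be one partition sorted in ascending window order, and this is not how the window function is actually evaluated.
```scala
object RankSketch {
  // Hypothetical helper: rank = 1 + number of rows with a strictly smaller value,
  // so tied rows share a rank and the next distinct value skips ahead.
  def rank(sorted: Seq[Int]): Seq[Int] =
    sorted.map(v => sorted.count(_ < v) + 1)
}
```
For 10, 20, 20, 30 this gives 1, 2, 2, 4: rank 3 is skipped because two rows tie at rank 2.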

Annotations
@ExpressionDescription()
abstract class RankLike extends AggregateWindowFunction

A RankLike function is a WindowFunction that changes its value based on a change in the value of the order of the window in which it is processed. For instance, when the value of input changes in a window ordered by input, the rank function also changes. The size of the change of the rank function is (typically) not dependent on the size of the change in input.
This documentation has been based upon similar documentation for the Hive and Presto projects.
final class RefParamLiteral extends ParamLiteral

This class is used as a substitution for ParamLiteral when two ParamLiterals have the same constant values during parsing. It behaves as equal to the ParamLiteral it points to in all respects but is different from other ParamLiterals. Two RefParamLiterals are equal iff their respective ParamLiterals are.
The above policy allows an expression like "a = 4 and b = 4" to be equal to "a = 5 and b = 5" after tokenization, while remaining different from "a = 5 and b = 6". This distinction is required because the former can lead to a different execution plan after common-subexpression processing etc., which can apply only because the actual values of the two tokenized literals are equal in this instance. The plan can therefore differ when the actual constants are different, so after tokenization the two cases should act as different expressions. See TPCH Q19 for an example where equal values in two different positions lead to an optimized plan: a common subexpression is pulled out of the OR conditions as a separate AND condition, which enables further filter push-down that is not possible if the actual values are different.
Note: This class maintains its own copy of the value since it can change in execution (e.g. ROUND can change the precision of the underlying Decimal value), which should not lead to a change in the value of the referenced ParamLiteral or vice versa. However, during planning, code generation and other phases before runJob, the value and dataType should match exactly, which is checked by referenceEquals. After deserialization on a remote executor, the class no longer maintains a reference and falls back to behaving like a regular ParamLiteral, since the required analysis and other phases are already done and final code generation requires a copy of the values.
case class ReferenceToExpressions(result: Expression, children: Seq[Expression]) extends Expression with Product with Serializable

A special expression that evaluates BoundReferences by given expressions instead of the input row.
result
The expression that contains BoundReference and produces the final output.
children
The expressions that are used as input values for BoundReference.
case class RegExpExtract(subject: Expression, regexp: Expression, idx: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

Extracts a specific (idx) group identified by a Java regex.
NOTE: this expression is not THREAD-SAFE, as it has some internal mutable state.

Annotations
@ExpressionDescription()
case class RegExpReplace(subject: Expression, regexp: Expression, rep: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

Replaces all substrings of str that match regexp with rep.
NOTE: this expression is not THREAD-SAFE, as it has some internal mutable state.

Annotations
@ExpressionDescription()
case class Remainder(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class Rint(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Rollup(groupByExprs: Seq[Expression]) extends Expression with GroupingSet with Product with Serializable
case class Round(child: Expression, scale: Expression) extends RoundBase with Serializable with ImplicitCastInputTypes with Product

Round an expression to d decimal places using HALF_UP rounding mode. round(2.5) == 3.0, round(3.5) == 4.0.
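HALF_UP rounding can be illustrated with the JDK's `java.math.BigDecimal`. This is a sketch under that assumption only: `RoundSketch` and `roundHalfUp` are hypothetical names, and Spark's own Decimal type is not used here.
```scala
import java.math.{BigDecimal => JBigDecimal, RoundingMode}

object RoundSketch {
  // Hypothetical helper: rounds a decimal string to `scale` places using HALF_UP,
  // i.e. ties are rounded away from zero.
  def roundHalfUp(value: String, scale: Int): JBigDecimal =
    new JBigDecimal(value).setScale(scale, RoundingMode.HALF_UP)
}
```
With this helper, `roundHalfUp("2.5", 0)` and `roundHalfUp("3.5", 0)` compare equal to 3 and 4 respectively, matching the examples above.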

Annotations
@ExpressionDescription()
abstract class RoundBase extends BinaryExpression with Serializable with ImplicitCastInputTypes

Rounds the child's result to scale decimal places when scale >= 0, or rounds at the integral part when scale < 0.
A child of IntegralType rounds to itself when scale >= 0. A child of FractionalType whose value is NaN or infinite always rounds to itself.
Round's dataType always equals the child's dataType, except for DecimalType, where the scale may decrease relative to the original DecimalType.
abstract class RowBasedKeyValueBatch extends MemoryConsumer
case class RowNumber() extends RowNumberLike with Product with Serializable

The RowNumber function assigns a unique, sequential number to each row, starting with one, according to the ordering of rows within the window partition.
This documentation has been based upon similar documentation for the Hive and Presto projects.

Annotations
@ExpressionDescription()
abstract class RowNumberLike extends AggregateWindowFunction
trait RuntimeReplaceable extends UnaryExpression with Unevaluable

An expression that gets replaced at runtime (currently by the optimizer) into a different expression for evaluation. This is mainly used to provide compatibility with other databases. For example, we use this to support "nvl" by replacing it with "coalesce".
A RuntimeReplaceable should have the original parameters along with a "child" expression in the case class constructor, and define a normal constructor that accepts only the original parameters. For an example, see Nvl. To make sure the explain plan and the expression SQL work correctly, the implementation should also override the flatArguments and sql methods.
case class ScalaUDF(function: AnyRef, dataType: DataType, children: Seq[Expression], inputTypes: Seq[DataType] = Nil, udfName: Option[String] = None) extends Expression with ImplicitCastInputTypes with NonSQLExpression with Product with Serializable

User-defined function. Note that the user-defined functions must be deterministic.
function
The user-defined Scala function to run. Note that if you use primitive parameters, you cannot check whether an input is null, and the UDF will return null for you if the primitive input is null. Use a boxed type or Option if you want to do the null handling yourself.
dataType
Return type of function.
children
The input expressions of this UDF.
inputTypes
The expected input types of this UDF, used to perform type coercion. If we do not want to perform coercion, simply use "Nil". Note that it would've been better to use Option of Seq[DataType] so we can use "None" as the case for no type coercion. However, that would require more refactoring of the codebase.
udfName
The user-specified name of this UDF.
case class ScalarSubquery(plan: LogicalPlan, children: Seq[Expression] = Seq.empty, exprId: ExprId = NamedExpression.newExprId) extends SubqueryExpression with Unevaluable with Product with Serializable

A subquery that will return only one row and one column. This will be converted into a physical scalar subquery during planning.
Note: exprId is used to have a unique name in explain string output.
case class Second(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class Sentences(str: Expression, language: Expression = Literal(""), country: Expression = Literal("")) extends Expression with ImplicitCastInputTypes with CodegenFallback with Product with Serializable

Splits a string into arrays of sentences, where each sentence is an array of words. The 'lang' and 'country' arguments are optional, and if omitted, the default locale is used.

Annotations
@ExpressionDescription()
case class Sha1(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that calculates a SHA-1 hash value and returns it as a hex string, for input of type BinaryType or StringType.

Annotations
@ExpressionDescription()
case class Sha2(left: Expression, right: Expression) extends BinaryExpression with Serializable with ImplicitCastInputTypes with Product

A function that calculates a hash from the SHA-2 family of functions (SHA-224, SHA-256, SHA-384, and SHA-512) and returns it as a hex string. The first argument is the string or binary to be hashed. The second argument indicates the desired bit length of the result, which must have a value of 224, 256, 384, 512, or 0 (which is equivalent to 256). SHA-224 is supported starting from Java 8. If an unsupported SHA function is requested, the return value is NULL. If either argument is NULL or the hash length is not one of the permitted values, the return value is NULL.
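The bit-length handling described above can be sketched with the JDK's `java.security.MessageDigest`. `Sha2Sketch` and `sha2Hex` are hypothetical names, and `None` stands in for the SQL NULL result; this is not the expression's actual implementation.
```scala
import java.security.MessageDigest

object Sha2Sketch {
  // Hypothetical helper: maps the requested bit length to a JDK algorithm name.
  // 0 is treated as 256; any other unlisted value yields None (SQL NULL).
  def sha2Hex(input: Array[Byte], bitLength: Int): Option[String] = {
    val algorithm = bitLength match {
      case 224     => Some("SHA-224")
      case 256 | 0 => Some("SHA-256")
      case 384     => Some("SHA-384")
      case 512     => Some("SHA-512")
      case _       => None
    }
    algorithm.map { name =>
      // Render each digest byte as two lowercase hex digits.
      MessageDigest.getInstance(name).digest(input).map(b => f"${b & 0xff}%02x").mkString
    }
  }
}
```
A 512-bit request produces 128 hex characters, and a length such as 100 produces `None`.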

Annotations
@ExpressionDescription()
case class ShiftLeft(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Bitwise left shift.
left
the base number to shift.
right
number of bits to left shift.

Annotations
@ExpressionDescription()
case class ShiftRight(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Bitwise (signed) right shift.
left
the base number to shift.
right
number of bits to right shift.

Annotations
@ExpressionDescription()
case class ShiftRightUnsigned(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Bitwise unsigned right shift, for integer and long data types.
left
the base number.
right
the number of bits to right shift.
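The difference between the signed and unsigned right shifts is easiest to see on a negative number. The sketch below uses Scala's built-in `<<`, `>>` and `>>>` operators on Int; `ShiftSketch` is a hypothetical name and the values are illustrative only.
```scala
object ShiftSketch {
  val left: Int = 1 << 3        // bits move left, zeros fill from the right
  val signed: Int = -8 >> 1     // the sign bit is copied in, so the result stays negative
  val unsigned: Int = -8 >>> 1  // a zero is shifted in, so the result becomes positive
}
```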

Annotations
@ExpressionDescription()
case class Signum(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Sin(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Sinh(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Size(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

Given an array or map, returns its size. Returns -1 if null.

Annotations
@ExpressionDescription()
trait SizeBasedWindowFunction extends AggregateWindowFunction

A SizeBasedWindowFunction needs the size of the current window for its calculation.
case class SortArray(base: Expression, ascendingOrder: Expression) extends BinaryExpression with ExpectsInputTypes with CodegenFallback with Product with Serializable

Sorts the input array in ascending / descending order according to the natural ordering of the array elements and returns it.

Annotations
@ExpressionDescription()
sealed abstract class SortDirection extends AnyRef
case class SortOrder(child: Expression, direction: SortDirection, nullOrdering: NullOrdering) extends UnaryExpression with Unevaluable with Product with Serializable

An expression that can be used to sort a tuple. This class extends expression primarily so that transformations over expression will descend into its child.
case class SortPrefix(child: SortOrder) extends UnaryExpression with Product with Serializable

An expression to generate a 64-bit long prefix used in sorting. If the sort must operate over null keys as well, this.nullValue can be used in place of emitted null prefixes in the sort.
case class SoundEx(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

A function that returns the Soundex code of the given string expression.

Annotations
@ExpressionDescription()
case class SparkPartitionID() extends LeafExpression with Nondeterministic with Product with Serializable

Expression that returns the current partition id.

Annotations
@ExpressionDescription()
trait SpecializedGetters extends AnyRef
final class SpecificInternalRow extends InternalRow with BaseGenericInternalRow

A row type that holds an array of specialized container objects, of type MutableValue, chosen based on the dataTypes of each column. The intent is to decrease garbage when modifying the values of primitive columns.
case class SpecifiedWindowFrame(frameType: FrameType, frameStart: FrameBoundary, frameEnd: FrameBoundary) extends WindowFrame with Product with Serializable

A specified Window Frame.
case class Sqrt(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Stack(children: Seq[Expression]) extends Expression with Generator with CodegenFallback with Product with Serializable

Separate v1, ..., vk into n rows. Each row will have k/n columns. n must be constant.
```
SELECT stack(2, 1, 2, 3) ->
1      2
3      NULL
```
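The row layout in the example above, including the NULL padding of the last row, can be sketched in plain Scala. `StackSketch` and `stack` are hypothetical names, `None` stands in for SQL NULL, and the column count is assumed to be k/n rounded up, as the example output suggests.
```scala
object StackSketch {
  // Hypothetical helper: splits `values` into n rows of ceil(k / n) columns,
  // padding the final row with None where values run out.
  def stack[A](n: Int, values: Seq[A]): Seq[Seq[Option[A]]] = {
    require(n > 0 && values.nonEmpty, "n must be positive and values non-empty")
    val cols = (values.size + n - 1) / n
    val padded: Seq[Option[A]] = values.map(v => Some(v): Option[A]).padTo(n * cols, None)
    padded.grouped(cols).toSeq
  }
}
```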
Annotations
@ExpressionDescription()
case class StartsWith(left: Expression, right: Expression) extends BinaryExpression with StringPredicate with Product with Serializable

A function that returns true if the string left starts with the string right.
trait String2StringExpression extends Expression with ImplicitCastInputTypes
case class StringInstr(str: Expression, substr: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that returns the position of the first occurrence of substr in the given string. Returns null if either of the arguments is null and returns 0 if substr could not be found in str.
NOTE: the index is 1-based, not zero-based. The first character in str has index 1.
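The 1-based result and the null/0 cases can be sketched in plain Scala. `InstrSketch` and `instr` are hypothetical names and `Option` models SQL NULL; note that `String.indexOf` counts UTF-16 code units, which may differ from the expression's character counting for some inputs.
```scala
object InstrSketch {
  // Hypothetical helper: None if either argument is null (None),
  // otherwise the 1-based position, with 0 meaning "not found".
  def instr(str: Option[String], substr: Option[String]): Option[Int] =
    for (s <- str; sub <- substr) yield s.indexOf(sub) + 1
}
```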

Annotations
@ExpressionDescription()
case class StringLPad(str: Expression, len: Expression, pad: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns str, left-padded with pad to a length of len.

Annotations
@ExpressionDescription()
case class StringLocate(substr: Expression, str: Expression, start: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that returns the position of the first occurrence of substr in the given string after position pos.

Annotations
@ExpressionDescription()
trait StringPredicate extends Expression with Predicate with ImplicitCastInputTypes

A base trait for functions that compare two strings, returning a boolean.
case class StringRPad(str: Expression, len: Expression, pad: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns str, right-padded with pad to a length of len.

Annotations
@ExpressionDescription()
trait StringRegexExpression extends Expression with ImplicitCastInputTypes
case class StringRepeat(str: Expression, times: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the string that repeats the given string value n times.

Annotations
@ExpressionDescription()
case class StringReverse(child: Expression) extends UnaryExpression with String2StringExpression with Product with Serializable

Returns the given string reversed.

Annotations
@ExpressionDescription()
case class StringSpace(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns a string consisting of n spaces.

Annotations
@ExpressionDescription()
case class StringSplit(str: Expression, pattern: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Splits str around matches of pattern (pattern is a regular expression).

Annotations
@ExpressionDescription()
case class StringToMap(text: Expression, pairDelim: Expression, keyValueDelim: Expression) extends TernaryExpression with CodegenFallback with ExpectsInputTypes with Product with Serializable

Creates a map after splitting the input text into key/value pairs using delimiters.

Annotations
@ExpressionDescription()
case class StringTranslate(srcExpr: Expression, matchingExpr: Expression, replaceExpr: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that translates any character in srcExpr into a character in replaceExpr. The characters in replaceExpr correspond to the characters in matchingExpr. The translation happens when any character in the string matches a character in matchingExpr.
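The positional correspondence between matchingExpr and replaceExpr can be sketched in plain Scala. `TranslateSketch` and `translate` are hypothetical names, and the sketch assumes both strings have the same length with no repeated characters in `matching`; other cases are not covered here.
```scala
object TranslateSketch {
  // Hypothetical helper: the i-th char of `matching` maps to the i-th char of `replace`;
  // characters of `src` that are not in `matching` are kept unchanged.
  def translate(src: String, matching: String, replace: String): String = {
    val mapping = matching.zip(replace).toMap
    src.map(c => mapping.getOrElse(c, c))
  }
}
```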

Annotations
@ExpressionDescription()
case class StringTrim(child: Expression) extends UnaryExpression with String2StringExpression with Product with Serializable

A function that trims the spaces from both ends of the specified string.

Annotations
@ExpressionDescription()
case class StringTrimLeft(child: Expression) extends UnaryExpression with String2StringExpression with Product with Serializable

A function that trims the spaces from the left end of the given string.

Annotations
@ExpressionDescription()
case class StringTrimRight(child: Expression) extends UnaryExpression with String2StringExpression with Product with Serializable

A function that trims the spaces from the right end of the given string.

Annotations
@ExpressionDescription()
case class StructToJson(options: Map[String, String], child: Expression) extends UnaryExpression with CodegenFallback with ExpectsInputTypes with Product with Serializable

Converts a StructType to a json output string.
abstract class SubqueryExpression extends PlanExpression[LogicalPlan]

A base interface for expressions that contain a LogicalPlan.
case class Substring(str: Expression, pos: Expression, len: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

A function that takes a substring of its first argument starting at a given position. Defined for String and Binary types.
NOTE: the index is 1-based, not zero-based. The first character in str has index 1.

Annotations
@ExpressionDescription()
case class SubstringIndex(strExpr: Expression, delimExpr: Expression, countExpr: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the substring from string str before count occurrences of the delimiter delim. If count is positive, everything to the left of the final delimiter (counting from the left) is returned. If count is negative, everything to the right of the final delimiter (counting from the right) is returned. substring_index performs a case-sensitive match when searching for delim.
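The positive and negative count cases can be sketched in plain Scala. `SubstringIndexSketch` and `substringIndex` are hypothetical names; returning an empty string for a zero count or empty delimiter is an assumption of this sketch, and overlapping delimiter matches are not handled specially.
```scala
import java.util.regex.Pattern

object SubstringIndexSketch {
  // Hypothetical helper: splits on the literal delimiter, then keeps the first
  // `count` pieces (count > 0) or the last `-count` pieces (count < 0).
  def substringIndex(str: String, delim: String, count: Int): String =
    if (count == 0 || delim.isEmpty) "" // assumption: not stated in the text above
    else {
      val parts = str.split(Pattern.quote(delim), -1)
      if (count > 0) parts.take(count).mkString(delim)
      else parts.takeRight(-count).mkString(delim)
    }
}
```
If the delimiter occurs fewer times than requested, this sketch returns the whole string.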

Annotations
@ExpressionDescription()
case class Subtract(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class Tan(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class Tanh(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class TermValues(literalValueRef: String, isNull: String, valueTerm: String) extends Product with Serializable
abstract class TernaryExpression extends Expression

An expression with three inputs and one output. The output is by default evaluated to null if any input is evaluated to null.
case class TimeAdd(start: Expression, interval: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Adds an interval to a timestamp.
case class TimeSub(start: Expression, interval: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Subtracts an interval from a timestamp.
case class TimeWindow(timeColumn: Expression, windowDuration: Long, slideDuration: Long, startTime: Long) extends UnaryExpression with ImplicitCastInputTypes with Unevaluable with NonSQLExpression with Product with Serializable
case class ToDate(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns the date part of a timestamp or string.

Annotations
@ExpressionDescription()
case class ToDegrees(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class ToRadians(child: Expression) extends UnaryMathExpression with Product with Serializable

Annotations
@ExpressionDescription()
case class ToUTCTimestamp(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Given a timestamp, which corresponds to a certain time of day in the given timezone, returns another timestamp that corresponds to the same time of day in UTC.

Annotations
@ExpressionDescription()
case class ToUnixTimestamp(timeExp: Expression, format: Expression) extends UnixTime with Product with Serializable

Converts time string with given pattern. Deterministic version of UnixTimestamp, must have at least one parameter.

Annotations
@ExpressionDescription()
final class TokenLiteral extends Literal with TokenizedLiteral with KryoSerializable

A Literal that passes its value as a reference object in generated code instead of embedding as a constant to allow generated code reuse.
trait TokenizedLiteral extends LeafExpression with DynamicReplacableConstant
case class TruncDate(date: Expression, format: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

Returns date truncated to the unit specified by the format.

Annotations
@ExpressionDescription()
case class UnBase64(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Converts the argument from a base 64 string to BINARY.

Annotations
@ExpressionDescription()
abstract class UnaryExpression extends Expression

An expression with one input and one output. The output is by default evaluated to null if the input is evaluated to null.
abstract class UnaryLogExpression extends UnaryMathExpression
abstract class UnaryMathExpression extends UnaryExpression with Serializable with ImplicitCastInputTypes

A unary expression specifically for math functions. Math Functions expect a specific type of input format, therefore these functions extend ExpectsInputTypes.
case class UnaryMinus(child: Expression) extends UnaryExpression with ExpectsInputTypes with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
case class UnaryPositive(child: Expression) extends UnaryExpression with ExpectsInputTypes with NullIntolerant with Product with Serializable

Annotations
@ExpressionDescription()
trait Unevaluable extends Expression

An expression that cannot be evaluated. Some expressions don't live past analysis or optimization time (e.g. Star). This trait is used by those expressions.
case class Unhex(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Performs the inverse operation of HEX. Resulting characters are returned as a byte array.

Annotations
@ExpressionDescription()
abstract class UnixTime extends BinaryExpression with ExpectsInputTypes
case class UnixTimestamp(timeExp: Expression, format: Expression) extends UnixTime with Product with Serializable

Converts a time string with the given pattern (see [http://docs.oracle.com/javase/tutorial/i18n/format/simpleDateFormat.html]) to a Unix timestamp (in seconds); returns null on failure. Note that the Hive Language Manual says it returns 0 on failure, but in fact it returns null. If the second parameter is missing, "yyyy-MM-dd HH:mm:ss" is used. If no parameters are provided, the first parameter will be current_timestamp. If the first parameter is a Date or Timestamp instead of a String, the second parameter is ignored.
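The parse-or-null behaviour and the default pattern can be sketched with the JDK's `SimpleDateFormat`. `UnixTimestampSketch` and `unixTimestamp` are hypothetical names, `None` stands in for SQL NULL, and the time zone is pinned to UTC purely so the sketch is reproducible; the real expression's time zone handling is not modelled here.
```scala
import java.text.{ParseException, SimpleDateFormat}
import java.util.TimeZone

object UnixTimestampSketch {
  // Hypothetical helper: parses `time` with `pattern` and returns seconds since the epoch,
  // or None when the string does not match the pattern.
  def unixTimestamp(time: String, pattern: String = "yyyy-MM-dd HH:mm:ss"): Option[Long] = {
    val fmt = new SimpleDateFormat(pattern)
    fmt.setTimeZone(TimeZone.getTimeZone("UTC")) // assumption: fixed zone for reproducibility
    try Some(fmt.parse(time).getTime / 1000L)
    catch { case _: ParseException => None }
  }
}
```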

Annotations
@ExpressionDescription()
case class UnresolvedWindowExpression(child: Expression, windowSpec: WindowSpecReference) extends UnaryExpression with Unevaluable with Product with Serializable
final class UnsafeArrayData extends ArrayData
final class UnsafeMapData extends MapData
abstract class UnsafeProjection extends Projection

A projection that returns UnsafeRow.
final class UnsafeRow extends InternalRow with Externalizable with KryoSerializable
case class UnscaledValue(child: Expression) extends UnaryExpression with Product with Serializable

Returns the unscaled Long value of a Decimal, assuming it fits in a Long. Note: this expression is internal and created only by the optimizer, so no type check is needed for it.
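What "unscaled value" means can be shown with `java.math.BigDecimal`, which stores a decimal as an integer plus a scale. The helper name `unscaled` is hypothetical; Spark's own Decimal type is not used here.

```scala
// Illustration with java.math.BigDecimal; not the Catalyst Decimal class.
object UnscaledValueSketch {
  // 123.45 is stored as the unscaled value 12345 with scale 2.
  // longValueExact throws ArithmeticException if the value does not fit in a Long.
  def unscaled(d: java.math.BigDecimal): Long =
    d.unscaledValue().longValueExact()
}
```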
case class UpCast(child: Expression, dataType: DataType, walkedTypePath: Seq[String]) extends UnaryExpression with Unevaluable with Product with Serializable

Casts the child expression to the target data type, but throws an error if the cast might truncate, e.g. long -> int, timestamp -> date.
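The truncation check can be pictured with a toy model in which types are plain strings. Everything in this sketch (`mayTruncate`, `upCast`, the widening order, the error text) is hypothetical and covers only the two examples given above plus a few integral types.

```scala
// Toy model of an "up cast" check; type names are plain strings, not Catalyst DataTypes.
object UpCastSketch {
  // Assumed widening order for a few integral types, narrowest first.
  private val integralWidening = Seq("byte", "short", "int", "long")

  // True when converting `from` to `to` could lose information.
  def mayTruncate(from: String, to: String): Boolean = {
    val f = integralWidening.indexOf(from)
    val t = integralWidening.indexOf(to)
    if (f >= 0 && t >= 0) f > t
    else from == "timestamp" && to == "date"
  }

  // Left carries the error an up cast would raise; Right carries the accepted target type.
  def upCast(from: String, to: String): Either[String, String] =
    if (mayTruncate(from, to)) Left(s"Cannot up cast from $from to $to as it may truncate")
    else Right(to)
}
```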
case class Upper(child: Expression) extends UnaryExpression with String2StringExpression with Product with Serializable

A function that converts the characters of a string to uppercase.

Annotations
@ExpressionDescription()
case class UserDefinedGenerator(elementSchema: StructType, function: (Row) ⇒ TraversableOnce[InternalRow], children: Seq[Expression]) extends Expression with Generator with CodegenFallback with Product with Serializable

A generator that produces its output using the provided lambda function.
case class ValueFollowing(value: Int) extends FrameBoundary with Product with Serializable

<value> FOLLOWING boundary.
case class ValuePreceding(value: Int) extends FrameBoundary with Product with Serializable

<value> PRECEDING boundary.
final class VariableLengthRowBasedKeyValueBatch extends RowBasedKeyValueBatch
case class WeekOfYear(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()
case class WindowExpression(windowFunction: Expression, windowSpec: WindowSpecDefinition) extends Expression with Unevaluable with Product with Serializable
sealed trait WindowFrame extends AnyRef

The trait used to represent a Window Frame.
trait WindowFunction extends Expression

A window function is a function that can only be evaluated in the context of a window operator.
sealed trait WindowSpec extends AnyRef

The trait of the Window Specification (specified in the OVER clause or WINDOW clause) for Window Functions.
case class WindowSpecDefinition(partitionSpec: Seq[Expression], orderSpec: Seq[SortOrder], frameSpecification: WindowFrame) extends Expression with WindowSpec with Unevaluable with Product with Serializable

The specification for a window function.
partitionSpec
It defines the way that input rows are partitioned.
orderSpec
It defines the ordering of rows in a partition.
frameSpecification
It defines the window frame in a partition.
case class WindowSpecReference(name: String) extends WindowSpec with Product with Serializable

A Window specification reference that refers to the WindowSpecDefinition defined under the name name.
final class XXH64 extends AnyRef
case class XxHash64(children: Seq[Expression], seed: Long) extends HashExpression[Long] with Product with Serializable

A xxHash64 64-bit hash expression.
case class Year(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

Annotations
@ExpressionDescription()

Value Members

object Ascending extends SortDirection with Product with Serializable
object AttributeMap extends Serializable

Builds a map that is keyed by an Attribute's expression id. Using the expression id allows values to be looked up even when the attributes used differ cosmetically (i.e., the capitalization of the name, or the expected nullability).
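A rough sketch of why keying by expression id helps. `Attr` and `AttrMap` are hypothetical simplifications (an attribute reduced to a name, a nullability flag and a numeric id), not the Catalyst classes.

```scala
// Toy model of a map keyed by an attribute's expression id.
object AttributeMapSketch {
  // Hypothetical, simplified attribute: only exprId identifies it.
  final case class Attr(name: String, nullable: Boolean, exprId: Long)

  final class AttrMap[A](entries: Seq[(Attr, A)]) {
    // Index by exprId only, so name capitalization and nullability are ignored on lookup.
    private val byId: Map[Long, A] =
      entries.map { case (a, v) => a.exprId -> v }.toMap

    def get(a: Attr): Option[A] = byId.get(a.exprId)
  }
}
```

Looking up `Attr("ID", true, 1L)` in a map built from `Attr("id", false, 1L)` succeeds, because both carry the same id.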
object AttributeSet extends Serializable
object BinaryArithmetic
object BinaryComparison
object BinaryOperator
object BindReferences extends internal.Logging
object CallMethodViaReflection extends Serializable
object Canonicalize

Rewrites an expression using rules that are guaranteed to preserve the result while attempting to remove cosmetic variations. Deterministic expressions that are equal after canonicalization will always return the same answer given the same input (i.e. false positives should not be possible). However, it is possible that two canonical expressions that are not equal will in fact return the same answer given any input (i.e. false negatives are possible).
The following rules are applied:
- Names and nullability hints for org.apache.spark.sql.types.DataTypes are stripped.
- Commutative and associative operations (Add and Multiply) have their children ordered by hashCode.
- EqualTo and EqualNullSafe are reordered by hashCode.
- Other comparisons (GreaterThan, LessThan) are reversed by hashCode.
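The hashCode-ordering rule for commutative and associative operations can be sketched on a tiny expression tree. The types `Lit`, `Ref` and `Add` below are hypothetical stand-ins defined inside the example, and only the Add rule is modelled.

```scala
// Toy canonicalization of Add chains; not the Catalyst Canonicalize object.
object CanonicalizeSketch {
  sealed trait Expr
  final case class Lit(value: Int) extends Expr
  final case class Ref(name: String) extends Expr
  final case class Add(left: Expr, right: Expr) extends Expr

  // Flatten nested Adds into their non-Add operands.
  private def operands(e: Expr): Seq[Expr] = e match {
    case Add(l, r) => operands(l) ++ operands(r)
    case other     => Seq(other)
  }

  // Rebuild an Add chain with operands sorted by hashCode,
  // so that a + b and b + a produce the same tree.
  def canonicalize(e: Expr): Expr = e match {
    case a: Add => operands(a).sortBy(_.hashCode).reduce[Expr](Add(_, _))
    case other  => other
  }
}
```

Two operands with colliding hash codes keep their original relative order, which is one way the false negatives mentioned above can arise.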
object CaseKeyWhen

Case statements of the form "CASE a WHEN b THEN c [WHEN d THEN e]* [ELSE f] END". When a = b, returns c; when a = d, returns e; else returns f.
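The evaluation order of such a statement can be sketched as a first-match lookup. `caseKeyWhen` is a hypothetical helper; `None` stands in for the SQL null returned when no branch matches and no ELSE is given.

```scala
// Toy evaluation of CASE key WHEN b THEN c [WHEN d THEN e]* [ELSE f] END.
object CaseKeyWhenSketch {
  // Branches are tried in order; the first whose WHEN value equals the key wins.
  def caseKeyWhen[K, V](
      key: K,
      branches: Seq[(K, V)],
      elseValue: Option[V] = None): Option[V] =
    branches.collectFirst { case (k, v) if k == key => v }.orElse(elseValue)
}
```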
object CaseWhen extends Serializable

Factory methods for CaseWhen.
object Cast extends Serializable
object CreateStruct extends FunctionBuilder

Returns a Row containing the evaluation of all children expressions.
object CurrentRow extends FrameBoundary with Product with Serializable

CURRENT ROW boundary.
object DecimalLiteral

Extractor and other utility methods for decimal literals.
object Descending extends SortDirection with Product with Serializable
val EmptyRow: InternalRow

Used as input into expressions whose output does not depend on any input value.
object Equality

An extractor that matches both standard 3VL equality and null-safe equality.
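The difference between the two kinds of equality can be sketched with `Option`, where `None` plays the role of SQL null. The function names are hypothetical and model only the comparison semantics, not the extractor itself.

```scala
// Toy model of three-valued (3VL) equality versus null-safe equality.
object EqualitySketch {
  // Standard equality: the result is unknown (None) if either side is null.
  def equalTo[A](l: Option[A], r: Option[A]): Option[Boolean] =
    for (a <- l; b <- r) yield a == b

  // Null-safe equality: always true or false; two nulls compare equal.
  def equalNullSafe[A](l: Option[A], r: Option[A]): Boolean = l == r
}
```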
object ExprId extends Serializable
object ExpressionSet
object ExtractValue
object Factorial extends Serializable
object FrameBoundary

Extractor for making working with frame boundaries easier.
object FromUnsafeProjection

A projection that could turn UnsafeRow into GenericInternalRow
object Hex extends Serializable
object HiveHashFunction extends InterpretedHashFunction
object IntegerLiteral

Extractor for retrieving Int literals.
object InterpretedOrdering extends Serializable
object InterpretedPredicate
object Literal extends Serializable
object Murmur3HashFunction extends InterpretedHashFunction
object NamePlaceholder extends LeafExpression with Unevaluable with Product with Serializable

An expression representing a not yet available attribute name. This expression is unevaluable and, as its name suggests, it is a temporary placeholder until we're able to determine the actual attribute name.
object NamedExpression
object NonNullLiteral

An extractor that matches non-null literal values
object NullsFirst extends NullOrdering with Product with Serializable
object NullsLast extends NullOrdering with Product with Serializable
object ParseUrl extends Serializable
object PredicateSubquery extends Serializable
object Rand extends Serializable
object Randn extends Serializable
object RangeFrame extends FrameType with Product with Serializable

RangeFrame treats rows in a partition as groups of peers. All rows having the same ORDER BY ordering are considered as peers. When a ValuePreceding or a ValueFollowing is used as its FrameBoundary, the value is considered as a logical offset. For example, assuming the value of the current row's ORDER BY expression expr is v, RANGE BETWEEN 1 PRECEDING AND 1 FOLLOWING represents a frame containing rows whose expr values are in the range [v-1, v+1].
If the ORDER BY clause is not defined, all rows in the partition are considered peers of the current row.
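A logical offset can be illustrated on a partition reduced to its integer ORDER BY values. `rangeFrame` is a hypothetical helper written for this sketch; it is not how the window operator is implemented.

```scala
// Toy RANGE frame over integer ORDER BY values.
object RangeFrameSketch {
  // Selects every row whose ordering value lies in [v - preceding, v + following],
  // where v is the ordering value of the row at index `current`.
  def rangeFrame(orderValues: Seq[Int], current: Int, preceding: Int, following: Int): Seq[Int] = {
    val v = orderValues(current)
    orderValues.filter(x => x >= v - preceding && x <= v + following)
  }
}
```

For ordering values `1, 2, 2, 3, 5` and the current row at index 1 (value 2), RANGE BETWEEN 1 PRECEDING AND 1 FOLLOWING selects `1, 2, 2, 3`: both rows with value 2 are peers, and 5 falls outside [1, 3].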
object RowFrame extends FrameType with Product with Serializable

RowFrame treats rows in a partition individually. When a ValuePreceding or a ValueFollowing is used as its FrameBoundary, the value is considered as a physical offset. For example, ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING represents a 3-row frame, from the row that precedes the current row to the row that follows the current row.
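A physical offset simply counts rows around the current position. `rowFrame` below is a hypothetical helper for illustration; clamping at the partition edges is an assumption of this sketch.

```scala
// Toy ROWS frame: offsets are positions, not ordering values.
object RowFrameSketch {
  // Selects rows from index (current - preceding) to (current + following),
  // clamped to the bounds of the partition.
  def rowFrame[A](rows: Seq[A], current: Int, preceding: Int, following: Int): Seq[A] =
    rows.slice(math.max(0, current - preceding), math.min(rows.length, current + following + 1))
}
```

With rows `a, b, c, d` and the current row at index 1, a frame of 1 PRECEDING and 1 FOLLOWING contains `a, b, c`; at index 0 it shrinks to `a, b` because no preceding row exists.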
object RowOrdering
object ScalarSubquery extends Serializable
object SizeBasedWindowFunction extends Serializable
object SortOrder extends Serializable
object SpecifiedWindowFrame extends Serializable
object StringTranslate extends Serializable
object SubqueryExpression
object TimeWindow extends Serializable
object TokenLiteral extends Serializable
object UnboundedFollowing extends FrameBoundary with Product with Serializable

UNBOUNDED FOLLOWING boundary.
object UnboundedPreceding extends FrameBoundary with Product with Serializable

UNBOUNDED PRECEDING boundary.
object UnsafeProjection
object UnspecifiedFrame extends WindowFrame with Product with Serializable

Used as a placeholder when a frame specification is not defined.
object VirtualColumn
object XxHash64Function extends InterpretedHashFunction
package aggregate
package codegen

A collection of generators that build custom bytecode at runtime for evaluating Catalyst expressions.
package objects
package xml

package expressions

Standard Expressions

Named Expressions

Evaluation

Type Members

case class Abs(child: Expression) extends UnaryExpression with ExpectsInputTypes with NullIntolerant with Product with Serializable

case class Acos(child: Expression) extends UnaryMathExpression with Product with Serializable

case class Add(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

case class AddMonths(startDate: Expression, numMonths: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

abstract class AggregateWindowFunction extends DeclarativeAggregate with WindowFunction

case class Alias(child: Expression, name: String)(exprId: ExprId = NamedExpression.newExprId, qualifier: Option[String] = None, explicitMetadata: Option[Metadata] = None, isGenerated: Boolean = false) extends UnaryExpression with NamedExpression with Product with Serializable

case class And(left: Expression, right: Expression) extends BinaryOperator with Predicate with Product with Serializable

case class ArrayContains(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

case class Ascii(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

case class Asin(child: Expression) extends UnaryMathExpression with Product with Serializable

case class AssertTrue(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

case class AtLeastNNonNulls(n: Int, children: Seq[Expression]) extends Expression with Predicate with Product with Serializable

case class Atan(child: Expression) extends UnaryMathExpression with Product with Serializable

case class Atan2(left: Expression, right: Expression) extends BinaryMathExpression with Product with Serializable

abstract class Attribute extends LeafExpression with NamedExpression with NullIntolerant

class AttributeEquals extends AnyRef

class AttributeMap[A] extends Map[Attribute, A] with Serializable

implicit class AttributeSeq extends Serializable

class AttributeSet extends Traversable[Attribute] with Serializable

case class BRound(child: Expression, scale: Expression) extends RoundBase with Serializable with ImplicitCastInputTypes with Product

case class Base64(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

trait BaseGenericInternalRow extends InternalRow

case class Bin(child: Expression) extends UnaryExpression with Serializable with ImplicitCastInputTypes with Product

abstract class BinaryArithmetic extends BinaryOperator

abstract class BinaryComparison extends BinaryOperator with Predicate

abstract class BinaryExpression extends Expression

abstract class BinaryMathExpression extends BinaryExpression with Serializable with ImplicitCastInputTypes

abstract class BinaryOperator extends BinaryExpression with ExpectsInputTypes

case class BitwiseAnd(left: Expression, right: Expression) extends BinaryArithmetic with Product with Serializable

case class BitwiseNot(child: Expression) extends UnaryExpression with ExpectsInputTypes with Product with Serializable

case class BitwiseOr(left: Expression, right: Expression) extends BinaryArithmetic with Product with Serializable

case class BitwiseXor(left: Expression, right: Expression) extends BinaryArithmetic with Product with Serializable

case class BoundReference(ordinal: Int, dataType: DataType, nullable: Boolean) extends LeafExpression with Product with Serializable

case class CallMethodViaReflection(children: Seq[Expression]) extends Expression with CodegenFallback with Product with Serializable

case class CaseWhen(branches: Seq[(Expression, Expression)], elseValue: Option[Expression] = None) extends CaseWhenBase with CodegenFallback with Serializable with Product

abstract class CaseWhenBase extends Expression with Serializable

case class CaseWhenCodegen(branches: Seq[(Expression, Expression)], elseValue: Option[Expression] = None) extends CaseWhenBase with Serializable with Product

case class Cast(child: Expression, dataType: DataType) extends UnaryExpression with NullIntolerant with Product with Serializable

case class Cbrt(child: Expression) extends UnaryMathExpression with Product with Serializable

case class Ceil(child: Expression) extends UnaryMathExpression with Product with Serializable

case class CheckOverflow(child: Expression, dataType: DecimalType) extends UnaryExpression with Product with Serializable

case class Coalesce(children: Seq[Expression]) extends Expression with Product with Serializable

case class Concat(children: Seq[Expression]) extends Expression with ImplicitCastInputTypes with Product with Serializable

case class ConcatWs(children: Seq[Expression]) extends Expression with ImplicitCastInputTypes with Product with Serializable

case class Contains(left: Expression, right: Expression) extends BinaryExpression with StringPredicate with Product with Serializable

case class Conv(numExpr: Expression, fromBaseExpr: Expression, toBaseExpr: Expression) extends TernaryExpression with ImplicitCastInputTypes with Product with Serializable

case class Cos(child: Expression) extends UnaryMathExpression with Product with Serializable

case class Cosh(child: Expression) extends UnaryMathExpression with Product with Serializable

case class Crc32(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

case class CreateArray(children: Seq[Expression]) extends Expression with Product with Serializable

case class CreateMap(children: Seq[Expression]) extends Expression with Product with Serializable

case class CreateNamedStruct(children: Seq[Expression]) extends Expression with CreateNamedStructLike with Product with Serializable

trait CreateNamedStructLike extends Expression

case class CreateNamedStructUnsafe(children: Seq[Expression]) extends Expression with CreateNamedStructLike with Product with Serializable

case class Cube(groupByExprs: Seq[Expression]) extends Expression with GroupingSet with Product with Serializable

case class CumeDist() extends RowNumberLike with SizeBasedWindowFunction with Product with Serializable

case class CurrentBatchTimestamp(timestampMs: Long, dataType: DataType) extends LeafExpression with Nondeterministic with CodegenFallback with Product with Serializable

case class CurrentDatabase() extends LeafExpression with Unevaluable with Product with Serializable

case class CurrentDate() extends LeafExpression with CodegenFallback with Product with Serializable

case class CurrentTimestamp() extends LeafExpression with CodegenFallback with Product with Serializable

case class DateAdd(startDate: Expression, days: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

case class DateDiff(endDate: Expression, startDate: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

case class DateFormatClass(left: Expression, right: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

case class DateSub(startDate: Expression, days: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

case class DayOfMonth(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

case class DayOfYear(child: Expression) extends UnaryExpression with ImplicitCastInputTypes with Product with Serializable

case class Decode(bin: Expression, charset: Expression) extends BinaryExpression with ImplicitCastInputTypes with Product with Serializable

case class DenseRank(children: Seq[Expression]) extends RankLike with Product with Serializable

final class DirectStringConsumer extends MemoryConsumer

case class Divide(left: Expression, right: Expression) extends BinaryArithmetic with NullIntolerant with Product with Serializable

case class DynamicFoldableExpression(expr: Expression) extends UnaryExpression with DynamicReplacableConstant with KryoSerializable with Product with Serializable

case class DynamicInSet(child: Expression, hset: IndexedSeq[Expression]) extends UnaryExpression with Predicate with Product with Serializable

trait DynamicReplacableConstant extends Expression

case class Elt(children: Seq[Expression]) extends Expression with ImplicitCastInputTypes with Product with Serializable