Whether to scan latest files first.
Whether to scan latest files first. If it's true, when the source finds unprocessed files in a trigger, it will first process the latest files.
Maximum age of a file that can be found in this directory, before it is ignored.
Maximum age of a file that can be found in this directory, before it is ignored. For the
first batch all files will be considered valid. If latestFirst
is set to true
and
maxFilesPerTrigger
is set, then this parameter will be ignored, because old files that are
valid, and should be processed, may be ignored. Please refer to SPARK-19813 for details.
The max age is specified with respect to the timestamp of the latest file, and not the timestamp of the current system. That this means if the last file has timestamp 1000, and the current system time is 2000, and max age is 200, the system will purge files older than 800 (rather than 1800) from the internal state.
Default to a week.
Options as specified by the user, in a case-insensitive map, without "path" set.
User specified options for file streams.