extract

Synopsis

starlake extract [options]

Description

Run both extract-schema and extract-data in sequence for a given configuration. This is the recommended way to extract table schemas and data from a JDBC database in a single step. See Extract Tutorial.

Parameters

This command accepts the combined options of extract-schema and extract-data.

| Parameter | Cardinality | Description |
| --- | --- | --- |
| --config <value> | Required | Database tables & connection info |
| --outputDir <value> | Required | Where to output files (YML schemas and CSV data) |
| --tables <value> | Optional | Database tables to extract |
| --connectionRef <value> | Optional | Database connection to use |
| --all | Optional | Extract all schemas and tables to the external folder |
| --external | Optional | Output YML files in the external folder |
| --parallelism <value> | Optional | Parallelism level of the extraction process. Defaults to the number of available cores |
| --snakecase | Optional | Apply snake case when names are sanitized |
| --limit <value> | Optional | Limit the number of records |
| --numPartitions <value> | Optional | Parallelism level for partitioned tables |
| --ignoreExtractionFailure | Optional | Don't fail the extraction job when any extraction fails |
| --clean | Optional | Clean all files of a table only when that table is extracted |
| --incremental | Optional | Export only new data since the last extraction |
| --ifExtractedBefore <value> | Optional | DateTime to compare with the start time of the last extraction. If the last extraction started before this DateTime, the extraction runs; otherwise it is skipped |
| --includeSchemas <value> | Optional | Domains to include during extraction |
| --excludeSchemas <value> | Optional | Domains to exclude during extraction. Ignored if includeSchemas is defined |
| --includeTables <value> | Optional | Tables to include during extraction |
| --excludeTables <value> | Optional | Tables to exclude during extraction. Ignored if includeTables is defined |
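
Example

As a rough illustration of how the options above combine, the following sketch assembles one invocation using only flags listed in the table. The config name `my-extract-config`, the output folder `/tmp/starlake-extract`, and the schema name `sales` are hypothetical placeholders, and the exact value formats accepted by each option are assumptions to check against your installation. The command is only printed; remove the `echo` to execute it.

```shell
# Hypothetical values: substitute your own extraction config and output folder.
CONFIG="my-extract-config"
OUTPUT_DIR="/tmp/starlake-extract"

# Print the full command line, built only from flags in the parameter table.
# "sales" is a placeholder schema name; 4 and 1000 are arbitrary sample values.
extract_cmd() {
  echo starlake extract \
    --config "$CONFIG" \
    --outputDir "$OUTPUT_DIR" \
    --includeSchemas sales \
    --parallelism 4 \
    --limit 1000 \
    --ignoreExtractionFailure
}

# Dry run: show the command for review before running it for real.
extract_cmd
```

Printing first makes it easy to confirm the two required options (`--config` and `--outputDir`) are present before any database connection is attempted.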