Create your YAML file

A YAML file for Fakelake is composed of two parts, columns and info:

Columns

A list of columns, each with a name, a provider, the provider's parameters, and options.
See the providers documentation for the list of available providers.

Example of a file with one column:

columns:
  - name: unique_id
    provider: Increment.integer
    start: 100
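
Because columns is a list, each additional column is one more entry introduced by a dash. The following is a minimal sketch that reuses only the provider shown above; the second column's name and start value are hypothetical and chosen purely for illustration:

```yaml
columns:
  # First column: same as the one-column example.
  - name: unique_id
    provider: Increment.integer   # provider that generates the values
    start: 100                    # provider parameter
  # Second column: hypothetical entry, shown only to illustrate a second list item.
  - name: line_number
    provider: Increment.integer
    start: 1
```

Each entry keeps the same shape: a name, a provider, then that provider's parameters indented at the same level.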

Info

Settings for the generated file, such as its name, format, and number of rows. For details on setting up the generated file, see here.

Example of a parquet file of 10 million rows:

info:
  output_name: generated_file
  output_format: parquet
  rows: 10_000_000

Example

columns:
  - name: unique_id
    provider: Increment.integer
    start: 100

info:
  output_name: generated_file
  output_format: parquet
  rows: 10_000_000

That's it! This is enough to generate a parquet file.

Next step: generate it.