Clean the BAF Base dataset and write to 03_primary
Usage
clean_baf_base(
  in_prefix,
  out_prefix = "baf-fraud/03_primary/variant=Base",
  bucket_name = "lake",
  partitioning = "month",
  existing_data_behavior = c("overwrite", "error", "delete_matching"),
  verbose = TRUE
)
Arguments
- in_prefix
Character. Input dataset prefix inside the bucket (e.g. "02_intermediate/variant=Base"). Required; there is no default.
- out_prefix
Character. Output dataset prefix inside the bucket (e.g. "03_primary/variant=Base"). Default "baf-fraud/03_primary/variant=Base".
- bucket_name
Character. Bucket name. Default "lake".
- partitioning
Character vector of columns to partition the output by. Default "month". Set to NULL to disable partitioning.
- existing_data_behavior
Character. What to do when data already exists at out_prefix. One of "overwrite", "error", "delete_matching". Default "overwrite" (the first choice).
- verbose
Logical. Emit progress messages. Default TRUE.
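
Examples
The sketch below illustrates how the documented defaults resolve. It does not call clean_baf_base() itself: clean_baf_base_args() is a hypothetical stand-in that copies the signature from Usage and only returns the resolved arguments, so it runs without access to any bucket. It assumes the choice vector for existing_data_behavior is resolved with match.arg(), the usual R idiom for such a default, which is not confirmed by this page.

```r
# Hypothetical stand-in mirroring the documented signature. It only resolves
# and returns its arguments; it reads and writes nothing.
clean_baf_base_args <- function(
  in_prefix,
  out_prefix = "baf-fraud/03_primary/variant=Base",
  bucket_name = "lake",
  partitioning = "month",
  existing_data_behavior = c("overwrite", "error", "delete_matching"),
  verbose = TRUE
) {
  # Assumption: match.arg() picks the first choice when the argument is left
  # at its default, and rejects values outside the listed choices.
  existing_data_behavior <- match.arg(existing_data_behavior)
  list(
    in_prefix = in_prefix,
    out_prefix = out_prefix,
    bucket_name = bucket_name,
    partitioning = partitioning,
    existing_data_behavior = existing_data_behavior,
    verbose = verbose
  )
}

# Only the required argument is supplied; everything else uses the defaults.
defaults <- clean_baf_base_args("02_intermediate/variant=Base")

# Partitioning disabled, fail if output already exists, no progress messages.
strict <- clean_baf_base_args(
  "02_intermediate/variant=Base",
  partitioning = NULL,
  existing_data_behavior = "error",
  verbose = FALSE
)

str(defaults)
str(strict)
```

A call to the real function would take the same shape, for example clean_baf_base("02_intermediate/variant=Base", existing_data_behavior = "error").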