Convert BAF CSV to partitioned Parquet in MinIO (S3)
convert_to_parquet.Rd
Reads Base.csv from a MinIO/S3 bucket prefix (e.g., "01_raw") and writes a
Hive-style partitioned Parquet dataset to another prefix (e.g., "02_intermediate"),
partitioned by variant (e.g., variant=Base/part-*.parquet).
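The layout described above can be tried locally. The following is a minimal sketch, not the function's actual implementation: it assumes the arrow package is installed, writes to a temporary directory instead of MinIO/S3, and uses a made-up data frame (the columns id and amount are hypothetical) in place of Base.csv.

```r
# Sketch only: local temporary directory instead of a MinIO/S3 prefix.
library(arrow)

# Hypothetical stand-in for the contents of Base.csv.
baf <- data.frame(
  id = 1:3,
  amount = c(10.5, 20.0, 7.25),
  variant = c("Base", "Base", "Base")
)

# A fresh directory playing the role of the "02_intermediate" prefix.
out_dir <- tempfile("02_intermediate_")

# Partitioning by `variant` writes one key=value directory per distinct
# value, which is the Hive-style layout.
write_dataset(baf, out_dir, format = "parquet", partitioning = "variant")

# Relative paths of the written files, of the form variant=Base/part-*.parquet.
files <- list.files(out_dir, recursive = TRUE)
print(files)

# Opening the directory as a dataset recovers `variant` from the directory
# names, so it appears in the schema again.
schema_names <- open_dataset(out_dir)$schema$names
print(schema_names)
```

The partition column is stored in the directory name rather than inside each Parquet file, which is why reading a single part file directly would not show a variant column.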
Details
Connection settings are taken from environment variables:
BAF_ENDPOINT (e.g. "minio:9000" or "192.168.4.xx:9000")
BAF_KEY (MinIO access key)
BAF_SECRET (MinIO secret key)
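Because the settings come from the environment, it can help to confirm that all three are present before starting a conversion. The helper below is a hypothetical illustration in base R; read_baf_settings() is not part of the documented API, and how convert_to_parquet() itself reads these variables is not specified here.

```r
# Hypothetical helper (not part of the package): collect the three
# connection settings and fail early if any is unset or empty.
read_baf_settings <- function() {
  vars <- c("BAF_ENDPOINT", "BAF_KEY", "BAF_SECRET")
  # unset = NA makes a missing variable distinguishable from an empty one.
  values <- Sys.getenv(vars, unset = NA)
  missing <- vars[is.na(values) | values == ""]
  if (length(missing) > 0) {
    stop("Missing environment variables: ", paste(missing, collapse = ", "))
  }
  as.list(values)
}

# Usage with placeholder values.
Sys.setenv(
  BAF_ENDPOINT = "minio:9000",
  BAF_KEY = "YOUR_ACCESS_KEY",
  BAF_SECRET = "YOUR_SECRET_KEY"
)
settings <- read_baf_settings()
print(settings$BAF_ENDPOINT)
```

Failing before any data is read keeps a missing credential from surfacing later as a less obvious connection error.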
Examples
if (FALSE) { # \dontrun{
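  # Provide the MinIO connection settings for this session.
  Sys.setenv(
    BAF_ENDPOINT = "minio:9000",
    BAF_KEY = "YOUR_ACCESS_KEY",
    BAF_SECRET = "YOUR_SECRET_KEY"
  )
  # Read Base.csv under "01_raw" and write partitioned Parquet under "02_intermediate".
  convert_to_parquet(from_prefix = "01_raw", to_prefix = "02_intermediate", bucket_name = "lake")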
} # }