Your datalake needs structure
A typical datalake contains raw data from many sources. The data can take various forms: unstructured, semi-structured or structured.

Unstructured: TXT, CSV
Semi-structured: XML, JSON
Structured: MySQL, Parquet

This post is about improving your datalake by using Parquet files for better downstream processing.

Why use Parquet files?

Structured files are preferred for data…
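To make the difference between the categories above concrete, here is a minimal sketch using only the Python standard library. It writes the same rows to CSV and to a store with a declared schema, then reads them back. The sample rows, the `readings` table, and its column names are hypothetical, and the in-memory SQLite database is only a stand-in for a structured store such as MySQL or Parquet; it is not how Parquet itself is written.

```python
import csv
import io
import sqlite3

# Hypothetical sample data: (sensor name, temperature, number of samples)
rows = [("sensor-a", 21.5, 3), ("sensor-b", 19.0, 7)]

# CSV round trip: no schema travels with the file,
# so every value comes back as a plain string.
buf = io.StringIO(newline="")
csv.writer(buf).writerows(rows)
buf.seek(0)
csv_rows = [tuple(r) for r in csv.reader(buf)]

# Structured round trip: the declared column types are kept by the store.
# SQLite is used here only as a stand-in for a schema-bearing format.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE readings (sensor TEXT, temperature REAL, samples INTEGER)"
)
conn.executemany("INSERT INTO readings VALUES (?, ?, ?)", rows)
db_rows = conn.execute(
    "SELECT sensor, temperature, samples FROM readings ORDER BY sensor"
).fetchall()
conn.close()

print(csv_rows[0])  # all fields are strings
print(db_rows[0])   # the numeric fields keep their types
```

With the CSV copy, every downstream job has to guess or re-declare the types. With the structured copy, the types arrive with the data.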