Data Engineering Data Mapping Kin
Project : June 2024
Business Background
Kin has three datasets about its products. Each dataset has different columns, but the product name and product code values are similar across all three. Kin wants to merge the datasets so that the result can be used for analysis.
Understanding Dataset
Kin has three datasets whose product name values are similar. For example, all three contain Ellips Cologne Gel with the variant MONDAY.
Next, look at the product code value for that product in each dataset.
Dataset 1 : FG30408.126.0050.C
Dataset 2 : FG20408.126.0502.C
Dataset 3 : FG20408.126.0502.C
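Datasets 2 and 3 share an identical code, while the Dataset 1 code differs in its first and third segments (FG30408 vs. FG20408, 0050 vs. 0502). Because the codes are this similar, they can serve as the basis for a relational key.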
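The comparison above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: it assumes the dot is the segment separator, and the names `split_code` and `matching_positions` are hypothetical helpers introduced here.

```python
# Product codes taken from the example above, one per dataset.
codes = {
    "dataset_1": "FG30408.126.0050.C",
    "dataset_2": "FG20408.126.0502.C",
    "dataset_3": "FG20408.126.0502.C",
}


def split_code(code):
    """Split a product code into its dot-separated segments."""
    return code.split(".")


def matching_positions(code_a, code_b):
    """Return the zero-based segment positions at which two codes agree."""
    pairs = zip(split_code(code_a), split_code(code_b))
    return [i for i, (a, b) in enumerate(pairs) if a == b]


# Show the segments of each code side by side.
for name, code in codes.items():
    print(name, split_code(code))

# Dataset 1 vs. Dataset 2 agree only on the second and fourth segments.
print(matching_positions(codes["dataset_1"], codes["dataset_2"]))  # [1, 3]
# Dataset 2 vs. Dataset 3 agree on every segment.
print(matching_positions(codes["dataset_2"], codes["dataset_3"]))  # [0, 1, 2, 3]
```

Checking which segments agree is one way to decide which parts of the code are stable enough to build a join key from.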
Task Done
- Split Product Code: check the similarity using Python code
- Create Relational Key: create a key for the join using Python code
- Merge & Mapping: merge and map based on the key using Python code
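The key creation and merge steps listed above might look roughly like the sketch below. It is written in plain Python and rests on several assumptions that do not come from the project: the key is built from the second and fourth code segments, the column names (`product_code`, `col_a`, `col_b`, `col_c`) and their values are placeholders, and `make_key` and `merge_on_key` are hypothetical helpers. The real key definition may well use other segments or include the product name.

```python
def make_key(product_code, positions=(1, 3)):
    """Build a join key from selected dot-separated segments of a code.

    The default positions are an assumption made for illustration only.
    """
    parts = product_code.split(".")
    return ".".join(parts[i] for i in positions)


def merge_on_key(*datasets):
    """Inner-join lists of row dicts on the key derived from product_code."""
    # Index each dataset by its derived key (a later duplicate key overwrites an earlier one).
    indexed = [
        {make_key(row["product_code"]): row for row in rows}
        for rows in datasets
    ]
    # Keep only the keys that appear in every dataset.
    shared_keys = set(indexed[0]).intersection(*indexed[1:])
    merged = []
    for key in sorted(shared_keys):
        combined = {"key": key}
        for n, index in enumerate(indexed, start=1):
            for column, value in index[key].items():
                # Prefix columns with the dataset number to avoid name clashes.
                combined[f"ds{n}_{column}"] = value
        merged.append(combined)
    return merged


# Placeholder rows: the codes come from the example, the other columns are made up.
dataset_1 = [{"product_code": "FG30408.126.0050.C", "col_a": "a1"}]
dataset_2 = [{"product_code": "FG20408.126.0502.C", "col_b": "b1"}]
dataset_3 = [{"product_code": "FG20408.126.0502.C", "col_c": "c1"}]

merged = merge_on_key(dataset_1, dataset_2, dataset_3)
print(merged[0]["key"])  # 126.C
```

A key this short is unlikely to be unique across a full product catalogue, so in practice it would need enough segments (or additional columns) to identify one product per row.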
Result
Below is a sample of the merged dataset: