I very much enjoyed your tutorial. It was the first most comprehensive step by step one. What would you recommend for the following scenario. I have more than 90 tables in SQL and for data quality report, I need to report N and col % for all categorical variables within each table grouped by sites. And, report min, max, median, mean, BoxPlot graph for continuous variables. I can do that manually for each variables in a table but we are talking about close to 2000 variables. I appreciate any guidance/suggestions
Glad you found it useful. There are many ways to do it, I will list in order of my preference: 1. if you are familiar with pyspark, you can easily do this programmatically fairly easily. 2. If you need to do it in Power BI, you can use calculation groups : tenfingers.medium.com/combining-calculation-groups-in-power-bi-to-create-powerful-reports-cc06f85e8de6 3. Using dataflow : powerquery.microsoft.com/en-us/blog/low-code-data-quality-automation-using-dataflows/ hope this helps. good luck.
In the first section of the video there is a visual in the middle. My question would be: how did you solved that the values are bellow each other and not beside / next to each other?
Whoa that's deep!! 🤯
I very much enjoyed your tutorial. It was the first most comprehensive step by step one. What would you recommend for the following scenario. I have more than 90 tables in SQL and for data quality report, I need to report N and col % for all categorical variables within each table grouped by sites. And, report min, max, median, mean, BoxPlot graph for continuous variables. I can do that manually for each variables in a table but we are talking about close to 2000 variables. I appreciate any guidance/suggestions
Glad you found it useful. There are many ways to do it, I will list in order of my preference:
1. if you are familiar with pyspark, you can easily do this programmatically fairly easily.
2. If you need to do it in Power BI, you can use calculation groups : tenfingers.medium.com/combining-calculation-groups-in-power-bi-to-create-powerful-reports-cc06f85e8de6
3. Using dataflow : powerquery.microsoft.com/en-us/blog/low-code-data-quality-automation-using-dataflows/
hope this helps. good luck.
Excellent!!!
In the first section of the video there is a visual in the middle.
My question would be: how did you solved that the values are bellow each other and not beside / next to each other?
Please Share Dataset