Congrats for creating a nice and quick tool at having a glance at data. I particularly liked ‘compare data’ features.
I tried using Sweetviz on Titanic train & test data sets (from Kaggle) but I have some doubts/issues if you can answer.
How the association between the variables are measured?
I believe for both numerical variables, it’s Pearson’s correlation coefficient. However, the report shows it is ‘0’ for age-age [while it should be 1]
Also, ‘P-class’ and ‘Age’ are negatively associated but the report shows a positive association.
And for both categorical variables, is it measured by Cramer’s V? I didn’t find the details in the document. How about association between one numerical & one categorical variable.Correlation using pandas df.corr()