Hello,
Recently I tried plotting a box plot chart and had some difficulty understanding the results and was unable to quantify them.
I used a fuel efficiency data set (attached here) that analyses the MPG across different car manufacturers. From that file I used 'Mfr Name' as the dimension/X-axis and 'CombMPG' as the measure/Y-axis.
Initially I used 'Mfr Name' as Dim1 and 'CombMPG' as the measure (I had a tough time using it without aggregation), but I did not get individual box plots for the different manufacturers.
After some reading in the community, I understand that 'Mfr Name' has to be the inner dimension, so I used 'CombMPG' as Dim1, as a dummy dimension. Now I get individual box plots for all manufacturers.
I have plotted the same data in Python with Matplotlib. Comparing the two charts, I can see that some of the outliers are missing, especially the ones below the minimum of the range (e.g. Mercedes-Benz).
Kindly help me understand the following:
1. Why is a dimension mandatory, and why do I have to add two dimensions in this scenario to get the result?
2. How can I replicate the result from Python Matplotlib (screenshots attached)?
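
For reference, as far as I understand it, Matplotlib's default box plot treats any point more than 1.5 × IQR below the first quartile or above the third quartile as an outlier. Below is a minimal sketch of that rule using only the Python standard library. The function name `tukey_outliers` and the sample values are made up for illustration; they are not taken from the attached file.

```python
from statistics import quantiles


def tukey_outliers(values, whis=1.5):
    """Split out points beyond whis * IQR from the quartiles.

    Returns (low_outliers, high_outliers). Quartiles use linear
    interpolation between data points (method="inclusive").
    """
    data = sorted(values)
    q1, _median, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - whis * iqr
    high_fence = q3 + whis * iqr
    low = [v for v in data if v < low_fence]
    high = [v for v in data if v > high_fence]
    return low, high


# Hypothetical CombMPG values for a single manufacturer (illustration only)
sample = [12, 21, 22, 22, 23, 24, 24, 25, 26, 40]
low, high = tukey_outliers(sample)
print("below lower fence:", low)
print("above upper fence:", high)
```

If the other chart uses a different whisker definition (for example min/max or percentile-based whiskers), the low values would be drawn as part of the whisker rather than as separate points, which may be what I am seeing.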
Thanks,
karthik