top of page

Exploratory Data Analysis

Alfred Essa believes that Exploratory Data Analysis is a detective work and i guess that's what we are supposed to do; dig deep and understand each piece of the data. EDA may not help me to discover the whole story but it serves as the foundation stone for forming further hypothesis or doing research or building models.

The EDA approach is precisely that--an approach--not a set of techniques, but an attitude/philosophy about how a data analysis should be carried out. EDA involves the analyst trying to get a “feel” for the data set, often using their own judgment to determine what the most important elements in the data set are. For example, multidimensional scaling is an EDA that uses visual representations of distances or similarities between sets of objects; It’s up to the user to interpret exactly what the distances represent.

Density & Box Plot

  • Calories, Cups, Carbohydrates and Weight are the only quantitative variables which follow normal distribution.

  • Fiber, Protein, Fat, Potass and Rating are positively skewed, whereas, Sodium is the only variable which is negatively skewed.

  • Sodium has the greatest range among all the variables, it varies from 0 to 320 with mean of 159.7.

  • 1.03 ounces is average serving.

  • The average rating for the products was 42.67 on the scale of 100.

Calories

© 2020 by Raj Kumar Beniwal. Proudly created with Wix.com

bottom of page