
Case study submission for the DataCamp Associate Data Analyst Certificate Exam.


ssarrayya/datacamp-associate-certification


DataCamp Associate Data Analyst Certification Exam

Pet Box Subscription

Background

PetMind is a US-based retailer of pet products.
PetMind sells a mix of luxury items, such as toys, and everyday items, such as food.
The company wants to increase sales by encouraging repeat purchases of everyday products, and it has been testing this approach for the past year. It now wants a report on how repeat purchases affect sales.

The dataset contains 1500 records and 8 variables. More on the variables can be found here

Tasks

  1. For every column in the data:
    • State whether the values match the description given in the table above.
    • State the number of missing values in the column.
    • Describe what you did to make values match the description if they did not match.
  2. Create a visualization that shows how many products are repeat purchases. Use the visualization to:
    • State which category of the variable repeat purchases has the most observations.
    • Explain whether the observations are balanced across categories of the variable repeat purchases.
  3. Describe the distribution of all of the sales. Your answer must include a visualization that shows the distribution.
  4. Describe the relationship between repeat purchases and sales. Your answer must include a visualization to demonstrate the relationship.
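
The numbers behind these four tasks can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions, not the code used in the submission: the column names `product_id`, `sales` and `repeat_purchase`, the 0/1 coding of repeat purchases, and the five sample rows are all made up here, since the variable table is not reproduced in this README.

```python
import csv
import io
from collections import Counter
from statistics import median

# Made-up sample standing in for the real 1500-row file.
# Column names and the 0/1 coding are assumptions, not taken from the dataset.
SAMPLE_CSV = """product_id,sales,repeat_purchase
1,1000.50,1
2,,0
3,750.25,1
4,1200.00,1
5,500.00,0
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, one per record."""
    return list(csv.DictReader(io.StringIO(text)))


def missing_counts(rows):
    """Task 1: number of empty cells in each column."""
    counts = Counter()
    for row in rows:
        for col, val in row.items():
            if val is None or val.strip() == "":
                counts[col] += 1
    return {col: counts[col] for col in rows[0]}


def repeat_counts(rows):
    """Task 2: number of observations per repeat_purchase category."""
    return Counter(row["repeat_purchase"] for row in rows)


def sales_summary(rows):
    """Task 3: (min, median, max) of the non-missing sales values."""
    values = [float(r["sales"]) for r in rows if r["sales"].strip()]
    return min(values), median(values), max(values)


def median_sales_by_repeat(rows):
    """Task 4: median sales within each repeat_purchase category."""
    groups = {}
    for row in rows:
        if row["sales"].strip():  # skip rows with missing sales
            groups.setdefault(row["repeat_purchase"], []).append(float(row["sales"]))
    return {key: median(vals) for key, vals in groups.items()}


rows = load_rows(SAMPLE_CSV)
print(missing_counts(rows))
print(repeat_counts(rows))
print(sales_summary(rows))
print(median_sales_by_repeat(rows))
```

Each function returns the quantity that the matching visualization would display: category counts for a bar chart in task 2, the spread of sales for a histogram in task 3, and per-category medians for a box plot in task 4. The median is used here because it is less sensitive to extreme sales values than the mean; that is a design choice of this sketch, not a requirement of the exam.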

The review process took 1 week from the submission date. Here's my certification :)

[Image: Author's DataCamp Data Analyst Associate Certification]