This dataset contains historical sales data from a supermarket company. The data includes records from three different branches over a three-month period.
- Invoice ID: Automatically generated identification number for sales slips.
- Branch: Supercenter branch (identified as A, B, or C).
- City: Location of the supercenters.
- Customer Type: Type of customers, categorized as "Members" for those using a member card and "Normal" for those without.
- Gender: Gender of the customer.
- Product Line: Categorization of general items, including Electronic Accessories, Fashion Accessories, Food and Beverages, Health and Beauty, Home and Lifestyle, and Sports and Travel.
- Unit Price: Price of each product in dollars ($).
- Quantity: Number of products purchased by the customer.
- Tax: 5% tax fee for customers.
- Total: Total price including tax.
- Date: Purchase date (recorded from January 2019 to March 2019).
- Time: Purchase time (from 10 AM to 9 PM).
- Payment: Payment method used by the customer (Cash, Credit Card, or Ewallet).
- COGS: Cost of goods sold.
- Gross Margin Percentage: Gross margin percentage.
- Gross Income: Gross income.
- Rating: Customer satisfaction rating based on their overall shopping experience (rated on a scale of 1 to 10).
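The descriptions of Unit Price, Quantity, Tax, Total, and COGS suggest a simple arithmetic relationship between these columns. The sketch below illustrates it under two assumptions that the descriptions do not state explicitly: that COGS equals unit price times quantity, and that the 5% tax is computed on COGS. The function name `invoice_totals` and the input values are hypothetical and are not taken from the dataset.

```python
from decimal import Decimal

TAX_RATE = Decimal("0.05")  # the 5% tax fee from the column description


def invoice_totals(unit_price, quantity):
    """Derive COGS, tax, and total for one invoice line.

    Assumption (not stated in the source): COGS = unit price * quantity,
    and the tax is charged on COGS.
    """
    cogs = Decimal(unit_price) * quantity
    tax = cogs * TAX_RATE
    total = cogs + tax  # "Total price including tax"
    return {"cogs": cogs, "tax": tax, "total": total}


# Hypothetical invoice line: 3 items at $10.00 each.
result = invoice_totals("10.00", 3)
print(result["cogs"], result["tax"], result["total"])
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.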
- General Questions related to the existence of:
  - missing values?
  - wrong data types for columns?
  - complete duplicates in the data?
  - outliers in each column?
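One way to approach these four checks is sketched below using only the Python standard library. The rows are made-up records for illustration, not data from the dataset, and the helper names are hypothetical. The 1.5 × IQR rule is one common outlier heuristic; it is an assumption here, not necessarily the method used in the original analysis.

```python
from statistics import quantiles

# Hypothetical rows for illustration only; values are read as strings,
# as they would be when loaded from a CSV file.
rows = [
    {"Invoice ID": "INV-001", "Branch": "A", "Quantity": "7", "Rating": "9.1"},
    {"Invoice ID": "INV-002", "Branch": "C", "Quantity": "5", "Rating": ""},
    {"Invoice ID": "INV-002", "Branch": "C", "Quantity": "5", "Rating": ""},
    {"Invoice ID": "INV-003", "Branch": "B", "Quantity": "six", "Rating": "7.4"},
]


def count_missing(rows):
    """Count empty or absent values per column."""
    columns = rows[0].keys()
    return {c: sum(1 for r in rows if r.get(c) in (None, "")) for c in columns}


def non_numeric(rows, column):
    """Return non-empty values in `column` that do not parse as numbers."""
    bad = []
    for r in rows:
        value = r.get(column)
        if value in (None, ""):
            continue
        try:
            float(value)
        except ValueError:
            bad.append(value)
    return bad


def count_full_duplicates(rows):
    """Count rows that repeat an earlier row in every column."""
    seen = set()
    duplicates = 0
    for r in rows:
        key = tuple(sorted(r.items()))
        if key in seen:
            duplicates += 1
        seen.add(key)
    return duplicates


def iqr_outliers(values):
    """Return values outside 1.5 * IQR of the first and third quartiles."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]


print(count_missing(rows))
print(non_numeric(rows, "Quantity"))
print(count_full_duplicates(rows))
print(iqr_outliers([10, 11, 12, 13, 14, 15, 100]))
```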
- Univariate Analysis
  - Which branch has the largest sales numbers?
  - Which gender visits the branches more often?
  - What is the best-selling product line in the branches?
  - Which payment method do customers prefer?
  - Which customer type visits the branches more often?
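Each univariate question above comes down to counting how often every value occurs in a single column. A minimal sketch with `collections.Counter` follows; the records are hypothetical and the helper names are illustrative only.

```python
from collections import Counter

# Hypothetical records for illustration only; not taken from the dataset.
rows = [
    {"Branch": "A", "Payment": "Ewallet"},
    {"Branch": "A", "Payment": "Cash"},
    {"Branch": "B", "Payment": "Ewallet"},
    {"Branch": "C", "Payment": "Credit card"},
]


def value_counts(rows, column):
    """Count how many records carry each distinct value of `column`."""
    return Counter(r[column] for r in rows)


def most_common_value(rows, column):
    """Return the value of `column` that appears in the most records."""
    return value_counts(rows, column).most_common(1)[0][0]


print(value_counts(rows, "Payment"))
print(most_common_value(rows, "Branch"))
```

The same two helpers apply to the gender, product line, and customer type columns by changing the column name.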
- Bivariate Questions
  - Which branch has the highest gross income?
  - Is there a relationship between gross income and customer rating?
  - Which gender visits the branches more often?
  - What is the gross income from male and female customers in each branch?
  - Which product lines have the highest gross income in each branch?
  - What is the relationship between customer type, branch, and gross income?
  - Which product line does each gender prefer?
  - Which product line has the highest gross income?
  - Which month has the highest gross income?
  - Which product line has the highest sales in each month?
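Most of the bivariate questions above combine one or two categorical columns with a numeric one, which amounts to a grouped sum. A minimal sketch follows, assuming gross income is stored as a number under the key `gross income`; the records and the helper name `sum_by` are hypothetical.

```python
from collections import defaultdict

# Hypothetical records for illustration only; not taken from the dataset.
rows = [
    {"Branch": "A", "Gender": "Female", "gross income": 10.0},
    {"Branch": "A", "Gender": "Male", "gross income": 5.5},
    {"Branch": "C", "Gender": "Female", "gross income": 12.0},
    {"Branch": "C", "Gender": "Female", "gross income": 4.5},
]


def sum_by(rows, keys, value):
    """Sum the `value` column for every distinct combination of `keys`."""
    totals = defaultdict(float)
    for r in rows:
        group = tuple(r[k] for k in keys)
        totals[group] += r[value]
    return dict(totals)


# Gross income per branch, then per branch and gender.
by_branch = sum_by(rows, ["Branch"], "gross income")
by_branch_gender = sum_by(rows, ["Branch", "Gender"], "gross income")

# Group with the highest total gross income.
top_branch = max(by_branch, key=by_branch.get)
print(by_branch, by_branch_gender, top_branch)
```

Group keys are tuples, so a single-column grouping yields keys such as `("A",)`.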
- Reached Results from Univariate Analysis
  - Branch A has the largest sales numbers.
  - Most customers visiting the branches are female.
  - Fashion accessories is the best-selling product line in the branches.
  - Customers prefer Ewallet and cash payments to credit card.
  - Most customers visiting the branches have a member card.
- Reached Results from Bivariate Questions
  - Branch C stands out slightly, with higher gross income than Branches A and B. Although Branch A has slightly higher sales, Branch C emerges as the most profitable branch in terms of gross income.
  - There is no relationship between rating and gross income.
  - Branches A and B have more male than female customers, while Branch C has more female than male customers. Gross income from female customers is greater than from male customers in each branch.
  - Electronic accessories and Home and lifestyle have the most sales in Branch A; Health and beauty and Sports and travel have the most sales in Branch B; and Food and beverages and Fashion accessories have the most sales in Branch C.
  - In Branches A and B, Normal customers outnumber Member customers, while in Branch C, Member customers outnumber Normal customers. As the number of Member customers in a branch increases, total gross income increases.
  - As the number of female customers increases, sales of the Fashion accessories product line increase; as the number of male customers increases, sales of the Health and beauty product line increase.
  - Gross income is highest for Food and beverages.
  - January has the highest gross income.
  - In January, Sports and travel and Fashion accessories are the best-selling product lines; in February, Food and beverages and Fashion accessories; and in March, Electronic accessories and Home and lifestyle.