Analysis on MechaCar data to help the manufacturing team, using R to run multiple linear regression, summary statistics, and t-tests.

lingumd/MechaCar_Statistical_Analysis

MechaCar_Statistical_Analysis

Purpose

AutosRUs' newest prototype, the MechaCar, is suffering from production troubles that are blocking the manufacturing team's progress. This analysis reviews the production data for insights that may help the AutosRUs manufacturing team.

This includes the following tasks:

  • Perform multiple linear regression analysis to identify which variables in the dataset predict the mpg of MechaCar prototypes
  • Collect summary statistics on the pounds per square inch (PSI) of the suspension coils from the manufacturing lots
  • Run t-tests to determine if the manufacturing lots are statistically different from the population mean
  • Design a statistical study to compare vehicle performance of the MechaCar vehicles against vehicles from other manufacturers

Each statistical analysis is followed by a summary interpretation of the findings.

Linear Regression to Predict MPG

The variables that provided a non-random amount of variance to the mpg values in the dataset are vehicle length and ground clearance. The intercept is also statistically significant.

This linear model is a statistically significant predictor of the mpg of MechaCar prototypes: its overall p-value is 5.35e-11, well below the 0.05 significance level.

Del1.2

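The slope of the linear model is not considered zero. Because the overall p-value (5.35e-11) is below 0.05, the null hypothesis that every coefficient is zero is rejected, so at least one coefficient differs from zero.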

Del1.1
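As a rough illustration of how such a regression can be run and read in R, the sketch below fits `lm()` on simulated data. The data frame `mecha`, its column names, and the simulated effect sizes are assumptions made for this example only; they are not the project's dataset or results.

```r
# Simulated stand-in for the MechaCar mpg data (hypothetical columns and values)
set.seed(42)
n <- 50
mecha <- data.frame(
  vehicle_length   = rnorm(n, mean = 15, sd = 2),
  vehicle_weight   = rnorm(n, mean = 6000, sd = 1500),
  ground_clearance = rnorm(n, mean = 12, sd = 2)
)
# By construction, mpg depends on length and clearance; weight has no effect
mecha$mpg <- -100 + 6 * mecha$vehicle_length +
  3.5 * mecha$ground_clearance + rnorm(n, sd = 8)

# Fit the multiple linear regression
model <- lm(mpg ~ vehicle_length + vehicle_weight + ground_clearance,
            data = mecha)
model_summary <- summary(model)

# Per-coefficient p-values: which predictors differ from zero?
coef_p <- coef(model_summary)[, "Pr(>|t|)"]

# Overall F-test p-value: is at least one slope non-zero?
f <- model_summary$fstatistic
overall_p <- unname(pf(f["value"], f["numdf"], f["dendf"],
                       lower.tail = FALSE))

print(round(coef_p, 4))
print(overall_p)
```

Coefficients whose p-value falls below 0.05 are the ones treated as contributing non-random variance, and `overall_p` is the value compared against 0.05 to decide whether the slope can be considered zero.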

Summary Statistics of Suspension Coils

The manufacturing data shows that the variance of the suspension coils does not exceed 100 pounds per square inch when looking at all the manufacturing lots in total.

Del2.2

However, the manufacturing data for each lot individually shows that Lots 1 and 2 meet the design specifications, with variances below 100 pounds per square inch (0.9795918 and 7.4693878, respectively), while Lot 3 does not, with a variance of 170.2861224 pounds per square inch.

Del2.1
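The total and per-lot summaries can be computed along the following lines. This is a minimal base-R sketch on simulated data: the data frame `coils` and the column names `Manufacturing_Lot` and `PSI` are assumptions for illustration, and the numbers it produces are not the figures reported above.

```r
# Simulated suspension coil data (hypothetical column names and values)
set.seed(1)
coils <- data.frame(
  Manufacturing_Lot = rep(c("Lot1", "Lot2", "Lot3"), each = 50),
  PSI = c(rnorm(50, 1500, 1), rnorm(50, 1500, 3), rnorm(50, 1496, 13))
)

# Helper that returns the four summary statistics for a numeric vector
psi_stats <- function(x) {
  data.frame(Mean = mean(x), Median = median(x),
             Variance = var(x), SD = sd(x))
}

# Summary across all manufacturing lots
total_summary <- psi_stats(coils$PSI)

# Summary for each lot separately (one row per lot)
lot_summary <- do.call(
  rbind,
  lapply(split(coils$PSI, coils$Manufacturing_Lot), psi_stats)
)

# Check each lot against the 100 PSI variance limit
lot_summary$Meets_Spec <- lot_summary$Variance <= 100

print(total_summary)
print(lot_summary)
```

Splitting by lot before summarizing matters here because a high-variance lot can be masked when all lots are pooled together.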

T-Tests on Suspension Coils

Null hypothesis (H0): mu = 1500

Alternative hypothesis (Ha): mu ≠ 1500

All lots: There is not sufficient evidence to reject the null hypothesis. The sample mean and the population mean are statistically similar.

Del3.1

Lot 1: There is not sufficient evidence to reject the null hypothesis. The Lot 1 mean and the population mean are statistically similar.

Del3.2

Lot 2: There is not sufficient evidence to reject the null hypothesis. The Lot 2 mean and the population mean are statistically similar.

Del3.3

Lot 3: There is sufficient evidence to reject the null hypothesis. The Lot 3 mean and the population mean are statistically different.

Del3.4
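The one-sample t-tests above can be sketched with `t.test()` and its `mu` argument. The data below is simulated, and the names `coils`, `Manufacturing_Lot`, and `PSI` are assumptions for illustration, so the p-values it prints will not match the project's output.

```r
# Simulated suspension coil data (hypothetical column names and values)
set.seed(7)
coils <- data.frame(
  Manufacturing_Lot = rep(c("Lot1", "Lot2", "Lot3"), each = 50),
  PSI = c(rnorm(50, 1500, 1), rnorm(50, 1500, 3), rnorm(50, 1496, 13))
)

# Two-sided one-sample t-test of all lots against the population mean of 1500
all_test <- t.test(coils$PSI, mu = 1500)

# The same test run separately for each manufacturing lot
lot_p <- sapply(
  split(coils$PSI, coils$Manufacturing_Lot),
  function(x) t.test(x, mu = 1500)$p.value
)

# Reject the null hypothesis where the p-value is below 0.05
reject_null <- lot_p < 0.05

print(all_test$p.value)
print(lot_p)
print(reject_null)
```

A `TRUE` in `reject_null` corresponds to "sufficient evidence to reject the null hypothesis" for that lot, and `FALSE` to the opposite conclusion.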

Study Design: MechaCar vs Competition

  • One metric to test is whether maintenance cost has a statistically significant relationship with mpg.
  • The null hypothesis would be that all coefficients in the model are zero and none of the predictor variables have a statistically significant relationship with mpg. The alternative hypothesis would be that not every coefficient in the model is zero and at least one of the predictor variables has a statistically significant relationship with mpg.
  • A multiple linear regression would need to be performed.
  • Maintenance cost data for vehicles would need to be collected by random sampling, with a reasonably large sample size.
