Skip to content

Project analyzing the City of Austin's Open Data for 8.4M shared scooter rides and also compare the city's 311 complaints about the same. Find insights about the popular spots for scooter rides and the correlation if these areas also have more scooter complaints and more | Python Project 1 | UT Data Analysis and Visualization Nov 2019 - May 2020

Notifications You must be signed in to change notification settings

sheetalbongale/Scooters-In-Austin-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scooters (Shared Mobility) in Austin - Data Analysis

Austin Scooter

"In 2018, people took 84 million trips on Shared Micromobility in the United States, more than double the number of trips taken in 2017" - NACTO

Our Question:

  • Do the most popular zip codes to start or end rides also have the most complaints?
  • Does frequency of ending scooter rides in zipcode impact complaints when adjusted on per capita basis?
  • Number of rides in certain months of the year.
  • Does the distance of the trip impact the method of transportation (scooter vs. bicycle)
  • What are the 10 most frequent routes for all methods?

Hypothesis:

While we thought that obviously the neighborhoods with the most rides would likely have the most complaints, we also thought that some neighborhoods complain about a higher percent of the scooters that enter their area than other neighborhoods do. We figured these would be neighborhoods that are not as popular start points, who see the scooters longer and thus are more likely to complain.

Data Set:

Technologies Used:

  • Pandas
  • NumPy
  • Matplotlib
  • GeoPandas
  • Plotly.js
  • SQLAlchemy

Actions and Tasks:

  • Data Extraction
  • Data Cleaning and Exploration
  • Data Story and Visualization
  • Writing Analysis and Conclusions

Notebooks:

  • shared_mobility_data_wrangling.ipynb: Data munging and generates clean data CSV file for the Shared Mobility API Data. Also contains script to merge zipcodes to census tracts. The final merged CSV file is what we use for further analysis and visualization.
  • shared_mobility_data_story.ipynb: Data Visualization for shared mobility and some incomplete further work.
  • 311_data_wrangling.ipynb: Data munging and generates clean data CSV file for the 311 Complaints API Data.
  • 311_data_story.ipynb: Data Visualization for 311 data.
  • write_to_sql.py: Convert the 311 dataframe to SQL Database.

Analysis & Conclusions:

  1. 78701 Zipcode has the most number of starting and ending rides.
  2. 78701 also had the most 311 complaints about shared mobility and dockless vehicles.
  3. Saturday was the most popular day to ride a bicycle or a scooter. Also, Weekend spike indicates people like to ride scooters recreationally, or while they are enjoying activities that are outside of their normal weekly commute.
  4. We can see there is a rush hour after work. We saw a peak in the number of rides at 5pm and also before & after 5pm.
  5. October, September and March have the most rides. This is when the weather is nicest in Austin, but also when the biggest Music festivals are.

Which hours of the day are most popular?

trips_per_hour.png

What are the most popular days of the week?

trips_per_week.png

Are some months more popular than others?

trips_per_month2.png

Total daily rides per hour for each day of the week?

dailyrides_per_hour.png

Which census tracts have the most traffic?

trips_per_censustract.png

What are the most popular Census Tracts to start a ride?

choropeth_tractstart.png

What are the most popular Census Tracts to end a ride?

choropeth_tractsend.png

Which Zipcodes have the most traffic? (Without per capita factor)

trips_per_zipcode.png

What are the top 200 popular routes to start and end rides?

trips_popularroutes.png

What locations in Austin get the most complaints about shared mobility??

complains.png

Major limitations in our data:

  1. The data set for the shared mobility data is limited by geography, we can’t get more specific than census tract. Hence we used Census tract shapefiles and geopandas.
  2. 311 data was not clear about what the complaints were specifically about — you would have to manually find them on the 311. We also couldn’t see if it was business, resident, or passerby.
  3. Joyriding, some rides are really short— some people just turn scooters on and don’t seem to take it anywhere. Our overall hypothesis and the census tract choropeth plots will change if we discard all the rows for rides with '0'meter distance.
  4. We’d like to isolate the music festival dates out and see how much those specific dates impact the data.
  5. 78701: Neighborhoods that are mostly commercial have a lot of people who work there, own businesses there, walk around there, but do not show up in the population. It is possible that business owners are more likely to complain because they are made less available to customers due to scooter issues. Nearly 100k people work in downtown Austin which is drastically different from the number of people who are living there. We saw rush hour right after work.

Resources Used:

Documentation about Dataset:

Team Members: Erin Bentley, Jason Jones, Sana Khan, Sheetal Bongale

UT Data Analysis & Visualization Bootcamp | January 2020

Scooter meme

About

Project analyzing the City of Austin's Open Data for 8.4M shared scooter rides and also compare the city's 311 complaints about the same. Find insights about the popular spots for scooter rides and the correlation if these areas also have more scooter complaints and more | Python Project 1 | UT Data Analysis and Visualization Nov 2019 - May 2020

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published