Skip to content

may215/scrape-airbnb

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scrape-airbnb

Scrapes metadata from Airbnb listings and saves them as JSON files. Made as part of a week-long web scraping workshop led by Jonathan Puckey at Non-Linear Narrative, a masters programme at the Royal Academy of Art The Hague.

Usage

# Install dependencies
npm i
# Run the main script
node scrape

Manipulating the data

The folder examples contains various examples of manipulating the data. Some of them require that you run object recognition on the scraped data first. Use our tool airbnb-object-recognition. To see a comprehensive overview of the data structure, see the Model.

node examples/[example name]

Basics

  • examining-properties – basic boilerplate to get metadata out of listings
  • examining-adjectives – get adjectives from comments
  • amenities – print out all amenities found in the listings
  • geo-json – extract coordinates from listings
  • minutes-to – print all listings that contain the phrase "X minutes to Y"
  • properties-by-host – see all listings of one host

Imagery

  • copy-bedrooms – get all listing photos that contain beds and copy them to a separate folder
  • copy-group-images-by-type sort listing photos into folders by type
  • copy-order-images-by-cleanliness sort listing photos by Airbnb's cleanliness rating
  • copy-order-images-by-price sort listing photos by price
  • copy-order-images-by-type-by-price sort listing photos by type, then price
  • copy-people – get all listing photos that contain people and copy them to a separate folder
  • copy-random-bed – get random listing photos of bedrooms

Specific

  • export-frontend-data – create JSON files containing all recognized objects for each listing and the other way around
  • neighbourhoods – use GPS coordinates from neighbourhoods in The Hague to determine which neighbourhood each listing is in

About this project

airbnb-object-recognition and scrape-airbnb were written as part of a workshop at Non-Linear Narrative at Royal Academy of Art The Hague. They are tools used to create No Home Like Place.

About

Scrapes metadata from Airbnb listings

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 100.0%