Skip to content

Node powered scraper that iterates trough all the internal links of the specified url. It works on CSR pages (React, Angular) with dynamic urls.

Notifications You must be signed in to change notification settings

jvidalv/super-simple-sitemap-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

License NPM Version NPM Downloads

A node.js powered scrapper 🔥 that iterates trough all the internal links of the specified url.

It works on CSR pages (React, Angular) with dynamic urls.

Once it is done it generates a sitemap.xml file with all the urls found, ready to be uploaded to Google Search Console.

Usage:

$ sitemap https://vvlog.dev

Params:

Parameter type default description
--wait integer 1500 Specify the time (milliseconds) to wait (So the fetches are completed) before starting to parse the page.
--limit integer 999999 Specify the limit of urls to parse before stopping the scrapper.

Todo:

  • Make it a NPM package.
  • Make wait time dynamic in response of fetches inside url.
  • New params that lets you specify how deep you want to go inside the url.
  • Integrate it as part of build process of a create-react-app.
  • Clean old code.

About

Node powered scraper that iterates trough all the internal links of the specified url. It works on CSR pages (React, Angular) with dynamic urls.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published