content-scraper-py

Scrape a list-of-posts page, then scrape the content of each post.

This tool collects the list of post URLs from an archive page or a similar page.

For each post you can then get the raw content (the text of the content, including the title), the HTML version (the content area only, with the title returned as a separate string), and the content translated into Indonesian.
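To make the difference between the raw and HTML forms concrete, here is a minimal sketch using only the Python standard library. It is not this tool's actual code: the names `PostParser` and `extract_post` are hypothetical, and it assumes the content area is an `<article>` element, which will differ from site to site. Translation is left out.

```python
from html.parser import HTMLParser

# Tags whose closing marks a line break in the plain-text form.
BLOCK_TAGS = {"p", "div", "li", "h1", "h2", "h3", "h4", "h5", "h6"}


class PostParser(HTMLParser):
    """Collects the page title plus the HTML and text of the content area."""

    def __init__(self, content_tag="article"):  # content_tag is an assumption
        super().__init__()
        self.content_tag = content_tag
        self.title = ""
        self.html_parts = []
        self.text_parts = []
        self._in_title = False
        self._depth = 0  # nesting depth inside the content area

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        if self._depth:
            self.html_parts.append(self.get_starttag_text())
        if tag == self.content_tag:
            self._depth += 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied once, with no end tag.
        if self._depth:
            self.html_parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        if tag == self.content_tag and self._depth:
            self._depth -= 1
        if self._depth:
            self.html_parts.append(f"</{tag}>")
            if tag in BLOCK_TAGS:
                self.text_parts.append("\n")

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._depth:
            self.html_parts.append(data)
            self.text_parts.append(data)


def extract_post(page_html, content_tag="article"):
    """Return the title, the raw text (title included), and the content HTML."""
    parser = PostParser(content_tag)
    parser.feed(page_html)
    parser.close()
    title = " ".join(parser.title.split())
    text = "".join(parser.text_parts)
    lines = (" ".join(line.split()) for line in text.splitlines())
    body = "\n".join(line for line in lines if line)
    return {
        "title": title,
        "raw": f"{title}\n{body}",                    # text form, title first
        "html": "".join(parser.html_parts).strip(),   # content area only
    }
```

Text outside the content area (menus, footers) is ignored in both forms. Character references such as `&amp;` are decoded by the parser, so the `html` value is a close reconstruction of the content area, not a byte-for-byte copy.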

Setup

Install Python on your PC if you haven't already. Get it here:

https://www.python.org/downloads/

Install the required libraries:

pip install -r requirements.txt

Download ChromeDriver and place it in the driver/ folder. Pick the release that matches your installed Chrome version. Get it here:

http://chromedriver.chromium.org/downloads
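To check that the driver is where the tool expects it, a small sketch like the one below can help. The helper names `find_chromedriver` and `start_chrome` are hypothetical, and the use of Selenium 4 is an assumption: this README does not name the browser-automation library, so check requirements.txt first.

```python
from pathlib import Path


def find_chromedriver(driver_dir="driver"):
    """Return the chromedriver binary inside driver_dir, or None if absent."""
    for name in ("chromedriver", "chromedriver.exe"):
        candidate = Path(driver_dir) / name
        if candidate.is_file():
            return candidate
    return None


def start_chrome(driver_path):
    """Start Chrome with the given driver (assumes Selenium 4 is installed)."""
    # Imported lazily so find_chromedriver works without Selenium.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    return webdriver.Chrome(service=Service(executable_path=str(driver_path)))
```

If `find_chromedriver()` returns `None`, the binary is missing from driver/ or has an unexpected name. Otherwise, pass the returned path to `start_chrome` and call `quit()` on the resulting driver when done.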

How to use it

Open the example files. There are two of them.

The first one gets the list of URLs to scrape. The second one scrapes the content of every URL in that list.
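The first step, collecting post URLs from an archive page, can be sketched with the standard library alone. This is an illustration under assumptions and not the code in the example files: `LinkCollector` and `post_urls` are hypothetical names, and the sketch keeps every same-site link, whereas a real archive page usually needs a narrower selector to separate post links from navigation.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in document order."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def post_urls(archive_html, base_url):
    """Return absolute, de-duplicated, same-site URLs in page order."""
    collector = LinkCollector()
    collector.feed(archive_html)
    collector.close()
    site = urlparse(base_url).netloc
    seen, urls = set(), []
    for href in collector.hrefs:
        # Resolve relative links and drop any #fragment.
        url = urljoin(base_url, href).split("#")[0]
        if urlparse(url).netloc != site or url == base_url or url in seen:
            continue
        seen.add(url)
        urls.append(url)
    return urls
```

The returned list is what the second step would iterate over, fetching each URL and extracting its content.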

Where is the result

You can find the results in the results/ folder.
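This README does not describe the file names or formats written to results/, so the sketch below makes no assumption about them and simply lists whatever is there. The function name `list_results` is hypothetical.

```python
from pathlib import Path


def list_results(results_dir="results"):
    """Return (relative path, size in bytes) for each file under results_dir."""
    root = Path(results_dir)
    if not root.is_dir():
        return []
    return sorted(
        (p.relative_to(root).as_posix(), p.stat().st_size)
        for p in root.rglob("*")
        if p.is_file()
    )
```

An empty list means the folder is missing or the scripts have not written anything yet.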

Update Plan

I plan to update this tool so that it can insert your backlink into the scraped content.
