Skip to content

daniele-sartiano/penelope-bot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

penelope-bot

A distributed and asynchronous C++ Web Crawler based on Docker, nats.io and Scylla db.

Modules

  • Downloder
  • Parser
  • Data Manager
  • Common

Build and Run

make
docker-compose up --scale downloader=3

Downloader

compile downloader locally

dependencies

libprotobuf-dev protobuf-compiler libprotoc-dev
https://github.com/protobuf-c/protobuf-c

commands

cd downloader; mkdir -p build; cd build; cmake ..; make; cd ../..

Data Manager

dependencies

libuv1-dev