Hourly data on the Ontario electricity grid from https://gridwatch.ca/.
-
Updated
May 27, 2024 - Jupyter Notebook
Hourly data on the Ontario electricity grid from https://gridwatch.ca/.
Faster requests on Python 3
An internet search engine written mostly in python. Currently TF-IDF based.
অবসর 📚 A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.
NBA Stats API via Basketball Reference
A little Python script made for scraping data from grabcraft, which can then be used for things like machine learning and data analysis projects and can be transformed to litematica files with https://github.com/RandomGamingDev/grabcraft-to-schema (Sadly, I can't release the dataset since you aren't allowed to share downloaded content)
A discord.py bot that monitors shadowkingdom.org for new forum posts of interest and reports details related to them back to the staff team.
Scrapes facebook's pages front end with no limitations & provides a feature to turn data into structured JSON or CSV
A GitHub Action workflow for automating the collection of crime and fire logs posted by the University of Southern California's Department of Public Safety.
Scrapfly Python SDK for headless browsers and proxy rotation
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
This GitHub repository hosts a collection of my web scraping projects, showcasing various techniques and tools used to extract data from websites. Explore these projects to learn about web scraping, data extraction, and data analysis
Generate and download e-books from online sources.
Scrapes Every Email Address of Every Society in Every University
🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖
为Ta荐(TaJian.tv)工作的基于Hero的Node.js爬虫程序,可抓取B站、抖音、快手、西瓜视频播放页、直播页的标题和封面图
A desktop app for tracking and batch downloading anime
GitHub scraping tool and library
Run Botasaurus in GitPod
Wgit allows you to crawl and extract the data you want from the web
Add a description, image, and links to the web-scraper topic page so that developers can more easily learn about it.
To associate your repository with the web-scraper topic, visit your repo's landing page and select "manage topics."