-
Notifications
You must be signed in to change notification settings - Fork 148
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
automatic login and register #19
Comments
I agree with your point. I also want to implement a login detection. I already build a proof of concept, but I started on another project. When I will finish it, I would like to build a bigger proof of concept but it will probably be only in 1-2 months. If you develop something, let me know ;) |
Hi @zaranmd, This is a gist link to my proof of concept: https://gist.github.com/MrL3X/7b3580087cc18e90ddcb34b7bc52efe7 I did my proof of concept with Scrapy framework, the script that I sent you is only the main script of the project. To test it, you will need to create a new Scrapy project and copy paste the code that I sent you. To test your links, you need to add the URL in start_urls. The proof of concept wasn't set up with tor (not yet). Normally, I used this command to launch the Scrapy project: We can discuss on the issue channel, it will be easier and other people can join us to develop it. |
hi again @mrL3x , |
Sorry for the delay, I know how to add another python package and I found the links really interesting. I would like to implement this: https://github.com/TeamHG-Memex/autologin-middleware to the project. I think that it was developed for scrapy specifically.
|
This is an interesting link that combines login detection and captcha bypass #21 : http://berlusp44zaqyg2e.onion/?c=users&a=login |
hi all...
as we know, there are many onion domains that has a login/register page to enter and crawl their pages. Such domains require user name, password and captcha... I have run freshonions-torscraper and it seems that it doesn't crawl such domain's contents and we see just index page. I am interested to know how can i crawl such domains by entering into them! i have searched a bit, and i saw something like scrapy's FormRequest object... but i don't know what can i do with it in this project! Do you have any idea about my issue?! please guide me...
The text was updated successfully, but these errors were encountered: