Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some method of properly passing Cloudflare protections? #110

Open
Kadigan opened this issue Oct 2, 2023 · 8 comments
Open

Some method of properly passing Cloudflare protections? #110

Kadigan opened this issue Oct 2, 2023 · 8 comments

Comments

@Kadigan
Copy link

Kadigan commented Oct 2, 2023

Hey.

So Cloudflare is everywhere now (enough to bring Discord down when they furk up), and it seems WRP doesn't know how to work with it. I can't seem to be able to pass Cloudflare's "verify you are human", and it seems to be used more and more aggressively.

Any suggestions?

@TheTechRobo
Copy link

Maybe cloudflare is detecting the headless Chrome?

@Kadigan
Copy link
Author

Kadigan commented Oct 2, 2023

The particular website (fanfiction.net) is actually known for being very aggressive about it (or maybe Cloudflare is particularly aggressive about that website? who knows). All I know is that I see 30-50 "checking the security of the connection" pages on an average day (and some days - at every page load), using a regular browser.

I don't think hiding the headless Chrome better will fix it. There has to be a way to pass this check.

@TheTechRobo
Copy link

Headless Chrome is often used for data scraping. Websites like Cloudflare block it more aggressively.

What exactly happens when you go to the page? Is there an "I'm not a robot" checkbox?

@tenox7
Copy link
Owner

tenox7 commented Oct 3, 2023

yeah crap, we need to come up with some solution for this

@Kadigan
Copy link
Author

Kadigan commented Oct 3, 2023

@TheTechRobo Yes, there is. It does the whole song & dance, and then redirects. Assuming the check works, the "browser" would need to be able to navigate, and/or send screen updates or something.

@tenox7
Copy link
Owner

tenox7 commented Jan 2, 2024

do you have any handy examples of pages that fail this?

@Kadigan
Copy link
Author

Kadigan commented Jan 2, 2024

Anything on fanfiction.net should qualify. They pursue verification so aggressively, that I have to go through the verification almost every other chapter on my desktop. It happens most often when switching chapters using the prev/next buttons.

That was actually the website I was hoping to use it with, as previously mentioned.

@tenox7
Copy link
Owner

tenox7 commented Jan 3, 2024

🤷 I have released 4.6.2; it has some basic anti crawler detection bits, also allows setting user agent with a flag; this seems to help the most;

I have tried ./wrp -ua="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/119.0.0.0 Safari/537.36" and it seems better

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants