Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

causes captcha, maybe missing headers #69

Open
123bistami opened this issue Oct 3, 2021 · 3 comments
Open

causes captcha, maybe missing headers #69

123bistami opened this issue Oct 3, 2021 · 3 comments
Labels
enhancement New feature or request

Comments

@123bistami
Copy link

Site is cloudflare protected.
i have access to my target page with my proxy server behind it my proxy server is not on the blacklist. I have access to the target page when I set my proxy settings in the puppeteer args like [—proxy-server=127.0.0.1:4444], no captcha is displayed, but when I use proxy per page, captcha is displayed every time.
Captcha is displayed too if i send request with curl i think cloudflare detects that the request not coming from browser or this package not pass all headers.
Can someone help me?

@mewforest
Copy link

It seems, that this library catches every request from browser and uses own HTTP-client to override it (got.js).

So captcha isn't surprising here.

@Cuadrix Cuadrix added the enhancement New feature or request label Oct 10, 2022
@Cuadrix
Copy link
Owner

Cuadrix commented Oct 10, 2022

Yes, this package doesn't send all headers because Puppeteer doesn't provide all of them in request.headers() .

This is because Puppeteer doesn't listen to the 'Network.requestWillBeSentExtraInfo' event which is fired after the request interception itself is completed, atleast according to this guy: puppeteer/puppeteer#6117 (comment)

This can be worked around by creating a cdp session manually, but this is tricky to implement into this package because of the above fact: https://stackoverflow.com/questions/47078655/missing-request-headers-in-puppeteer/62232903#62232903

@Cuadrix Cuadrix changed the title proxy per page causes captcha causes captcha, maybe missing headers Oct 10, 2022
@agarpolices
Copy link

hi so is there any solution to this problem? i have the same problem, it is causing cloudflare to detect. I use this to lower performance usage but it doesnt work with sites with cloudflare

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants