c# - How to crawl a website that uses cookies while integrating IP proxy? -
i'm creating crawler uses several ip proxies. whenever tried crawl website without proxy, i'm able html source, when tried enable ip proxy, fail , throws exceptions (the remote server returned error: (403) forbidden.)
upon looking @ fiddler, seems website stores cookies upon visit. if proxy enabled, fails @ response part.
i don't understand why cookies not set using proxy? proxy server settings cookies cause it? or can while still enabling proxy?
i'm using c# way, question doesn't seems language dependent.
another thing consider set cookie ip address of non proxied machine (which worked), when sent request same cookie ip address might have gotten blocked.
some network level software looks @ stuff might have flagged malicious crawler or annonymous tor browser.
Comments
Post a Comment