Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
199 views
in Technique[技术] by (71.8m points)

python - How to bypass captcha when extracting data from website. I am extracting from https://jp.indeed.com/

When Extracting locally, there is no problem, but when I am using my production website ,there is captcha screen before extraction. I am using Heroku and Django.

 header ={ "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:77.0) Gecko/20100101 
 Firefox/77.0" }
 r = requests.get(url,headers=header)
 soup=BeautifulSoup(r.content,'html.parser')

But when I print soup variable , I can see that there is a form to solve the captcha . How can I bypass captcha

question from:https://stackoverflow.com/questions/65901939/how-to-bypass-captcha-when-extracting-data-from-website-i-am-extracting-from-ht

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

There is no way to bypass a captcha. That's what captchas are made for, so you can't automate any actions on a webpage.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...