Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
1.2k views
in Technique[技术] by (71.8m points)

beautifulsoup - Extracting details using Beautiful Soup using Python

I have a following soup -

*<a class="view_detail_button" href="/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064">
View details *`

How do I extract - href="/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064" from it.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

The following snippet finds all a tags from a given html and prints the value of their href attribute:

from bs4 import BeautifulSoup

html = '<a class="view_detail_button" href="/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064">'

soup = BeautifulSoup(html, features="lxml")

for a in soup.find_all('a', href=True):
        print(a['href'])

Output

/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...