Welcome to WuJiGu Developer Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

beautifulsoup - Extracting an href attribute with Beautiful Soup in Python

I have the following soup:

<a class="view_detail_button" href="/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064">View details</a>

How do I extract the href value, /internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064, from it?


1 Answer

The following snippet finds every <a> tag in the given HTML that has an href attribute and prints that attribute's value:

from bs4 import BeautifulSoup

html = '<a class="view_detail_button" href="/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064">'

# The "lxml" parser requires the lxml package to be installed.
soup = BeautifulSoup(html, features="lxml")

# href=True matches only <a> tags that actually carry an href attribute.
for a in soup.find_all('a', href=True):
    print(a['href'])

Output:

/internship/detail/primary-research-data-collection-on-ground-internship-in-bangalore-at-cry-child-rights-and-you1610101064
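If only the "View details" link is wanted rather than every link on the page, the search can be narrowed by class. The following is a minimal sketch, not part of the original answer: it assumes the class name view_detail_button from the question identifies the link, and it swaps in Python's built-in "html.parser" so that lxml is not required. The variable names link and href are illustrative.

```python
from bs4 import BeautifulSoup

html = (
    '<a class="view_detail_button" '
    'href="/internship/detail/primary-research-data-collection-on-ground-'
    'internship-in-bangalore-at-cry-child-rights-and-you1610101064">'
    'View details</a>'
)

# "html.parser" ships with Python, so no extra parser package is needed.
soup = BeautifulSoup(html, "html.parser")

# Match only the anchor carrying the class from the question (assumption:
# this class is what distinguishes the link of interest).
link = soup.find("a", class_="view_detail_button")

# .get() returns None instead of raising KeyError if href is absent,
# and the guard covers the case where no matching tag was found.
href = link.get("href") if link is not None else None
print(href)
```

Using .get("href") instead of a["href"] is a small design choice: it avoids an exception on anchors without an href, which is useful when the real page is less regular than the sample.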
