This blog article shows you one way to do web scraping using Python. I have a web page that lists restaurants (the URL is in the code below) and I want to get all the titles.
I need to install BeautifulSoup (pip install beautifulsoup4 requests), then loop through the matching elements on the page, as in the code sample below.
import requests
from bs4 import BeautifulSoup

# Page that lists the restaurants
url = "https://www.infobel.com/en/malaysia/business/10000/food_restaurants"
response = requests.get(url)
soup = BeautifulSoup(response.content, "html.parser")

# Each title is an <h2> element with the class "customer-item-name"
restaurant_names = []
restaurant_elements = soup.find_all("h2", class_="customer-item-name")
for restaurant in restaurant_elements:
    name = restaurant.text.strip()
    restaurant_names.append(name)

# Print the titles with a running number
for idx, name in enumerate(restaurant_names, start=1):
    print(f"Restaurant {idx}: {name}")
You can run the code snippet in Jupyter Notebook, and you will see one line printed per title, in the form "Restaurant 1: <name>".
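If you want to try the parsing step without calling the website, the same find_all lookup can be wrapped in a small function and run against an HTML string. This is only a minimal sketch: the function name extract_titles and the sample markup are made up for illustration, and the real page may be structured differently.

```python
from bs4 import BeautifulSoup


def extract_titles(html, tag="h2", css_class="customer-item-name"):
    """Return the stripped text of every matching element in the HTML.

    extract_titles is a hypothetical helper name, not part of BeautifulSoup.
    """
    soup = BeautifulSoup(html, "html.parser")
    return [el.get_text(strip=True) for el in soup.find_all(tag, class_=css_class)]


# Made-up markup for illustration only; the real page may differ.
sample_html = """
<div>
  <h2 class="customer-item-name"> Example Cafe </h2>
  <h2 class="other">Not a title</h2>
  <h2 class="customer-item-name">Sample Bistro</h2>
</div>
"""

titles = extract_titles(sample_html)
for idx, name in enumerate(titles, start=1):
    print(f"Restaurant {idx}: {name}")
```

To use it on the live page, pass response.content from requests.get(url) instead of sample_html.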
Also: SQL Server Insert Data using Python (the problem)
Source code download: https://github.com/chanmmn/python/tree/main/WebScraping/?WT.mc_id=DP-MVP-36769
Reference: https://realpython.com/beautiful-soup-web-scraper-python/
About chanmingman
Since Microsoft Live Spaces migrated to WordPress in March 2011 (http://www.pcworld.com/article/206455/Microsoft_Live_Spaces_Moves_to_WordPress_An_FAQ.html), this blog has had over 1 million viewers. More than 50% of the blog is about how to resolve error messages, especially for Microsoft products. The blog also has a lot of guidance on how to get started with certain Microsoft technologies. It also serves as a memory aid for me. The blog is never meant to give people consulting services or silver bullet solutions. It is a contribution to the community. Thanks for your support over the years.
Ming Man has been a Microsoft MVP since 2006. He is a software development manager for a multinational company. With 25 years of experience in the IT field, he has developed systems using Clipper, COBOL, VB5, VB6, VB.NET, Java and C#. He has been using Visual Studio (.NET) since the Beta back in 2000. He and his team have developed many projects using the .NET platform, such as SCM and HR-based applications. He is familiar with the N-Tier design of business applications and is also an expert with database experience in MS SQL, Oracle and AS/400.