jeudi 17 janvier 2019

Web scraping the steam sale

I am currently trying to web scrape the steam store sale page however I can scrape all the names and discounts however I do not know how to remove tags from the data

my code is...

import bs4 as bs import urllib.request

opening a connection

my_url = urllib.request.urlopen('https://store.steampowered.com/search/?specials=1&os=win').read()

turning the html into a beautifulsoup object

soup = bs.BeautifulSoup(my_url, 'lxml') def remove_tags(text): return ''.join(xml.etree.ElementTree.fromstring(text).itertext())

data_discounts = (soup.find_all('div', {'class':'col search_discount responsive_secondrow'})) data_body = (soup.find_all('span', {'class':'title'})) print (data_body)




Aucun commentaire:

Enregistrer un commentaire