vendredi 22 décembre 2017

Web scraping - basics

I am web scraping a h1 tag from a page using urrlib and bs4 for practice. When I executed the script the business/wifi owner's landing page h1 tag kept appearing instead of the site in my script. I switched over to my own hot spot, executed the script and received the correct h1.

Why is this occurring? Why am I receiving the h1 from the wifi's owners landing page and not the the h1 from the page in my code? I am assuming it has to do with a block on their network? Thanks!




Aucun commentaire:

Enregistrer un commentaire