I'm trying to loop a web query over 500+ pages, each of which contains a single table with 8 rows. I want to keep only the last row of each table and load the results into a worksheet (that part is fine).
The web pages to be crawled have an ".asp" extension. My target row sits between the <tr id="c8"> and </tr> tags.
In the past, I've coded web scrapers without creating Internet Explorer objects, using the WebQuery syntax. For practical reasons, I'd like to avoid depending on a specific web browser (especially Internet Explorer, since I'm on a Mac). Unfortunately, as far as I know, the WebQuery method only lets you specify which table to scrape (the xlSpecifiedTables setting), not a specific part of a table such as a row or a column.
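To make the limitation concrete, here is a rough sketch of the only fallback I can see with WebQuery: import the whole table into a scratch sheet, then copy just its last row to the results sheet. The URL pattern, the sheet names "Scratch" and "Results", the page count and the table index "1" are all placeholders for illustration, not real values. It is untested on Mac Excel, where QueryTable support may differ from Windows.

```vb
Option Explicit

Sub ImportLastRows()
    ' Placeholder values: adjust to the real site and workbook.
    Const PAGE_COUNT As Long = 500
    Const BASE_URL As String = "http://example.com/page.asp?id="

    Dim i As Long
    Dim lastRow As Long
    Dim qt As QueryTable
    Dim scratch As Worksheet
    Dim results As Worksheet

    Set scratch = ThisWorkbook.Worksheets("Scratch")
    Set results = ThisWorkbook.Worksheets("Results")

    For i = 1 To PAGE_COUNT
        ' Start each page from an empty scratch sheet.
        scratch.Cells.Clear

        Set qt = scratch.QueryTables.Add( _
            Connection:="URL;" & BASE_URL & i, _
            Destination:=scratch.Range("A1"))

        With qt
            ' WebQuery can narrow the import to one table, but no further.
            .WebSelectionType = xlSpecifiedTables
            .WebTables = "1"
            .WebFormatting = xlWebFormattingNone
            ' Synchronous refresh so the data is present before copying.
            .Refresh BackgroundQuery:=False
        End With

        ' Keep only the last imported row (the <tr id="c8"> row, assuming
        ' it is the final row of the table as described above).
        lastRow = qt.ResultRange.Rows.Count
        qt.ResultRange.Rows(lastRow).Copy Destination:=results.Cells(i, 1)

        ' Remove the query definition; the scratch data is cleared next pass.
        qt.Delete
    Next i
End Sub
```

This still downloads and writes all 8 rows per page before discarding 7 of them, which is what I'd like to avoid if there is a way to target the row directly.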
Any advice?