There is a betting website that updates its match statistics about once per second, and I want to data-mine it. The site does not provide an API. I have been able to inject JavaScript into the page and scrape the data directly from the HTML tags in real time. However, this is cumbersome, and any future change to the HTML layout would force me to start again from scratch.
The website updates itself through JavaScript that runs to thousands, possibly tens of thousands, of lines. I was thinking of either intercepting the traffic between the server and the page's JavaScript, or somehow dumping the memory Chrome uses to run that JavaScript. I could then search the result for strings that change on the page (such as betting odds) to identify which part of the script and which variables are responsible for that data.
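To make the interception idea concrete, here is a minimal sketch of what I have in mind. It assumes the site fetches its updates as JSON through `fetch`, which I have not confirmed; if it uses XMLHttpRequest or a WebSocket instead, a similar hook would be needed there. The names `changedPaths` and `tapFetch` are my own hypothetical helpers, not anything from the site.

```javascript
// Hypothetical helper: list the dotted paths of leaf values that differ
// between two parsed JSON payloads (e.g. two consecutive updates).
function changedPaths(prev, next, path = '') {
  const isObj = (v) => v !== null && typeof v === 'object';
  if (!isObj(prev) || !isObj(next)) {
    // At least one side is a primitive (or missing): compare directly.
    return Object.is(prev, next) ? [] : [path || '(root)'];
  }
  const keys = new Set([...Object.keys(prev), ...Object.keys(next)]);
  const out = [];
  for (const k of keys) {
    out.push(...changedPaths(prev[k], next[k], path ? `${path}.${k}` : k));
  }
  return out;
}

// Hypothetical hook: wrap `fetch` on the given global (assumed to be
// `window` inside the page) and report which fields changed per URL.
function tapFetch(target, onChange) {
  const original = target.fetch;
  const last = new Map(); // URL -> last parsed body seen for that URL
  target.fetch = async function (...args) {
    const response = await original.call(target, ...args);
    // Clone so the page's own code can still read the body.
    response.clone().json().then((body) => {
      const url = response.url;
      if (last.has(url)) onChange(url, changedPaths(last.get(url), body));
      last.set(url, body);
    }).catch(() => { /* body was not JSON; ignore it */ });
    return response;
  };
}
```

Injected into the page, `tapFetch(window, (url, paths) => console.log(url, paths))` would log, for every repeated request, the fields whose values changed since the previous response. That would point to the endpoint and fields carrying the odds without depending on the HTML layout.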
What is the best way to approach this?