jeudi 6 décembre 2018

Delay on scraping JavaScript contents on website

I'm trying to scraping this website to get notified the announcements as soon as one is published. I found that the announcement listings are within an Ajax container calling an API, so I'm polling the API directly. However, I found there are two delays: 1. about 1min delay between the creation of an article and its timestamp in the announcement API; 2. about 2 min delay between the timestamp in the announcement API and when it is visible externally by either a browser or a polling programme. This second delay also happens on another website I'm experimenting. I'm wondering its exact cause, and whether they are using some kind of masking technique to avoid instant detection. If so, is there any way that I can circumvent the masking please? Thanks.




Aucun commentaire:

Enregistrer un commentaire