Thursday, 21 June 2018

Easy way to remove escape sequences from web / JavaScript content

I am using the Python Scrapy module to scrape a web page and got the following response:

if (typeof addMoreNewsResults == \'function\') { \r\n
    addMoreNewsResults( {\r\n\t\tblob: \'3DP.AX\',\r\n        
    sortBy: \'date\',\r\n        
    dateRange: \'all\',\r\n        
    totalResultNumber: 2,\r\n        
    totalResultNumberStr: "2",\r\n        
    news: [ \r\n    ]} ); \r\n    }

The response is full of escape sequences such as \' and \r\n, and I want to remove all of them. I assume there is an easy-to-use package for this, but I was unable to find one. Any recommendations?
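For reference, here is a minimal sketch of one possible approach using only the standard library. It assumes the response text contains literal backslash sequences (a backslash followed by `r`, `n`, `t` or a quote) and that Python's `unicode_escape` codec is close enough to the JavaScript escapes involved. JavaScript-only escapes may not decode as expected. The function name `unescape_js_text` is hypothetical, not part of Scrapy.

```python
import re


def unescape_js_text(raw: str) -> str:
    """Decode literal backslash escapes, then collapse whitespace.

    Hypothetical helper for illustration; not a Scrapy API.
    """
    # Encode to bytes so the unicode_escape codec can interpret sequences
    # such as \r, \n, \t and \'. Characters outside Latin-1 are turned into
    # \uXXXX escapes by backslashreplace and restored by the decode step.
    decoded = raw.encode("latin-1", "backslashreplace").decode("unicode_escape")
    # The decoded text now holds real CR/LF/tab characters; squeeze every
    # whitespace run into a single space and trim the ends.
    return re.sub(r"\s+", " ", decoded).strip()


if __name__ == "__main__":
    # Raw string: the backslashes are literal, as in the scraped response.
    sample = r"blob: \'3DP.AX\',\r\n\t\tsortBy: \'date\'"
    print(unescape_js_text(sample))
```

If the line breaks should be kept, the `re.sub` step can be dropped so that only the decoding is applied.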



