I searched for "然乌湖" at http://image.baidu.com and saved the results page as HTML. Looking at the source of that file, what I want is the real picture URL, but I see strange URLs like the ones below:
"thumbURL":"http://ift.tt/1IBxUTL"
"middleURL":"http://ift.tt/1IBxUTL"
"largeTnImageUrl":"http://ift.tt/1IBxUTL"
"hasLarge" :0
"hoverURL":"http://ift.tt/1B0d2Oc"
"pageNum":0
"objURL":"ippr_z2C$qAzdH3FAzdH3Ft428_z&e3Bvwvij_z&e3Bgjpjwfj_z&e3Bv54AzdH3FvwpvirtvAzdH3FcAzdH3Fc8AzdH3Fc8cAE0d00BblE9b8Dbb0AmEnFaEBEm90_z&e3B3r2"
"fromURL":"ippr_z2C$qAzdH3FAzdH3Fgjof_z&e3B8mn_z&e3Bv54AzdH3F8dAzdH3Fac8bAzdH3FadAzdH3Fb8OMcL0aaaa89AED_z&e3Bip4s"
"fromURLHost":"http://news.163.com"
"currentIndex":"12934"
"width":400
"height":266
"type":"jpg"
"filesize":"26"
"bdSrcType":"0"
"di":"199292606480"
"is":"0,0"
"bdSetImgNum":0
"bdImgnewsDate":"1970-01-01 08:00"
"fromPageTitle":"<strong>然乌湖<\/strong>"
"bdSourceName":""
"bdFromPageTitlePrefix":""
"isAspDianjing":0
"token":"9254"
"imgType" : ""
"cs" : "1555410164,2312683372"
"os" : "1775047619,54625452"
"source_type":""
The "objURL" and "fromURL" values are made up of special characters that I have never worked with. Can anyone help me get the real URLs behind them?
I am trying to write a web picture spider in Python as practice.
Thanks a lot.
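
One observation that may help: "fromURLHost" is given in plain text as http://news.163.com, and "fromURL" starts with "ippr_z2C$qAzdH3FAzdH3Fgjof_z&e3B8mn_z&e3Bv54". Lining the two up suggests a plain substitution scheme ("ippr" for "http", "_z2C$q" for ":", "AzdH3F" for "/", "_z&e3B" for "."). Below is a minimal sketch in Python under that assumption. The function names `decode_baidu_url` and `extract_obj_urls` are hypothetical, and the full character table is the commonly reported community mapping, not anything documented by Baidu, so it may be incomplete or may change. Only the letters and digits that appear in "news.163.com" can be checked against the data in this post.

```python
import re

# Assumption: multi-character tokens that stand for URL punctuation.
TOKENS = {"_z2C$q": ":", "_z&e3B": ".", "AzdH3F": "/"}

# Assumption: single-character substitution table (community-reported,
# unofficial). Each character in the first string maps to the character
# at the same position in the second string.
CHARS = dict(zip("wkv1ju2it3hs4g5rq6fp7eo8dn9cm0bla",
                 "abcdefghijklmnopqrstuvw1234567890"))

# Tokens are listed first so they win over single-character matches.
_PATTERN = re.compile("|".join(re.escape(t) for t in TOKENS) + "|[a-z0-9]")


def decode_baidu_url(encoded):
    """Decode an obfuscated objURL/fromURL value in a single pass."""
    def substitute(match):
        piece = match.group(0)
        # Characters missing from both tables are left unchanged.
        return TOKENS.get(piece) or CHARS.get(piece, piece)
    return _PATTERN.sub(substitute, encoded)


def extract_obj_urls(html):
    """Find every "objURL" value in the saved page source and decode it."""
    return [decode_baidu_url(value)
            for value in re.findall(r'"objURL"\s*:\s*"([^"]*)"', html)]


# The "fromURL" value from the post; the decoded host should match
# the plain-text "fromURLHost" field.
sample = ("ippr_z2C$qAzdH3FAzdH3Fgjof_z&e3B8mn_z&e3Bv54AzdH3F8dAzdH3Fac8b"
          "AzdH3FadAzdH3Fb8OMcL0aaaa89AED_z&e3Bip4s")
print(decode_baidu_url(sample))
```

The substitution is done in one regex pass so that characters produced by a token (":", ".", "/") are never mapped a second time. Uppercase letters fall outside the table and pass through untouched, which fits the hash-like part of the "objURL" above.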