dimanche 27 décembre 2015

How to acess the cache of a website

I want to filter information out of a (cached?) website in C#.

If I download the html from this example page, I get the source for the site. Now, the page has several "tabs" labelled from 1-17 but I can't access the content from the additional tabs. As the URL does not change when I switch between the tabs I guess it caches the site and is pulling data out of it.

I viewed the source in Microsoft Edge and it gave me this:

Opening http://file/C:/UsersAppData/Local/Packages/microsoft.microsoftedge_8wekyb3d8bbwe/AC/#!001\MicrosoftEdge\Cache\QG0E4GTD\AK-47 _ Cartel (Battle-Scarred)[1]

I can't access this directory with the File Explorer. It doesn't exist after "MicrosoftEdge\" ("Hidden Items are showed" is checked).

I also looked into the Chrome cache and found this:

http://ift.tt/1MC8QZc
HTTP/1.1 200 OK
Server: Apache
X-Frame-Options: DENY
Content-Security-Policy: script-src 'self' 'unsafe-inline' 'unsafe-eval'      http://ift.tt/1KudDh7 http://ift.tt/WFqQ2C http://ift.tt/rzP4g5 http://ift.tt/1bdKvGt https://www.google.com https://www.gstatic.com https://apis.google.com; object-src 'none'; connect-src 'self' http://ift.tt/TQJakx http://ift.tt/1qnrM60 http://ift.tt/WFqQ2C; frame-src 'self' http://ift.tt/kbbG1A http://ift.tt/1ed0tcv http://www.youtube.com https://www.youtube.com https://www.google.com;
Expires: Mon, 26 Jul 1997 05:00:00 GMT
Cache-Control: no-cache
Vary: Accept-Encoding
Content-Encoding: gzip
Content-Length: 21181
Content-Type: text/html; charset=UTF-8
Date: Sun, 27 Dec 2015 18:26:39 GMT

00000000: 44 17 00 00 03 4f 57 00 72 b5 56 9e 7b 86 2e 00  D....OW.r.V.{...
00000010: 85 b7 6d 9e 7b 86 2e 00 2b 03 00 00 48 54 54 50  ..m.{...+...HTTP
00000020: 2f 31 2e 31 20 32 30 30 20 4f 4b 00 53 65 72 76  /1.1 200 OK.Serv
00000030: 65 72 3a 20 41 70 61 63 68 65 00 58 2d 46 72 61  er: Apache.X-Fra
00000040: 6d 65 2d 4f 70 74 69 6f 6e 73 3a 20 44 45 4e 59  me-Options: DENY
00000050: 00 43 6f 6e 74 65 6e 74 2d 53 65 63 75 72 69 74  .Content-Securit
00000060: 79 2d 50 6f 6c 69 63 79 3a 20 73 63 72 69 70 74  y-Policy: script
00000070: 2d 73 72 63 20 27 73 65 6c 66 27 20 27 75 6e 73  -src 'self' 'uns
00000080: 61 66 65 2d 69 6e 6c 69 6e 65 27 20 27 75 6e 73  afe-inline' 'uns
00000090: 61 66 65 2d 65 76 61 6c 27 20 68 74 74 70 73 3a  afe-eval' https:
000000a0: 2f 2f 73 74 65 61 6d 63 6f 6d 6d 75 6e 69 74 79  //steamcommunity

// Many more lines..

I tried to copy this into a .gz and decompress. Didn't work.

Is there a way to see the cache or the whole source?




Aucun commentaire:

Enregistrer un commentaire