Idea: HTTP Get Multiple Files

This functionality can easily be jammed in into existing browser and server code.

Summary:
Basically, a browser will parse html file and compile a list of files (like JavaScript, images, etc…) that it needs to get from server(s). For example, if the browser sees something like this in html <img alt="google logo" src="http://www.google.ca/intl/en_ca/images/logo.gif” /> it will not make a request to get the image but rather will add it to the list of files to get.
Once the list is ready, the browser sends an HTTP GET request to each server (links within HTML can point to many locations) with the list of files it needs.
The server gets the files and compresses them to an archive (zip for example) with correct relative directory structure and serves it to the client.
The client, knowing the file received is in fact an archive, extracts it and gets multiple files in one shot!

How to implement in Browser:
In the ProcessHTML() function, that’s before DisplayHTML() function, get all the links that need a GET request and add them to an array. There will be an array for each server. For example, my HTML might have links to 10 images located somewhere on google.ca and another 10 somewhere on advertize.com.
For each array execute a GET with the filename being “?this is a multi file request. filenames are “. So the filename will not be a legal URI.
The server will return an archive with the files and relative directory structure.
Extract the archive to a temp location and replace all the links within HTML that point to external locations to the ones that point to your local hard drive. For example: <img alt="google logo" src="http://www.google.ca/intl/en_ca/images/logo.gif” /> will be replaced with <img alt="google logo" src="file:///C:/BrowserTempFolder/www.google.ca/intl/en_ca/images/logo.gif” />.
Then, invoke the DisplayHTML() function.

How to implement in Server:
If the incoming GET request has a filename that start with “”?this is a multi file request. filenames are” DON’T try to do an I/O operation as the filename is not legal and the I/O operation will fail. Instead, jump to a function that will parse out the filenames and put them in an archive that has relative directory structure.
Give the archive to the client, that’s it!

Notes:
The browser should have logic that will determine if doing the above makes sense. Say I am parsing a web page that has 40 images. Would the above functionality have benefit? Yes (assuming the images are tiny). How about one or two images? Probably not. This also assumes that the client / server already have support for HTTP Compression, which modern servers and clients do.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s