monkinetic weblog

Steve Ivy's Weblog - Since 1999 - XII Ed.

HTTP, HTML, and HEAD Requests

The other day I was working on a script and I wanted to get the titles for a bunch of web pages I had links to. I coded up something that downloaded each page, pulled out the title, and cached it (so that there would be no more than 1 request per URL).

Still, a lot of these were weblogs or other dynamically generated pages, so it was still a time & resource intensive operation.

It occured to me that it would be cool if there was a standard way to ask a webserver just to send you the HEAD element (and content of course) for a page. You could extract the title, meta keywords, and link elements, etc without having to fetch the entire page's contents.

Thoughts?

My name is Steve Ivy and I write about technology, the open web, social software, and general nerdity on monkinetic.com. You should follow me on Twitter or subscribe to this blog if you like what you're reading. I spend my days hacking Movable Type, python, Django, and various other efforts at Wallrazer. This is my personal site.