Leveraging the Web: Caching

Saturday, 26 November 2005

The first in an occasional series about the real-world benefits of REST and the Web architecture, as applied to HTTP.

I used to work for a fairly huge company as a Web/Internet guru. One day, I got sucked into a meeting with a visiting executive who was talking about rolling out a new set of servers to allow customer service reps access internal documentation. The requirement was to make large-ish PDF files available on the internal network world-wide nearly instantaneously, with access control.

An external vendor had quoted a solution; it involved rolling out a pair of Windows NT servers (for redundancy) to each location around the world, each with its own database and custom-designed software that client applications on the reps’ desktops would connect to. The whole thing would be tied together with message queues and centrally managed.

Our exec wasn’t happy because the deployment cost for this was huge; developing software, rolling out and maintaining Windows NT boxes with databases to over fifty sites around the world is no picnic. The cost of the servers and software alone was prohibitive, and the ongoing maintenance was a very healthy chunk of change. And, the complexity of the proposed system lead us to believe that it would be pretty flakey.

Furthermore, he was frustrated because the same information was already available on an internal Web site, but it just wasn’t fast enough for his purposes (after all, if you’re a rep in Dubai, you can’t wait around with a customer on the phone for five minutes while the PDF comes down).

So, when I wondered aloud why they didn’t just use Web caches, he got very interested.

After a prototype using Squid, we got buy-in to go further. The app had some pretty specific requirements; for example, each and every request had to be authenticated, but we still needed to get the PDF from local cache. We took care of that with a Cache-Control: public, must-revalidate. Then, they wanted the PDF to be in cache, even for the first person to request it. So, we had a small script on the Web server that pushed the PDFs into the caches as they were published, effectively pre-fetching them. They wanted it to be reliable, so we designed a two-level hierarchy with fail-over between both co-located and remote caches. Even in a complete failure, the original Web site could be used, so that the data would still be available (albeit slow).

Caching isn’t just about saving bandwidth; it’s also about distributing an application, improving reliability and improving user experience.

We ended up deploying a large-ish number of Network Appliance Netcaches around the world. The NetApps were fantastic; because they were off-the-shelf appliances, they were very easy to configure, and once running, they didn’t require any but the most basic monitoring. The startup cost was, IIRC, nearly an order of magnitude less than the original quote, and the maintenance for the NetApps was, comparatively, a pittance. The project took about six months, start to finish, and that was mostly working out the deal with NetApp and getting the deployment plan together; there wasn’t any development beyond the thirty or so lines of Perl to get the database to ping the caches.

Our exec was very happy.

I went to headquarters to do the final integration into the Web site, and give a demo or two. At one point, some senior IT execs came in and were very sceptical about the value of caching; while it might do good in tiny, remote offices, it wouldn’t help there (where they had some impossibly big pipes straight into the Internet). Needless to say, their eyes pretty much popped out of their heads when I showed them the difference between surfing from the net and surfing from the cache, and they were immediate converts.

Lessons Learned

The biggest surprise to many involved was that we were able to scale the Web site out with basically no code, using off-the-shelf components. Caching brought both scalability and reliability to the application very cheaply and easily, despite the requirement for authentication.

It was also quite eye-opening to see how using a message queue and other “Enterprise” mechanisms just plain weren’t necessary, despite experts’ insistence that they were. The constraints of the Web (as REST explains) makes it very easy and simple to do very powerful things.

Lastly, this is a nice demonstration that caching isn’t just about saving bandwidth; it’s also about distributing an application, improving reliability and improving user experience.

6 Comments

Subbu Allamaraju said:

"Lastly, this is a nice demonstration that caching isn’t just about saving bandwidth; it’s also about distributing an application, improving reliability and improving user experience."

Well said.

Sunday, November 27 2005 at 6:17 AM

Jim Webber said:

Hi Mark,

First up, good article. Use the Web for Web-like things is a great message.

However, I’m thinking that the benefits inherit in the architecture are down to caching rather than REST per se. While I agree that the technical simpler solution offered by using the Web is totally sensible, this problem space gives an atypical view of a Web-scale integration project.

For example, the fact that information travelled only in one direction meant that caching could be trivially employed. If there was an exchange of data in both directions then caching would be that much harder (nasty consistency issues etc).

I suspect that such bilerateral information exchanges would be the general case in a large application (as opposed to a large information system).

My concern is that the information in this article will be misrepresented by some folks to bolster their specific technical religion, rather than being a useful case study for a particular situation.

Would you care to perhaps distil the underlying practices in a separate post? Something like the patterns work would be a really cool way of presenting this stuff (forces, strengths, weaknesses, etc).

Jim

Monday, November 28 2005 at 4:28 AM

Mark Nottingham said:

Jim,

The benefits I describe are available thanks to the constraints described by REST; in particular, uniform, generic methods allow caches to be interposed without re-deploying the application. Doing so in this environment would have meant months more work architecting, developing and testing the solution, and then getting it through change control. Of course it’s possible to cache anything that’s cacheable, in pretty much any system, given enough coding — but that’s not the point.

Certainly, there are things that the Web can’t do, and projects where it’s not appropriate to use it. However, my experience is that people consistently under-estimate what it’s capable of, and invest heavily in architectures, tools and systems that will, sooner or later, be superseded by it. Erring on the side of caution means considering it first, before trying something more complex. This is borne out by the example well; a traditional enterprise architect would think “how do I get that data over there” and use an over-specified tool; a message queue. Leveraging the Web often means changing how you think.

WRT underlying practices, have you seen https://www.mnot.net/cache_docs/ ?

Cheers,

Monday, November 28 2005 at 9:56 AM

Arthur Davidson Ficke said:

Mark,

If the application must serve its documents using SSL, are there any caching options available? It’s my understanding that with SSL no intermediary caching can be done.

Thanks

Thursday, December 1 2005 at 5:46 AM

Mark Nottingham said:

Proxy caching can’t be done, but gateways can. E.g., Akamai and some other “Reverse Proxy” or “Surrogate” caches. And, of course, you can use client caching as well.

Wednesday, December 14 2005 at 5:30 AM

Nicolas said:

Of course REST matters here. You can’t use standard HTTP caching with SOAP!

Saturday, December 13 2008 at 12:08 PM

mark nottingham

other HTTP Caching posts