How to Save One, Many, or All Items from a Google Reader Feed Locally

4 Responses · November 29, 2007

Google Reader, employ­ing Google’s petabytes of stor­age, archives every feed item it’s ever pulled for you. This has always amazed me, as I’m sure I and every­one else must be using far more in Reader than the 5 gigs we get from Gmail. Still, they don’t have much of a choice; it wouldn’t do any­body good if you could only see the 10 or 20 items present on a feed’s XML file at any given time. And even though they’re prob­a­bly clever enough to only have to store one copy of every item for that item’s hun­dreds of thou­sands of read­ers, they’ve prac­ti­cally built a third copy of the inter­net (after their cache).

A nice fall­out of this archiv­ing is that when­ever con­tent you’ve sub­scribed to dis­ap­pears from the web, you’ll still be able to access its (admit­tedly homog­e­nized) Reader copy, for­ever; “for­ever” here mean­ing “pre­sum­ably for as long as Google is around.” When (if?) Google dies, will its data die with it? Despite my intu­ition that Google will long out­last cur­rent notions of what com­put­ers are and how they work, I still don’t like entrust­ing impor­tant data to other peo­ple, not to men­tion data that is acces­si­ble only through the web. I want a local copy.

But they don’t make it easy for you. Reader is all AJAXed out, so even sim­ple page saves don’t work. Copying/pasting would be a night­mare. Screen­shots? Too sloppy. Email­ing copies of each item? Too time-consuming. Tag­ging them with a spe­cial tag, mak­ing that tag’s feed pub­lic, then sub­scrib­ing in, like, Thun­der­bird or some­thing? Even if that weren’t absurdly round­about, the pub­lic feeds only have twenty or so items.

I’m talk­ing specif­i­cally about a blog I loved, but that up and dis­ap­peared one day, com­pletely, leav­ing the only copies of the lost data scat­tered through­out Netvibes, News­ga­tor, Blog­lines, and Reader. Google searches turned up noth­ing like a straight­for­ward guide to sav­ing from Reader, which sur­prised me. But there were clues, and using only a cou­ple tools, I finally got it. It’s actu­ally pretty easy, I was able to save 118 items in about ten min­utes with this method. Let me show you it.

You need Fire­fox, the two plu­g­ins Grease­mon­key and Scrap­Book, and the Grease­mon­key script Google Reader Print But­ton. Then it’s just a mat­ter of click­ing “Print” for each item you want to save, which opens it in its own tab, then using ScrapBook’s “Cap­ture All Tabs…” func­tion, which auto­mat­i­cally does a “Save Page As, Web Page, com­plete” into your %App­Data% folder for each tab, then finally option­ally using ScrapBook’s “Com­bine Wiz­ard” (in the tools menu of the Scrap­Book side­bar [Alt+K]) to put all the items into a sin­gle folder with a sin­gle index.html file.

The “print­ing” part is the most cum­ber­some, but goes by pretty quickly with the rep­e­ti­tion of a series of clicks and keystrokes:

  1. Click “Print”
  2. Press Esc (to close the print dialogue)
  3. Press Ctrl+Tab (to get back to Reader)
  4. Press J (to go to the next feed item)

Do that mind­lessly for a cou­ple min­utes, and they’ll all be there, wait­ing to be saved. I’m gonna put the word “disk” in here too so that any­body Googling for a solu­tion might find this.

You could also use the iMacro exten­sion to do it, probably.

I’m cur­rently try­ing to backup all my “starred items” in Google Reader and this looks like a nice hack.

Giacomo · December 31, 2008

Actu­ally I’m hav­ing trou­ble with that Google Reader Print But­ton script now that Google Reader has gone through some updates. The code was appar­ently updated on 12/17, but it’s still not work­ing for me.

Jay · December 31, 2008
a · June 5, 2009

I really don’t know whether or not to con­sider that spam…

Jay · June 8, 2009

Leave a Comment or Subscribe