Filter Your Feeds With Yahoo Pipes

f you look at my blog roll, you may notice a few planets: Planet Cataloging, planet code4lib, and Planet RDF. These topical aggregators are quite handy, saving me much of the trouble of seeking out new and relevant blogs. I just subscribe to the feed for the aggregator and let the maintainer do all the hard work.

Aggregators do have their drawbacks, though. A fair number of blogs that I’ve no interest in reading get lumped in with those of value to me. Since some blogs are included in more than one feed, I also end up getting duplicates in my feed reader (Google Reader, if you’re curious). All of the clutter can be a nuisance and a time-waster. I need a way to filter out the chaff.

If you look at my blog roll, you may notice a few planets: Planet Cataloging, planet code4lib, and Planet RDF. These topical aggregators are quite handy, saving me much of the trouble of seeking out new and relevant blogs. I just subscribe to the feed for the aggregator and let the maintainer do all the hard work.

Aggregators do have their drawbacks, though. A fair number of blogs that I’ve no interest in reading get lumped in with those of value to me. Since some blogs are included in more than one feed, I also end up getting duplicates in my feed reader (Google Reader, if you’re curious). All of the clutter can be a nuisance and a time-waster. I need a way to filter out the chaff.

Ed Summers kindly suggested that I try out Yahoo Pipes (thanks, Ed). Pipes is a visual editor that lets you take input from various feeds (RSS, Atom, RDF, or iCal) and other data sources (XML, JSON, iCal, or KML), run the information through a series of filters, and output the results in RSS 2.0, RSS 1.0, JSON, and Atom.

I’ve now built my first pipe. Smaller Planets grabs the feeds from the aforementioned planets, removes any items with del.icio.us in the title (I prefer not to see what everyone bookmarked today) and removes any duplicates (based on the URL of the post). After just a couple of minutes getting Pipes figured out, I now have my custom feed to subscribe to.

Filtering out other content is easy, too. You can add rules to the Filter module to block, for example, posts from certain blogs. A rule that says “item.link contains xplus3.net” would block out anything I post to this blog. Feel free to clone Smaller Planets and customize it to create your personalized feed.

I’ve only just scratched the surface, but Pipes looks to be a powerful application; I look forward to playing around with it more. If you create any interesting or useful pipes, please leave a comment here; I’d love to see it.

Off-Line Web Applications With Google Gears

One of the most frequent criticisms of the modern crop of web- and AJAX-based applications is the need for an Internet connection for them to work. After all, what good are Google Docs or webmail to you if you are on an airplane or facing a temporary connection interruption. Google has brought us one step closer to fixing that this week with its new Google Gears browser extension.

One of the most frequent criticisms of the modern crop of web- and AJAX-based applications is the need for an Internet connection for them to work. After all, what good are Google Docs or webmail to you if you are on an airplane or facing a temporary connection interruption. Google has brought us one step closer to fixing that this week with its new Google Gears browser extension.

As others have pointed out, Google is not the first to develop something like this. But putting their weight behind it should help increase the development and adoption of off-line web applications.

I can imagine several potential uses for Gears within the library. Someone could conceivably save a set of records within an OPAC web interface, and still have access to those records while taking their laptop through the stacks of the library (presuming the library doesn’t have adequate Wi-Fi, of course). Or a student could save several items from a digital collection and still have access to those pages when presenting them to a class later. These things can already be done, of course, but Gears should make them easier and more user-friendly.

Like many Google products, Gears is still in Beta (much like Google Reader, which has implemented Gears), which means there are still some bugs to work out. There is some suspicion that it was rushed out the door to meet the deadline of Google Developer Day. But hopefully Gears will soon be a stable, robust, functional API for delivering off-line web applications.