How to combine both freeform and structured tags

Today we launched a new design for the article pages on InfoWorld.com.  We are going through the site section by section and making upgrades based on the home page redesign we recently launched.  But what may seem mostly cosmetic on the outside is actually a significant shift in the way we operate.


What I like most in this new architecture is that the related links are now driven by del.icio.us.  Our edit team is tagging content in del.icio.us.  The engineers are pulling down the del.icio.us RSS feeds.  And then we create matching logic based on the common tags.  We also link back out to del.icio.us pages via the tags for the article on display. 
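To make the feed step a little more concrete, here is a minimal sketch of how tags could be read out of a downloaded feed.  It is not our production code.  It assumes the feed is RSS 1.0 with each item's tags in a space-separated dc:subject element, and the sample feed, the example.com URLs and the parse_feed name are all made up for illustration.  Fetching the feed over HTTP is left out.

```python
import xml.etree.ElementTree as ET

# Namespace prefixes for the lookups below. That the feed is RSS 1.0 and
# carries tags in <dc:subject> is an assumption made for this sketch.
NS = {
    "rss": "http://purl.org/rss/1.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical sample standing in for a feed already saved to our servers.
SAMPLE_FEED = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns="http://purl.org/rss/1.0/"
         xmlns:dc="http://purl.org/dc/elements/1.1/">
  <item rdf:about="http://example.com/articles/a">
    <title>Article A</title>
    <link>http://example.com/articles/a</link>
    <dc:date>2005-04-14T10:00:00Z</dc:date>
    <dc:subject>storage san</dc:subject>
  </item>
  <item rdf:about="http://example.com/articles/b">
    <title>Article B</title>
    <link>http://example.com/articles/b</link>
    <dc:date>2005-04-15T09:00:00Z</dc:date>
    <dc:subject>storage backup</dc:subject>
  </item>
</rdf:RDF>
"""


def parse_feed(xml_text):
    """Return one dict per feed item, with its tags as a set of strings."""
    root = ET.fromstring(xml_text.strip())
    items = []
    for item in root.findall("rss:item", NS):
        subject = item.findtext("dc:subject", default="", namespaces=NS)
        items.append({
            "title": item.findtext("rss:title", default="", namespaces=NS),
            "link": item.findtext("rss:link", default="", namespaces=NS),
            "date": item.findtext("dc:date", default="", namespaces=NS),
            "tags": set(subject.split()),
        })
    return items


items = parse_feed(SAMPLE_FEED)
for item in items:
    print(item["link"], sorted(item["tags"]))
```

Keeping the tags as a set makes the "common tags" comparison a simple set intersection later on.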

This is a first step with several more ideas for leveraging tags coming soon.  We need a more densely tagged data set behind us before some of the other plans can become real.  The accuracy of the related links will also be a little shady, I'm sure, until we get more sophisticated with our tagging.  But we're all excited about the possibilities for the site now that we have these tags.  New ideas seem to crop up daily.

The downside is that we're probably going to phase out or at least simplify the robust taxonomy that we spent so much time and energy building and refining over the years.  It's hard to look back on that effort and to think of walking away from all that valuable code and all those man-hours.

Of course, there's still a need for structured tagging, and we will continue to tag in ways that enable us to create new sections of the site and to help advertisers optimize their marketing campaigns.  We built a lot of functionality into the site that depends on normalized tagging, and that functionality would evaporate if we moved completely to freeform tags.

For example, we have advertisers who want to reach people interested in storage products.  There are probably 10 different ways to target storage on the site with different kinds of marketing, including contextual targeting, behavioral targeting and lead generation programs.  Eliminating the high-level structure in our tagging would mean that our freeform tags would have to be incredibly precise at all times.

Our lead engineer Derek Butcher defined this problem in an interesting way: "Structured tags give you low precision but high volume.  Freeform tags give you high precision but low volume."  David Weinberger explained it in terms of Trees and Leaves.  I won't make any bets on the future of folksonomies yet, but I'm certain we've done a good thing here.  And I'm starting to see how del.icio.us can become a business with a revenue stream instead of just a good idea worth betting on.

UPDATE: David Weinberger asks twice, "I’m not sure what it means that Infoworld is applying matching logic to del.icio.us feeds. Does that mean they’re looking at tags from non-Infoworlders?" 

Not yet.  At the moment, we're simply matching articles that use the same InfoWorld-specific tags.  If Article A and Article B share Tag Z, then they are related.  If Article A and Article C share both Tag Z and Tag Y, then Article C is more related to Article A than Article B is.  Then we rank chronologically.

Well, this is the intent, anyway.  Give us a few days to get it all working right.
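Here is a rough sketch of that matching rule, again not the code running on the site.  It assumes each article is a dict with a URL, a set of tags and a publish date, and it reads "rank chronologically" as "newest first among articles that share the same number of tags", which is my assumption for this example.  The related_articles name and the sample data are made up.

```python
from datetime import date


def related_articles(current, articles):
    """Rank other articles by number of shared tags, then newest first."""
    scored = []
    for other in articles:
        if other["url"] == current["url"]:
            continue  # an article is not related to itself
        shared = current["tags"] & other["tags"]
        if shared:
            scored.append((len(shared), other["published"], other))
    # More shared tags first; among equals, the more recent date first.
    scored.sort(key=lambda entry: (entry[0], entry[1]), reverse=True)
    return [other for _, _, other in scored]


# Hypothetical articles mirroring the Article A / B / C example above.
article_a = {"url": "/a", "tags": {"z", "y"}, "published": date(2005, 4, 15)}
article_b = {"url": "/b", "tags": {"z"}, "published": date(2005, 4, 10)}
article_c = {"url": "/c", "tags": {"z", "y"}, "published": date(2005, 4, 1)}
article_d = {"url": "/d", "tags": {"x"}, "published": date(2005, 4, 14)}
article_e = {"url": "/e", "tags": {"z"}, "published": date(2005, 4, 12)}

everything = [article_a, article_b, article_c, article_d, article_e]
ranked = related_articles(article_a, everything)
print([article["url"] for article in ranked])
```

Article C comes first because it shares two tags with Article A.  Articles E and B each share one, so the newer one (E) is listed ahead of B, and Article D drops out because it shares nothing.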


Comments:

Re: How to combine both freeform and structured tags
by Tom on Fri 15 Apr 2005 09:16 AM EDT tpimental

Matt,

It's great to see mainstream publishers integrating tags into their sites. New coverage areas render static taxonomies too limiting and while tags still have a long way to go, I think they are definitely worth pursuing. One question: Why are you linking to delicious results? Why not pull the results via XML and display them on your site?

Kudos,
Tom Pimental
Computerworld

Re: Re: How to combine both freeform and structured tags
by agaffin on Fri 15 Apr 2005 02:06 PM EDT agaffin

Most cool. What happens if del.icio.us goes down or gets hacked, though?

Re: How to combine both freeform and structured tags
by Terry on Sat 16 Apr 2005 09:20 AM EDT terrys

Matt,

I emphatically agree with the potential benefits of tagging. I note that you (quite correctly) acknowledge that you can't really realize these benefits unless/until you get a fairly rich set of tags and can attain decent accuracy. What I can't figure is how you expect to achieve those ends via del.icio.us.

Regards,

Terry




PS: please contact directly

Re: How to combine both freeform and structured tags
by Matt McAlister on Mon 18 Apr 2005 09:02 PM EDT mattmcal

We're pulling down the RSS feeds from del.icio.us to our own servers on a regular basis to make sure we don't lose any data should del.icio.us break for whatever reason. This will then enable us to show the results in our own UI, too, but we're not done with that yet.

Trackbacks:

TrackBack URL:
http://www.mattmcalister.com/blog/_trackback/582945

Infoworld goes tagalicious
Weblog:  Joho the Blog
Excerpt:  Matt McAlister explains that the Infoworld.com upgrade isn't merely cosmetic: On the articles pages they've moved from a fixed taxonomy that took them a lot of time to develop to a structured tagging system: What I like most in this new architecture is...
Posted:  Fri Apr 15 09:35:56 EDT 2005
Infoworld goes tagalicious
Weblog:  Many-to-Many
Excerpt:  Matt McAlister explains that the Infoworld.com upgrade isn’t merely cosmetic: On the articles pages they’ve moved from a fixed taxonomy that took them a lot of time to develop to a semi-structured tagging system: What I like most in this...
Posted:  Fri Apr 15 09:44:53 EDT 2005
InfoWorld redesign incorporates del.icio.us tags
Weblog:  a shel of my former self
Excerpt:  InfoWorld debuts a new look to its Web site today, but the exciting part of the redesign is under the hood. According to Matt McAlister, director of online product development for the magazine, "What I like most in this new architecture is that th...
Posted:  Fri Apr 15 10:56:34 EDT 2005
How to combine both freeform and structured tags
Weblog:  Oloop.org
Excerpt:  Today we launched a new design for the article pages on InfoWorld.com.  We are going through the site section by section and making upgrades based on the home page redesign we recently launched.  But what may seem mostly cosmetic on the outs...
Posted:  Fri Apr 15 11:41:07 EDT 2005
links for 2005-04-15
Weblog:  The Room
Excerpt:  AIML 1.0.1 (A.L.I.C.E. AI Foundation) (tags: aiml spec) Freakonomics: A Rogue Economist Explores the Hidden Side of Everything by...
Posted:  Fri Apr 15 12:18:11 EDT 2005
Infoworld combines freeform and structured tags
Weblog:  Pagevie.ws
Excerpt:  Matt McAlister talks about what infoworld is doing with tags. It looks interesting and adds to the overall user experiance without adding to the clutter, "What I like most in this new architecture...
Posted:  Sat Apr 16 07:55:37 EDT 2005
The end game for the desktop - and front pages
Weblog:  Notes from Classy's Kitchen
Excerpt:  It's beginning to look like the only thing protecting the desktop from irrelevance is the lack of a widely deployed...
Posted:  Sun Apr 17 16:10:29 EDT 2005
Tags for Your Resource Library?
Weblog:  Influence
Excerpt:  If your policy or issue-focused organization is like most, then chances are good that you maintain an online information library for your key audiences, with content organized around a pre-defined taxonomy. Can you improve your users' ability to find info
Posted:  Mon Apr 18 12:35:41 EDT 2005
Web 2.0: Bottom-up and Self-Organizing
Weblog:  Johnnie Manzari
Excerpt:  When I was working on the first release of Photoshop Album, one of the biggest areas of contention was around tags. It was clear that there was a benefit to building an organizational model around tags, but it was unclear...
Posted:  Tue Apr 19 00:25:41 EDT 2005
xFolk 0.3 — xhtml microformat for emergence
Weblog:  The Community Engine Blog
Excerpt:  In this post, I have described an iteration of xFolk that is much more similar to previous microformat efforts in how it specifies and uses attribute values. This version should be easy to implement in templates and tools. Ten examples from current we...
Posted:  Tue Apr 19 22:37:39 EDT 2005
To Rupert: To Get Relevant Buy Jon Udell
Weblog:  James Governor's MonkChips
Excerpt:  Rupert Murdoch, who really is one of the smartest guys in the room, even in the Autumn season, as opposed to one of those pumped up MBA carpetbaggers, has recognized the need to get real on content, syndication and business...
Posted:  Wed Apr 20 14:09:57 EDT 2005
Report on Les Blogs, Paris [with lots of audio]
Weblog:  R-win.com [weblog] drwxr--r--
Excerpt:  This evening the counter stood at 1,111 photos, photos on Flickr that are tagged with the term 'les blogs'. The Les Blogs conference took place yesterday, Monday, in Paris. Organizer Loic LeMeur (head of Six Apart Europe) had a colorful group of speake...
Posted:  Tue Apr 26 16:36:05 EDT 2005
Infoworld uses del.icio.us to structure articles
Weblog:  XyroX - Digital Lifestyle Magazine
Excerpt:  According to Matt McAlister, IT magzine Infoworld is using del.icio.us to tag their articles and to provide links to related stuff by polling the del.icio.us RSS feeds for the chosen tags. As of now, they are only using tags of their own though, so the...
Posted:  Sat Apr 30 07:50:35 EDT 2005
High Octane Blogging — Computing platform
Weblog:  The Community Engine Blog
Excerpt:  We will be using The Port Network, a great RSS-based system that has the potential to marry back-end information consumption with front-end publishing. Issues we are working around include social bookmarking and training with a very short time frame.
Posted:  Tue May 03 22:30:43 EDT 2005
xFolk Entry 0.4 — Microformat for decentralized tagging
Weblog:  The Community Engine Blog
Excerpt:  xFolk Entry 0.4 is a new iteration of the xFolk microformat that is extremely easy to implement. It enables the publication of tagged bookmarks so that they can be harvested on the web and aggregated into folksonomies. As such, xFolk eliminates the n...
Posted:  Sun Jun 05 23:19:55 EDT 2005
Tagsonomy - Or How I Learned to Stop Worrying and Love The Semantic Web
Weblog:  Matt McAlister
Excerpt:  I was asked to present some of the ideas behind our recent tagging efforts at this year's IDG online meeting in Boston. Strangely, I didn't realize that I was actually presenting the Semantic Web until I finished my slides and finally woke up to what t...
Posted:  Mon Jun 20 14:33:43 EDT 2005
Switching back to del.icio.us from My Web
Weblog:  Matt McAlister
Excerpt:  There are some really interesting aspects of the Yahoo! My Web tool, but none of them are compelling enough to keep me using the service as my primary bookmarking environment now that del.icio.us is part of the team here.  I do hope, however, that...
Posted:  Fri Dec 16 23:59:49 EST 2005
Making your web site weigh less
Weblog:  Matt McAlister
Excerpt:  Like most people, I've made the mistake of overdesigning a web site a few times.  It's easy to do.  You can get excited about the possibilities and forget that most people who come to your site really don't care what you're up to.  They'...
Posted:  Fri Feb 10 12:44:57 EST 2006
Jon Udell, storytelling, and learning through imitation
Weblog:  Matt McAlister
Excerpt:  Jon Udell visited Yahoo! last week to share some of his thoughts on language, visualization, and storytelling, among other things that all connected together in the end despite a seemingly random journey through his recent thoughts.  One bit I rea...
Posted:  Tue Mar 21 12:45:00 EST 2006