<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.2" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>TagShadow Forum &#187; Tag: amazon - Recent Posts</title>
		<link>http://tagshadow.com/forum/tags/amazon</link>
		<description>a quantitative visual SFF book recommendation ... thingy</description>
		<language>en-US</language>
		<pubDate>Sun, 01 Aug 2010 00:18:37 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.2</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://tagshadow.com/forum/search.php</link>
		</textInput>
		<atom:link href="http://tagshadow.com/forum/rss/tags/amazon" rel="self" type="application/rss+xml" />

		<item>
			<title>MentatJack on "The Master Cloud"</title>
			<link>http://tagshadow.com/forum/topic/the-master-cloud#post-5</link>
			<pubDate>Wed, 23 Sep 2009 08:16:08 +0000</pubDate>
			<dc:creator>MentatJack</dc:creator>
			<guid isPermaLink="false">5@http://tagshadow.com/forum/</guid>
			<description>&#60;p&#62;2 big announcements:&#60;/p&#62;
&#60;p&#62;Last night I pulled together a page I'm calling &#60;a href=&#34;http://tagshadow.com/amazon/MasterCloud.html&#34;&#62;The Master Cloud&#60;/a&#62;.  This page provides links to visualizations based on a common tag. On the heels of that improvement, I updated the processed data to include over 70,000 data points divided up over ~1000 tags.&#60;/p&#62;
&#60;p&#62;The idea for breaking the TagShadow visualization (or just &#34;TagShadow&#34; for short) into multiple pages came from 2 directions.  There was just too much data, with 65,000 products and thousands of tags.  So I had to divide everything up in some manner. The most common question I got on the general visualization was &#34;What am I looking at?&#34;  Being able to simply answer &#34;All books that have X tag&#34; works out nicely.&#60;/p&#62;
&#60;p&#62;For other recent updates, be sure to check out &#60;a href=&#34;http://tagshadow.com/amazon/change-log.html&#34;&#62;the change log&#60;/a&#62;.&#60;/p&#62;
&#60;p&#62;I'm narrowing in on this as the final version of the TagShadow Amazon Prototype.  From this point on, development will center on the user-based version.
&#60;/p&#62;</description>
		</item>
		<item>
			<title>MentatJack on "The source of this idea"</title>
			<link>http://tagshadow.com/forum/topic/the-source-of-this-idea#post-2</link>
			<pubDate>Mon, 21 Sep 2009 07:39:38 +0000</pubDate>
			<dc:creator>MentatJack</dc:creator>
			<guid isPermaLink="false">2@http://tagshadow.com/forum/</guid>
			<description>&#60;p&#62;When I saw &#60;a href=&#34;http://opinion.berkeley.edu/&#34;&#62;Opinion Space&#60;/a&#62; I immediately latched on to using the same concept for book recommendations.  I spent a few weeks searching for someone else who had already done this and came up with nothing.&#60;/p&#62;
&#60;p&#62;I wanted TagShadow to be heavily user-based and was immediately struck by the chicken-and-egg problem.  I needed data to process so that potential users would know what it is that I'm trying to do.  I decided to test my code on a version that used data from &#60;a href=&#34;http://www.amazon.com/gp/redirect.html?ie=UTF8&#38;amp;location=http%3A%2F%2Fwww.am&#34;&#62;Amazon&#60;/a&#62;.&#60;/p&#62;
&#60;p&#62;The first thing I realized when I started gathering data on Amazon was that tag usage was rather chaotic.  You see this everywhere.  One person labels science fiction with the tag &#34;sciFi&#34; whereas another person uses &#34;science fiction.&#34;  Some people tag all science fiction additionally as &#34;fantasy&#34;.  Some just settle for &#34;sff&#34; or &#34;speculative fiction.&#34;  I immediately set about dealing with this issue.&#60;/p&#62;
&#60;p&#62;And then I read an article that eased my mind greatly: &#60;a href=&#34;http://www.shirky.com/writings/ontology_overrated.html&#34;&#62;Ontology is Overrated: Categories, Links, and Tags&#60;/a&#62;.  I particularly enjoyed the comparison of Yahoo versus Google, but this is the chunk that really stuck with me:&#60;/p&#62;
&#60;blockquote&#62;&#60;p&#62;This looks relatively simple with the Apple/Mac/OSX example, but when we start to expand to other groups of related words, like movies, film, and cinema, the case for the thesaurus becomes much less clear. I learned this from Brad Fitzpatrick's design for LiveJournal, which allows user to list their own interests. LiveJournal makes absolutely no attempt to enforce solidarity or a thesaurus or a minimal set of terms, no check-box, no drop-box, just free-text typing. Some people say they're interested in movies. Some people say they're interested in film. Some people say they're interested in cinema.&#60;/p&#62;
&#60;p&#62;The cataloguers first reaction to that is, &#34;Oh my god, that means you won't be introducing the movies people to the cinema people!&#34; To which the obvious answer is &#34;Good. The movie people don't want to hang out with the cinema people.&#34; Those terms actually encode different things, and the assertion that restricting vocabularies improves signal assumes that that there's no signal in the difference itself, and no value in protecting the user from too many matches.&#60;/p&#62;&#60;/blockquote&#62;
&#60;p&#62;Once I decided to just work with whatever input I was given, everything just kind of fell into place.  As of this writing, I have a version of the Amazon-backed TagShadow with most of the display functionality that I envisioned. Check out this &#60;a href=&#34;http://tagshadow.com/amazon/pca.php?tagId=85&#34;&#62;alternate history&#60;/a&#62; visualization.
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
