<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>semanticvoid &#187; Tagging</title>
	<atom:link href="http://semanticvoid.com/blog/index.php/category/tagging/feed/" rel="self" type="application/rss+xml" />
	<link>http://semanticvoid.com/blog</link>
	<description>extracting the semantics from the void</description>
	<lastBuildDate>Thu, 22 Sep 2011 21:05:48 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.4</generator>
		<item>
		<title>Particle &#8211; on the way to a Findory</title>
		<link>http://semanticvoid.com/blog/2008/07/11/particle-on-the-way-to-a-findory/</link>
		<comments>http://semanticvoid.com/blog/2008/07/11/particle-on-the-way-to-a-findory/#comments</comments>
		<pubDate>Sat, 12 Jul 2008 04:52:32 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Hacking]]></category>
		<category><![CDATA[Project]]></category>
		<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2008/07/11/particle-on-the-way-to-a-findory/</guid>
		<description><![CDATA[Although I started this project as an experimental weekend thingy (to play around with Google App Engine), the project has shaped up well. Before you surf over to another blog, wondering what the hell I&#8217;m talking about, let me introduce you to &#8220;Personalized ARTICLE&#8221; aggregator (read as PARTICLE). The aim is to personalize a users [...]]]></description>
			<content:encoded><![CDATA[<p><center><a href="http://particle.semanticvoid.com"><img src="http://yucki.appspot.com/images/particle_logo.png" border=0/></a></center><br />
Although I started this project as an experimental weekend thingy (to play around with Google App Engine), the project has shaped up well. Before you surf over to another blog, wondering what the hell I&#8217;m talking about, let me introduce you to &#8220;<a href="http://particle.semanticvoid.com"><b>P</b>ersonalized <b>ARTICLE</b></a>&#8221; aggregator (read as PARTICLE). The aim is to personalize a users online reading (just like what Findory did). Findory was an excellent service and I&#8217;ll be glad if I can achieve even an iota of what Greg created. This project is at very rudimetary and experimental stage. Rather than tapping into the users reading history on the site (monitored by the links clicked), the idea is to study how a users <b>*interests*</b>, scattered around at various &#8220;databases of interest&#8221; like del.icio.us, could be used to personalize online reading (news articles, blogs and more). This way the user could merrily browse the world wide web, bookmarking pages, doing his usual stuff and let PARTICLE worry about making this data useful.</p>
<p><a href="http://particle.semanticvoid.com"><b>Click here to try PARTICLE</b></a></p>
<p>Presently you need to provide PARTICLE with your del.icio.us username, which it uses to analyze your <b>*interests*</b> and present you with recent news stories you may like. It works well if you have a decent number of bookmarks in del.icio.us. As I mentioned, the project is at a very rudimentary stage, so don&#8217;t feel disappointed by the results (ah! the unlucky few). I encourage you to play around with the app and recommend it to others to try. I&#8217;ll be making many changes/additions in the coming weeks.</p>
<p>Test drive PARTICLE at <a href="http://particle.semanticvoid.com">http://particle.semanticvoid.com</a>. Kindly leave your feedback/comments/suggestions in the comments or send me an email at &#8216;anand at semanticvoid.com&#8217;.</p>
<p><b>[UPDATE]</b> Yahoo! Research has a similar project called <a href="http://garcon.sandbox.yahoo.net/index.php">Garçon</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2008/07/11/particle-on-the-way-to-a-findory/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Tag Search: Inferring Relevance From User Authority</title>
		<link>http://semanticvoid.com/blog/2007/03/04/tag-search-inferring-relevance-from-user-authority/</link>
		<comments>http://semanticvoid.com/blog/2007/03/04/tag-search-inferring-relevance-from-user-authority/#comments</comments>
		<pubDate>Sat, 03 Mar 2007 20:27:48 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Algorithm]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Social Networks]]></category>
		<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2007/03/04/tag-search-inferring-relevance-from-user-authority/</guid>
		<description><![CDATA[Search has always been an integral part of any tagging system. Such systems need to make sense out of the abundant user generated metadata such that the documents/items can be ranked in some order. However, very little has been said or written openly about such ranking algorithms for tagging systems. Conventional Methods Most systems, that [...]]]></description>
			<content:encoded><![CDATA[<p>Search has always been an integral part of any tagging system. Such systems need to make sense out of the abundant user generated metadata such that the documents/items can be ranked in some order. However, very little has been said or written openly about such ranking algorithms for tagging systems.</p>
<p><span style="font-weight: bold">Conventional Methods</span></p>
<p>Most systems, that allow tag search, base their rankings on factors like simply the &#8216;number of unique users&#8217; or on ratios like &#8216;number of unique users for tag t / number of unique users for all tags&#8217; etc. These conventional algorithms do work, but not quite so well for large datasets where they can be exploited. They also often do not represent the true relevance. Reminds me often of the pre-<a target="_blank" href="http://en.wikipedia.org/wiki/Page_Rank">PageRank </a>era of information retrieval systems.</p>
<p><span style="font-weight: bold">So, which relevance algorithm do I use?</span></p>
<p>Well, you can always use the conventional methods, but then you can always try the algorithm I devised. This algorithm seems to capture the true essence of relevance in tagging systems. I call it the <span style="font-weight: bold">WisdomRank </span>as it is truly based on the &#8216;wisdom&#8217; of the crowds, the fundamental part of any social system. Read along to understand it in detail (or download the <a target="_blank" title="wisdom rank" href="http://docs.semanticvoid.com/wisdomRank.pdf">pdf</a>).</p>
<hr />
<p align="center" class="MsoNormal" style="text-align: center"><span style="font-size: 16pt; font-family: Tahoma">Inferring relevance for tag search</span></p>
<p align="center" class="MsoNormal" style="text-align: center"><span style="font-size: 16pt; font-family: Tahoma"> from user authority – Abstract</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Tagging is an act of imparting human knowledge/wisdom to objects. Thus a tag, a one word interpretation/categorization of the object by the user, fundamentally represents the basic unit of human wisdom for any object. This wisdom is difficult to quantify as it is relative for every user. One approach to quantify this would be to use the wisdom of the other users to define this for us. This can be done by assuming that every tag corresponds to a topic for which every user has some authority. Also, every tag added to an object corresponds to a vote, similar to the Digg model, asserting that the object belongs to that topic (tag).</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Let us consider a user Ui who has tagged object Oj with the tag Tk. Whenever other users in the system tag Oj with Tk, they are implicitly affirming Ui’s wisdom for tag Tk.</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Thus, we define the function <strong>affirmation</strong> for the <strong>tuple(u, d, t)</strong> as the number of other users who have also tagged document d with tag t:</span></p>
<p align="center" class="MsoNormal" style="text-align: center"><strong><span style="font-family: Tahoma">affirmation(u, d, t) = ∑<sub>i=All users except ‘u’</sub> tagged(u<sub>i</sub>, d, t)</span></strong></p>
<p><span style="font-family: Tahoma">where,</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">          u – the user<br />
d – the document/object<br />
t – the tag<br />
tagged – 1 if the user Ui has tagged d with t<br />
-  0 otherwise</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Hence, we can proceed to define the wisdom of the user for a topic (tag) t as the sum of all such assertions by other users,</span><strong><span style="font-family: Tahoma"><br />
</span></strong></p>
<p align="center" class="MsoNormal" style="text-align: center"><strong><span style="font-family: Tahoma">wisdom(u, t) = ∑<sub>x=For all documents d tagged with tag t by U </sub>affirmation(u, d, t)</span><span style="font-family: Tahoma"><br />
</span></strong></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Likewise, we can now define the <strong>authority</strong> of a user for the topic <strong>t</strong>, as the ratio of the user’s wisdom to the collective wisdom for <strong>t</strong>. Hence,<br />
</span>
</p>
<p align="center" class="MsoNormal" style="text-align: center; text-indent: 0.5in"><strong><span style="font-family: Tahoma">authority(u, t) = wisdom(u, t) / ∑ wisdom(u<sub>i</sub>, t)</span></strong></p>
<p class="MsoNormal" style="text-indent: 0.5in"><strong><span style="font-family: Tahoma"> </span></strong><span style="font-family: Tahoma">For example: Let us determine the authority of user u1 for tag t1</span><br />
<strong><span style="font-family: Tahoma" /></strong><br />
<span style="font-family: Tahoma">          Object d1:    Object d2:    Object d3:<br />
t1 by u1                    </span><span style="font-family: Tahoma">    </span><span style="font-family: Tahoma">t1 by u1                   </span><span style="font-family: Tahoma">    </span><span style="font-family: Tahoma">  t1 by u2<br />
t1 by u2                   </span><span style="font-family: Tahoma">     </span><span style="font-family: Tahoma">t3 by u1                  </span><span style="font-family: Tahoma">       </span><span style="font-family: Tahoma">t1 by u3<br />
t1 by u3                    </span><span style="font-family: Tahoma">    </span><span style="font-family: Tahoma">t3 by u1<br />
t2 by u1</span><br />
<span style="font-family: Tahoma" /></p>
<p class="MsoNormal" style="text-indent: 0.5in"><span style="font-family: Tahoma">affirmation(u1, d1, t1) = 2          affirmation(u1, d2, t1) = 0<br />
Hence, wisdom(u1, t1) = 2</span></p>
<p>Likewise for other users,</p>
<p class="MsoNormal" style="text-indent: 0.5in"><span style="font-family: Tahoma">wisdom(u2, t1) = 3<br />
wisdom(u3, t1) = 3</span>
</p>
<p class="MsoNormal"><span style="font-family: Tahoma"> Hence the authority of user u1 for t1 is as follows:</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">    authority(u1, t1) = 2 / (2 + 3 + 3) = 2 / 8 = 0.25</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Whenever a user tags an object with a tag, he does so with the authority he possesses for that tag. Thus as compared to conventional methods, where the objects are usually ranked on the number of instances of the tags, in this method the measure of the relevance of a tag for an object is equivalent to the sum of all such user authorities. Thus,</span></p>
<p align="center" class="MsoNormal" style="text-align: center"><strong><span style="font-family: Tahoma">relevance_metric(d, t) = ∑<sub>i= all user who have tagged document d with t </sub>authority(u, t)</span></strong></p>
<p class="MsoNormal"><strong><span style="font-family: Tahoma" /></strong><span style="font-family: Tahoma">This relevance score, when calculated for every tag would provide an accurate measure for ranking the objects. As compared to the conventional methods where more number of instances of a tag for an object ensured a higher relevance for that tag, here the number of authoritative users counts.</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">Let us consider the following example:</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma"> </span><span style="font-family: Tahoma">          Object d1:</span><span style="font-family: Tahoma">    </span><span style="font-family: Tahoma">Object d2:<br />
t1 by u1                    </span><span style="font-family: Tahoma">    </span><span style="font-family: Tahoma"> t1 by u2<br />
t2 by u5                  </span><span style="font-family: Tahoma">       </span><span style="font-family: Tahoma">t1 by u3<br />
t1 by u4</span>
</p>
<p class="MsoNormal"><span style="font-family: Tahoma"> Let us assume that u1 has a very high authority for tag t1. Hence in the above scenario, a search for tag t1 may rank d1 higher than d2, if </span></p>
<p class="MsoNormal"><span style="font-family: Tahoma"> authority(u1, t1) <strong>></strong> authority(u2, t1) + authority(u3, t1) + authority(u4, t1)</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">This result is with the assumption that u1’s authority is greater than those of u2,u3 and u4 combined.</span></p>
<p class="MsoNormal"><span style="font-family: Tahoma">On the other hand, d2 would be ranked higher than d1 if the combined authorities of u2, u3 and u4 exceed that of u1. If the majority of the users are suggesting something, it indicates that their suggestion is far more valuable than that of an individual user or a subset of users.</span></p>
<p><strong><span style="font-family: Tahoma">Future Enhancements</span></strong></p>
<p class="MsoNormal"><strong><span style="font-family: Tahoma" /></strong><span style="font-family: Tahoma">While calculating the user assertions this algorithm currently considers all such users as equal even though they may have varying authorities for the corresponding tag. As a future enhancement, I plan to incorporate the authorities of the users as well into the affirmation calculations. </span></p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2007/03/04/tag-search-inferring-relevance-from-user-authority/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Interpreting Bookmarking</title>
		<link>http://semanticvoid.com/blog/2006/07/11/bookmarking-interpreted/</link>
		<comments>http://semanticvoid.com/blog/2006/07/11/bookmarking-interpreted/#comments</comments>
		<pubDate>Tue, 11 Jul 2006 14:04:11 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2006/07/11/bookmarking-interpreted/</guid>
		<description><![CDATA[In a Social bookmarking system, users store lists of Internet resources, which they find useful. They also categorize their resources by the use of informally assigned, user-defined keywords or tags. [via Wikipedia] One way of interpreting bookmarking systems is as the above ie simply a list of &#8216;resources which they find useful&#8216;. If we consider [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p>In a <strong>Social bookmarking</strong> system, users store lists of Internet resources, which they find useful. They also categorize their resources by the use of informally assigned, user-defined <em>keywords</em> or tags. [via <a target="_blank" href="http://en.wikipedia.org/wiki/Social_bookmarking">Wikipedia</a>]</p></blockquote>
<p>One way of interpreting bookmarking systems is as the above ie simply a list of &#8216;<em>resources which they find useful</em>&#8216;. If we consider bookmarking systems like del.icio.us, as per the above the bookmarks are just lists of urls which the user finds useful.</p>
<p>The same if viewed from another perspective could be interpreted as &#8216;<em>resources which they find interesting</em>&#8216;. Hence a bookmarking system behaves as an &#8216;<em>interest management system</em>&#8216;.</p>
<p>Yet another perspective, which is relevant to search engines, is the &#8216;<em>history of visited pages labelled by keywords</em>&#8216;. This gives the search engines information about the likelihood of a user clicking on a result (the bookmarked link) for specific keywords (tags), something intrinsic to personalized search.</p>
<p>These are the various perspectives I intend to exploit with the tool I&#8217;ve started developing (<a title="Automated Tagging" target="_blank" href="http://blog.semanticvoid.com/2006/05/24/tagging-learning-automated-tagging/">read about the core concept here</a>). I will be developing this tool initally on <a target="_blank" href="http://www.simpy.com/">Simpy</a> as Otis Gospodnetic (author of LIA book and the one behind Simpy) has offered his valuable support.</p>
<p>In the meantime you can ponder over other ways to look at bookmarking and possibly list it down here as well.</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2006/07/11/bookmarking-interpreted/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Tagging + Learning = Automated Tagging</title>
		<link>http://semanticvoid.com/blog/2006/05/24/tagging-learning-automated-tagging/</link>
		<comments>http://semanticvoid.com/blog/2006/05/24/tagging-learning-automated-tagging/#comments</comments>
		<pubDate>Wed, 24 May 2006 08:52:00 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2006/05/24/tagging-learning-automated-tagging/</guid>
		<description><![CDATA[What happens when you cross Tagging with Machine Learning? Well you get a tool that learns as you tag. Sounds interesting? Then read on. Learns? But what. Thats what I asked myself yesterday. What I visualised was a tool that learns how to tag and what to tag from you. It would learn from past [...]]]></description>
			<content:encoded><![CDATA[<p>What happens when you cross Tagging with Machine Learning? Well you get a tool that learns as you tag. Sounds interesting? Then read on.</p>
<p><strong>Learns? But what.</strong></p>
<p>Thats what I asked myself yesterday. What I visualised was a tool that learns <strong>how to tag</strong> and <strong>what to tag</strong> from you. It would learn from past experiences, what you would like to tag and also what keywords (tags) you would likely use for them. And finally one day, when it has a large enough knowledge base, it could probably automate the entire task for you.</p>
<p><strong>Imagine</strong></p>
<p>Imagine having a your own personal crawler, spidering the web in search of pages that might interest you and even saving the most likely ones. Imagine coming to office and seeing your <em>toRead</em> list already populated by the bot.</p>
<p>Sounds too optimistic? Well I&#8217;ll give it a try. Until then you&#8217;ll have to do what humans do best &#8211; tagging &#8211; on your own.</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2006/05/24/tagging-learning-automated-tagging/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>The Evolution Of Tagging</title>
		<link>http://semanticvoid.com/blog/2006/05/18/the-evolution-of-tagging/</link>
		<comments>http://semanticvoid.com/blog/2006/05/18/the-evolution-of-tagging/#comments</comments>
		<pubDate>Wed, 17 May 2006 19:48:38 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2006/05/18/the-evolution-of-tagging/</guid>
		<description><![CDATA[The Present And The Future Tagging has been there for quite sometime now, although it seems to be picking momentum after the Web2.0 meme. But the question one needs to answer is that &#8220;Has tagging evolved?&#8220;. It has yet to evolve out of its stone age era. Tagging basically deals with organizing information retrieval. But [...]]]></description>
			<content:encoded><![CDATA[<p><strong>The Present And The Future</strong></p>
<p>Tagging has been there for quite sometime now, although it seems to be picking momentum after the Web2.0 meme. But the question one needs to answer is that &#8220;<em>Has tagging evolved?</em>&#8220;. It has yet to evolve out of its <em>stone age</em> era.</p>
<p>Tagging basically deals with <strong>organizing information retrieval</strong>. But yet current systems don&#8217;t seem to apply any of the information retrieval optimizations to it. It could prove useful and relevant if it was treated as a mere search rather than a whole new concept (the very reason why non-techie users are not lured into tagging). It would also prove to be more accurate if the IR preprocessing like stemming, synonym etc could be applied to it. The knowledge acquired in the process is very valuable due to the human intelligence behind it and can be exploited in many useful ways.<br />
But tagging has evolved to some extent. It has evolved from single word tags to multi-word tags. It has evolved in terms of granularity from the Document (del.icio.us) to the Content (recoja, Google Notebook).</p>
<p>What we need to focus on is what more can be done with it (tagging) rather than just replicate what already can be done with it. What do you think could be the evolutionary steps in tagging?</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2006/05/18/the-evolution-of-tagging/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Goes Co-operative Finally</title>
		<link>http://semanticvoid.com/blog/2006/05/14/google-goes-co-operative-finally/</link>
		<comments>http://semanticvoid.com/blog/2006/05/14/google-goes-co-operative-finally/#comments</comments>
		<pubDate>Sun, 14 May 2006 13:09:32 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2006/05/14/google-goes-co-operative-finally/</guid>
		<description><![CDATA[Something which was bound to happen. After all even Google has to rely on human intelligence to do some of its work. Google Coop helps people contribute their expertise (in other terms bookmark links) by adding labels (categories) and annotating (description) them. But it goes way beyond the del.icio.us mode by using this information in [...]]]></description>
			<content:encoded><![CDATA[<p>Something which was bound to happen. After all even Google has to rely on human intelligence to do some of its work. <a target="_blank" title="Google Coop" href="http://www.google.com/coop">Google Coop</a> helps people contribute their expertise (in other terms bookmark links) by adding labels (categories) and annotating (description) them. But it goes way beyond the del.icio.us mode by using this information in improvising search.</p>
<p>In all the forums I visited I came across this one point again and again: &#8220;<em>Google Coop is susceptible to spammers</em>&#8220;. I don&#8217;t agree. Like any other social app which is fuelled by the people, at the first glance it does seem susceptible. But it is the social factor that seems to ward of spammers. Quoting a FAQ from Google Coop:</p>
<blockquote><p><strong> Who will see my labels?</strong></p>
<p>Users who subscribe to you will see your labels for relevant searches. As your labels become higher quality and more comprehensive, and as more users subscribe to you, your labels may start surfacing to more Google users than just those who explicitly subscribed. A number of factors help determine how broadly your labels appear &#8212; such as the number of subscribers you have, how many websites you&#8217;ve labeled, and, most importantly, how often your labels make it easier for users to find what they&#8217;re looking for.</p></blockquote>
<p>Google seems to have realised that it can achieve a lot more by utilizing the human intelligence, <em>Intelligence </em>which is the core of the Web 2.0. After all the ubiquitous search is also based on human knowledge (creation of links between pages).</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2006/05/14/google-goes-co-operative-finally/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tag Cloud Font Distribution Algorithm</title>
		<link>http://semanticvoid.com/blog/2006/01/06/tag-cloud-font-distribution-algorithm/</link>
		<comments>http://semanticvoid.com/blog/2006/01/06/tag-cloud-font-distribution-algorithm/#comments</comments>
		<pubDate>Fri, 06 Jan 2006 18:19:48 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2006/01/06/tag-cloud-font-distribution-algorithm/</guid>
		<description><![CDATA[Ever thought of how the collection of tags with varying fontsizes (known as Tag Cloud) populated. As I say &#8216;theres an algorithm for everything&#8217;, theres an algorithm for this too. Assuming you know all about tag popularity (if not refer previous post) I&#8217;ll go ahead explaining it. The distinct feature of tag clouds are the [...]]]></description>
			<content:encoded><![CDATA[<p>Ever thought of how the collection of tags with varying fontsizes (known as Tag Cloud) populated. As I say &#8216;theres an algorithm for everything&#8217;, theres an algorithm for this too. Assuming you know all about tag popularity (if not refer <a title="Calculation Of Tag Popularity" href="http://semanticvoid.com/blog/2006/01/02/calculation-of-tag-popularity/">previous post</a>) I&#8217;ll go ahead explaining it.</p>
<p>The distinct feature of tag clouds are the different groups of font sizes. Now the number of such groups desired depends entirely upon the developer. Usually having six such size-groups proves optimal.</p>
<p>Assume any suitable metric for measuring popularity (for instance &#8216;number of users using the tag&#8217;). We can always obtain the max and min numbers for the same. For example:</p>
<p>max(Popularity) = 130<br />
min(Popularity) = 35</p>
<p>Therefore we can define one block of font-size as :<br />
( max(Popularity) &#8211; min(Popularity) ) / 6</p>
<p>For the above values we get one such block range as (130 &#8211; 35) / 6 = 15.83 ~ 16<br />
Font-sizes therefore could be bound as follows:</p>
<p>Range        Font-Size<br />
35 to 51           1<br />
52 to 68           2<br />
69 to 85           3<br />
86 to 102         4<br />
103 to 119       5<br />
120 to 136       6</p>
<p>Thats as easy as it can get.</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2006/01/06/tag-cloud-font-distribution-algorithm/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Calculation Of Tag Popularity</title>
		<link>http://semanticvoid.com/blog/2006/01/02/calculation-of-tag-popularity/</link>
		<comments>http://semanticvoid.com/blog/2006/01/02/calculation-of-tag-popularity/#comments</comments>
		<pubDate>Mon, 02 Jan 2006 19:38:13 +0000</pubDate>
		<dc:creator>Anand Kishore</dc:creator>
				<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://semanticvoid.com/blog/2006/01/02/calculation-of-tag-popularity/</guid>
		<description><![CDATA[Determinig the popularity of tags has very fluid solutions which keep changing from application to application. But in general one metric that can be used is the number of unique items tagged using the particular tag. Secondly another metric that is the number of unique users using this tag could also be used. I&#8217;ve come [...]]]></description>
			<content:encoded><![CDATA[<p>Determinig the popularity of tags has very fluid solutions which keep changing from application to application. But in general one metric that can be used is the number of unique items tagged using the particular tag. Secondly another metric that is the number of unique users using this tag could also be used. I&#8217;ve come up with a formula that encompasses both of these:</p>
<p><span style="font-weight: bold">( Usage Count / Number of tagged Items ) * ( User Count / Number of Taggers )</span></p>
<p>where,<br />
Usage Count (UsgCnt) : the number of unique items having the tag.<br />
Number of tagged Items (NTI) : the total number of items having atleast one tag (i.e. items participating in tagging)<br />
User Count (UsrCnt) : the number of users using this tag.<br />
Number of Taggers (NOT) : the total number of users participating in tagging.</p>
<p><span style="font-weight: bold">Case 1:</span><br />
UsgCnt = 15, NTI = 40, UsrCnt = 2, NOT = 20<br />
Popularity = 0.0375<br />
Note: This represents a case in which the two users may be trying to spam the system by tagging many items by the specific tag.</p>
<p><span style="font-weight: bold">Case 2:</span><br />
UsgCnt = 15, NTI = 40, UsrCnt = 9, NOT = 20<br />
Popularity = 0.1685<br />
Note: Here we clearly see that as the number of users using this tag increases the popularity increases as well (suggesting no spam but folksonomy).</p>
<p><span style="font-weight: bold">Case 3:</span><br />
UsgCnt = 15, NTI = 40, UsrCnt = 1, NOT = 1<br />
Popularity = 0.375<br />
Note: Here it can be noted that if there is only one user in the system the popularity becomes independent of the user ratio and depends entirely on the tagged items ratio.</p>
<p><span style="font-weight: bold">Case 4:</span><br />
UsgCnt = 40, NTI = 40, UsrCnt = 10, NOT = 20<br />
Popularity = 0.5<br />
Note: In this case if all the messages in the system are tagged using the specific tag (UsgCnt = NTI ) the popularity depends entirely on the number of users using this tag.</p>
<p>This gives a fairly rough idea of tag popularity calculation.</p>
]]></content:encoded>
			<wfw:commentRss>http://semanticvoid.com/blog/2006/01/02/calculation-of-tag-popularity/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.629 seconds -->

