<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nandeshwar.info &#187; visualization</title>
	<atom:link href="http://nandeshwar.info/tag/visualization/feed/" rel="self" type="application/rss+xml" />
	<link>http://nandeshwar.info</link>
	<description></description>
	<lastBuildDate>Wed, 18 Aug 2010 19:24:47 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Mining publication data</title>
		<link>http://nandeshwar.info/2010/03/08/mining-publication-data/</link>
		<comments>http://nandeshwar.info/2010/03/08/mining-publication-data/#comments</comments>
		<pubDate>Mon, 08 Mar 2010 15:57:24 +0000</pubDate>
		<dc:creator>a7n9</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[scripting]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://nandeshwar.info/?p=315</guid>
		<description><![CDATA[I found treasure! Publication and citation data with metadata (author names, addresses, affiliation): http://citeseer.ist.psu.edu/oai.html 
I was reading about knowledge management here, which says that knowledge management is nonsense. I agree to a certain degree, not because of the field, but because of its name. How do you manage knowledge? Isn&#8217;t knowledge derived? Wasn&#8217;t information &#8220;science&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>I found treasure! Publication and citation data with metadata (author names, addresses, affiliation): <a href="http://citeseer.ist.psu.edu/oai.html">http://citeseer.ist.psu.edu/oai.html </a></p>
<p>I was reading about knowledge management <a href="http://informationr.net/ir/8-1/paper144.html?referer=www.clickfind.com.au">here</a>, which says that knowledge management is nonsense. I agree to a certain degree, not because of the field, but because of its name. How do you manage knowledge? Isn&#8217;t knowledge derived? Wasn&#8217;t information &#8220;science&#8221; good enough? (I have problem with &#8220;business intelligence&#8221; as well&#8230;) As the author of that article says, it is a new term coined to attract attention. He does provide some evidence, but I was left unsatisfied.</p>
<p>I thought of performing text mining on publications database, and citeseer has this great resource. I downloaded the data (72 XML files), performed some clean-up, and ran a script to pull citeseer ID, author addresses, and publication dates where the abstract contained the term &#8220;knowledge management&#8221;. I was interested in seeing the trend of publication and places of publication.</p>
<p style="text-align: center;">Have a look at this chart:<br />
<a href="http://nandeshwar.info/wp-content/uploads/2010/03/NoPubsbyYear.JPG"><img class="aligncenter size-full wp-image-317" title="Publications by year" src="http://nandeshwar.info/wp-content/uploads/2010/03/NoPubsbyYear.JPG" alt="Publications by year" width="573" height="587" /></a></p>
<p>There is a definite growth in this area, at least in research and publications. It is startling to see a paper published in 1970, and a peak in 2002. As citeseer data ends in 2004, it is possible that it doesn&#8217;t have complete publication history of 2004.</p>
<p>Geographic location wise, the US and Europe leads the way in number of publications:<br />
<a href="http://nandeshwar.info/wp-content/uploads/2010/03/WorldMapPubs.JPG"><img class="aligncenter size-medium wp-image-318" title="Worldwide Publications " src="http://nandeshwar.info/wp-content/uploads/2010/03/WorldMapPubs-300x168.jpg" alt="Worldwide Publications " width="300" height="168" /></a></p>
<div class='bookmarkify'><a name='bookmarkify'></a><div class='title' title='Use these links to share this page with others'>Share</div><div class='linkbuttons'><a href='http://www.citeulike.org/posturl?url=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Save to CiteULike' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/citeulike.png' style='width:16px; height:16px;' alt='[CiteULike] ' /></a> <a href='http://del.icio.us/post?url=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Save to del.icio.us' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/delicious.png' style='width:16px; height:16px;' alt='[del.icio.us] ' /></a> <a href='http://digg.com/submit?phase=2&amp;url=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Digg It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/digg.png' style='width:16px; height:16px;' alt='[Digg] ' /></a> <a href='http://www.facebook.com/share.php?u=http://nandeshwar.info/2010/03/08/mining-publication-data/' title='Save to Facebook' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/facebook.png' style='width:16px; height:16px;' alt='[Facebook] ' /></a> <a href='http://www.furl.net/storeIt.jsp?u=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;t=Mining publication data' title='Save to Furl' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/furl.png' style='width:16px; height:16px;' alt='[Furl] ' /></a> <a href='http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Save to Google Bookmarks' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/google.png' style='width:16px; height:16px;' alt='[Google] ' /></a> <a href='http://reddit.com/submit?url=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Reddit' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/reddit.png' style='width:16px; height:16px;' alt='[Reddit] ' /></a> <a href='http://slashdot.org/bookmark.pl?url=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Slashdot It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/slashdot.png' style='width:16px; height:16px;' alt='[Slashdot] ' /></a> <a href='http://www.stumbleupon.com/submit?url=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;title=Mining publication data' title='Stumble It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/stumbleupon.png' style='width:16px; height:16px;' alt='[StumbleUpon] ' /></a> <a href='http://technorati.com/faves?add=http://nandeshwar.info/2010/03/08/mining-publication-data/' title='Add to my Technorati Favorites' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/technorati.png' style='width:16px; height:16px;' alt='[Technorati] ' /></a> <a href='http://twitter.com/home/?status=Mining publication data+http://nandeshwar.info/2010/03/08/mining-publication-data/' title='Save to Twitter' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/twitter.png' style='width:16px; height:16px;' alt='[Twitter] ' /></a> <a href='http://www.feedburner.com/fb/a/emailFlare?itemTitle=Mining publication data&amp;uri=http://nandeshwar.info/2010/03/08/mining-publication-data/&amp;loc=en_US' title='Email this to a friend' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/email.png' style='width:16px; height:16px;' alt='[Email] ' /></a>  <a title='See more bookmark and sharing options...' href='http://nandeshwar.info/2010/03/08/mining-publication-data/#bookmarkify' rel='nofollow'><small>More&nbsp;&raquo;</small></a></div><div class='brand'><small><a href='http://www.bookmarkify.com/'>Powered by Bookmarkify&trade;</a></small></div></div>]]></content:encoded>
			<wfw:commentRss>http://nandeshwar.info/2010/03/08/mining-publication-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>My experiments with sparklines</title>
		<link>http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/</link>
		<comments>http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/#comments</comments>
		<pubDate>Wed, 23 Sep 2009 18:05:19 +0000</pubDate>
		<dc:creator>a7n9</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Access]]></category>
		<category><![CDATA[dashboard]]></category>
		<category><![CDATA[excel]]></category>
		<category><![CDATA[LaTex]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[report]]></category>
		<category><![CDATA[sparklines]]></category>
		<category><![CDATA[VBA]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://nandeshwar.info/?p=293</guid>
		<description><![CDATA[For the uninitiated, Sparklines are &#8220;data-intense, design-simple, word-sized graphics&#8221; according to its inventor Edward Tufte. I always wanted to include them in my trend reports. The challenge: How and Which tool to use?
The data came from this report came from a complex query using SQL server and Access that had this format:



College
Major
2005
2006
2007
2008
2009


A
B
10
15
14
18
25



Some options I investigated [...]]]></description>
			<content:encoded><![CDATA[<p>For the uninitiated, <a title="Sparkline" href="http://en.wikipedia.org/wiki/Sparkline">Sparklines</a> are &#8220;data-intense, design-simple, word-sized graphics&#8221; according to its inventor <a title="Edward Tufte" href="http://en.wikipedia.org/wiki/Edward_Tufte">Edward Tufte</a>. I always wanted to include them in my trend reports. The challenge: How and Which tool to use?</p>
<p>The data came from this report came from a complex query using SQL server and Access that had this format:</p>
<table border="0">
<tbody>
<tr>
<td>College</td>
<td>Major</td>
<td>2005</td>
<td>2006</td>
<td>2007</td>
<td>2008</td>
<td>2009</td>
</tr>
<tr>
<td>A</td>
<td>B</td>
<td>10</td>
<td>15</td>
<td>14</td>
<td>18</td>
<td>25</td>
</tr>
</tbody>
</table>
<p>Some options I investigated and tried (be sure to read <a href="http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0000Lk">this page</a>, you might find many more):</p>
<ul>
<li>Google&#8217;s Chart API (<a href="http://code.google.com/apis/chart/types.html">http://code.google.com/apis/chart/types.html</a>): you can embed these in Google spreadsheets,  create HTML pages using <a href="http://www.cazh1.com/blogger/thoughts/2009/01/hacking-google-chart-api-from-excel.shtml">Excel via VBA</a>, or embed them in<a href="http://www.tushar-mehta.com/publish_train/xl_vba_cases/excel_google_chart_api/index.htm"> Excel sheets</a>. Alas, none would work the way I would like them to work: In-cell graphics</li>
<li> <a href="http://sparklines-excel.blogspot.com/">Use Sparklines for Excel add-in</a>: this add-in will create great in-cell charts (bar, bullet graphs, sparklines, etc), but copying them down is difficult and resource intensive, and any change you make in the column size will alter the shape of that chart. In addition, I had more than 400 rows to populate&#8211;these would be too many objects in a spreadsheet for Excel to handle. My machine froze when I tried to copy it for 10 rows.</li>
<li>Use R&#8217;s implementation by Jason Dieterle (search on <a href="http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0000Lk">this page</a> for Jason Dieterle (email), January 28, 2008. Works very good. I modified the code to print max and mins only, but the function generates a graphic, which needed to be embeded in LaTex file. It did not work nicely. Charts were too big to fit in a cell of a table.</li>
<li>Create bar charts in <a href="http://www.juiceanalytics.com/writing/lightweight-data-exploration-in-excel/">Excel using REPT function</a>. I tried different font sizes and styles, but it didn&#8217;t look pretty, for the range of data varied. (Hint: make the alignment of text 90, use pipe signs, create columns and graphs for each data value, remove gridlines, keep the columns very close. It almost worked.)</li>
<li>Use <a href="http://jblevins.org/projects/spark/">spark</a> package for LaTex, doesn&#8217;t work in pdflatex, and you have to play a lot with the settings</li>
<li>Use <a href="http://www.ctan.org/tex-archive/help/Catalogue/entries/sparklines.html">sparklines </a>package for LaTex. Problem was that data needed to be normalized (or scaled) from 0 to 1, and needed extra parameters for min and max points. Solution: create a normalize function in Excel, and write a big formula to produce the exact needed string for the sparklines to work i.e.:<br />
<code>\begin{sparkline}{5}<br />
\sparkdot 1 1 blue<br />
\sparkdot 0.2 0 red<br />
\spark 0.2 0 0.4 0.0625 0.6 0.5625 0.8 0.75 1 1 /<br />
\end{sparkline}</code></li>
</ul>
<p>Here&#8217;s the normalize function:</p>

<div class="wp_syntax"><div class="code"><pre class="vb" style="font-family:monospace;"><span style="color: #000080;">Public</span> <span style="color: #000080;">Function</span> Normalize(cell2Normalize <span style="color: #000080;">As</span> Range, WholeRng <span style="color: #000080;">As</span> Range)
       Normalize = (cell2Normalize.Value - WorksheetFunction.Min(WholeRng)) / (WorksheetFunction.Max(WholeRng) - WorksheetFunction.Min(WholeRng))
<span style="color: #000080;">End</span> <span style="color: #000080;">Function</span></pre></div></div>

<p>Here&#8217;s the big formula to convert the range for data values from 2005 to 2009 (in the range E2:I2) to the sparkline LaTex environment.<br />
<code><br />
="\begin{sparkline}{5} " &amp; "\sparkdot " &amp; CHOOSE(MATCH(MAX(E2:I2),E2:I2,0),0.2,0.4,0.6,0.8,1) &amp; " " &amp; 1 &amp; " blue " &amp; "\sparkdot " &amp; CHOOSE(MATCH(MIN(E2:I2),E2:I2,0),0.2,0.4,0.6,0.8,1) &amp; " " &amp; 0 &amp; " red " &amp; " \spark 0.2 " &amp; Normalize(E2,E2:I2) &amp; " 0.4 " &amp; Normalize(F2,E2:I2) &amp; " 0.6 " &amp; Normalize(G2,E2:I2) &amp; " 0.8 " &amp; Normalize(H2,E2:I2) &amp; " 1 " &amp; Normalize(I2,E2:I2) &amp; " / \end{sparkline}"</code></p>
<p>This last option worked beautifully. I dragged the formula down. I selected the data, and clicked on &#8220;Convert Table to LaTex&#8221; button (using this<a href="http://dataninja.wordpress.com/2006/01/20/excel-to-latex/"> add-in</a>). Copied the LaTex code to clipboard and pasted it in my LaTex editor. Manually merged the rows for colleges (using \multirow), and generated a beautiful looking pdf.</p>
<p>I was very happy. Printed it in color. Got a request back very soon that there should be total rows.</p>
<p>I forgot about the sparklines and created a report in Access with plain old numbers in less than 15 mins. Gave it back.</p>
<p>I tried to repeat this &#8211; <a href="http://www.youtube.com/watch?v=EK73YYo0CUk">Sparkline in Cognos</a> -in Access with no luck.</p>
<p>(I later tried it one more time: Got the data in Excel using External data&gt; Access, created a pivottable with rows and all, did some formatting, copied and pasted values and formats, inserted sparkline code, converted it to LaTex, copied and pasted in LaTex editor, and here&#8217;s the beautiful looking <a href="http://nandeshwar.info/wp-content/uploads/2009/09/CollegeEnrSample.pdf">sample pdf</a> of 16)</p>
<p>I wish there were simple reporting solutions that included awesome data visualization tools. (BTW, Excel 2010 will have sparklines: <a href="http://blogs.msdn.com/excel/archive/2009/07/17/sparklines-in-excel.aspx">link</a>) For this report, I did try Sweave, R, and LaTex, but because of the time constraints I could not investigate it further.</p>
<p>Please comment if you know any other way which meet (or don&#8217;t, Tableau is certainly one) these conditions: inexpensive (read free), efficient,  and repeatable.</p>
<div class='bookmarkify'><a name='bookmarkify'></a><div class='title' title='Use these links to share this page with others'>Share</div><div class='linkbuttons'><a href='http://www.citeulike.org/posturl?url=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Save to CiteULike' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/citeulike.png' style='width:16px; height:16px;' alt='[CiteULike] ' /></a> <a href='http://del.icio.us/post?url=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Save to del.icio.us' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/delicious.png' style='width:16px; height:16px;' alt='[del.icio.us] ' /></a> <a href='http://digg.com/submit?phase=2&amp;url=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Digg It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/digg.png' style='width:16px; height:16px;' alt='[Digg] ' /></a> <a href='http://www.facebook.com/share.php?u=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/' title='Save to Facebook' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/facebook.png' style='width:16px; height:16px;' alt='[Facebook] ' /></a> <a href='http://www.furl.net/storeIt.jsp?u=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;t=My experiments with sparklines' title='Save to Furl' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/furl.png' style='width:16px; height:16px;' alt='[Furl] ' /></a> <a href='http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Save to Google Bookmarks' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/google.png' style='width:16px; height:16px;' alt='[Google] ' /></a> <a href='http://reddit.com/submit?url=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Reddit' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/reddit.png' style='width:16px; height:16px;' alt='[Reddit] ' /></a> <a href='http://slashdot.org/bookmark.pl?url=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Slashdot It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/slashdot.png' style='width:16px; height:16px;' alt='[Slashdot] ' /></a> <a href='http://www.stumbleupon.com/submit?url=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;title=My experiments with sparklines' title='Stumble It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/stumbleupon.png' style='width:16px; height:16px;' alt='[StumbleUpon] ' /></a> <a href='http://technorati.com/faves?add=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/' title='Add to my Technorati Favorites' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/technorati.png' style='width:16px; height:16px;' alt='[Technorati] ' /></a> <a href='http://twitter.com/home/?status=My experiments with sparklines+http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/' title='Save to Twitter' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/twitter.png' style='width:16px; height:16px;' alt='[Twitter] ' /></a> <a href='http://www.feedburner.com/fb/a/emailFlare?itemTitle=My experiments with sparklines&amp;uri=http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/&amp;loc=en_US' title='Email this to a friend' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/email.png' style='width:16px; height:16px;' alt='[Email] ' /></a>  <a title='See more bookmark and sharing options...' href='http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/#bookmarkify' rel='nofollow'><small>More&nbsp;&raquo;</small></a></div><div class='brand'><small><a href='http://www.bookmarkify.com/'>Powered by Bookmarkify&trade;</a></small></div></div>]]></content:encoded>
			<wfw:commentRss>http://nandeshwar.info/2009/09/23/my-experiments-with-sparklines/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Tag Cloud of Data Mining Jobs</title>
		<link>http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/</link>
		<comments>http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/#comments</comments>
		<pubDate>Thu, 20 Aug 2009 14:13:48 +0000</pubDate>
		<dc:creator>a7n9</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[pipes]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[stemming]]></category>
		<category><![CDATA[tag cloud]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://nandeshwar.info/?p=260</guid>
		<description><![CDATA[Here&#8217;s what I did to get a cool looking tag cloud of data mining jobs:

Used Yahoo Pipes (I created mine, but this one has more feeds)&#8211; this pipe aggregates feeds from different job web-sites, and gives the user unique job listing that you can subscribe via RSS:  Job Feed Aggregator by Sean Dolan 
Subscribed [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s what I did to get a cool looking tag cloud of data mining jobs:</p>
<ol>
<li>Used Yahoo Pipes (I created mine, but this one has more feeds)&#8211; this pipe aggregates feeds from different job web-sites, and gives the user unique job listing that you can subscribe via RSS:  <a href="http://pipes.yahoo.com/pipes/pipe.info?_id=50bf0b7cbcf40213deb98f1314dedf51">Job Feed Aggregator by Sean Dolan </a></li>
<li>Subscribed to the RSS feed for the keyword &#8220;data mining&#8221;</li>
<li>Copied the job descriptions and requirements of many jobs, and saved the text file</li>
<li>Got the <a href="http://tartarus.org/~martin/PorterStemmer/index-old.html">python stemmer </a></li>
<li>Applied the python stemmer to the text file. Stemmer truncates words to their roots, so that we can combine variants of a word into a single word. (First or second step in text mining)</li>
<li>Created a tag cloud using the services of <a href="http://www.wordle.net/">http://www.wordle.net/</a> . They use &#8220;stop words,&#8221; so I didn&#8217;t have to apply those. Stop words are common words, which necessarily don&#8217;t add any value for categorization, of a language.</li>
</ol>
<div id="attachment_261" class="wp-caption aligncenter" style="width: 591px"><a href="http://nandeshwar.info/wp-content/uploads/2009/08/dmjobstagcloud.jpg"><img class="size-full wp-image-261 " title="Data Mining Jobs Tag Cloud" src="http://nandeshwar.info/wp-content/uploads/2009/08/dmjobstagcloud.jpg" alt="Data Mining Jobs Tag Cloud" width="581" height="249" /></a><p class="wp-caption-text">Data Mining Jobs Tag Cloud</p></div>
<p><!--adsensestart--><br />
The most frequent word is: experience. Companies want people with experience in different data mining techniques. You&#8217;ll see that some other big words are: SAS (stemmed as sa), Excel, SQL, analytical skills, statistics, and quantitative skills.</p>
<p>And how do you master these skills, you ask?</p>
<ol>
<li>Get a graduate degree in statistics, economics, mathematics, computer science, financial engineering, or industrial engineering with emphasis on databases, data mining, and marketing.</li>
<li>Successfully complete data mining projects using free, open-source data mining tools, such as Weka, R, Orange, Rapid-Miner.</li>
<li>Participate in data mining competitions. SAS&#8217;s data mining conference has a data mining competition every year.</li>
</ol>
<p>Have a look at a detailed study by Pejic Bach, M: Creating profile of data mining specialist</p>
<div class='bookmarkify'><a name='bookmarkify'></a><div class='title' title='Use these links to share this page with others'>Share</div><div class='linkbuttons'><a href='http://www.citeulike.org/posturl?url=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Save to CiteULike' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/citeulike.png' style='width:16px; height:16px;' alt='[CiteULike] ' /></a> <a href='http://del.icio.us/post?url=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Save to del.icio.us' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/delicious.png' style='width:16px; height:16px;' alt='[del.icio.us] ' /></a> <a href='http://digg.com/submit?phase=2&amp;url=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Digg It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/digg.png' style='width:16px; height:16px;' alt='[Digg] ' /></a> <a href='http://www.facebook.com/share.php?u=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/' title='Save to Facebook' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/facebook.png' style='width:16px; height:16px;' alt='[Facebook] ' /></a> <a href='http://www.furl.net/storeIt.jsp?u=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;t=Tag Cloud of Data Mining Jobs' title='Save to Furl' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/furl.png' style='width:16px; height:16px;' alt='[Furl] ' /></a> <a href='http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Save to Google Bookmarks' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/google.png' style='width:16px; height:16px;' alt='[Google] ' /></a> <a href='http://reddit.com/submit?url=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Reddit' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/reddit.png' style='width:16px; height:16px;' alt='[Reddit] ' /></a> <a href='http://slashdot.org/bookmark.pl?url=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Slashdot It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/slashdot.png' style='width:16px; height:16px;' alt='[Slashdot] ' /></a> <a href='http://www.stumbleupon.com/submit?url=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;title=Tag Cloud of Data Mining Jobs' title='Stumble It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/stumbleupon.png' style='width:16px; height:16px;' alt='[StumbleUpon] ' /></a> <a href='http://technorati.com/faves?add=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/' title='Add to my Technorati Favorites' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/technorati.png' style='width:16px; height:16px;' alt='[Technorati] ' /></a> <a href='http://twitter.com/home/?status=Tag Cloud of Data Mining Jobs+http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/' title='Save to Twitter' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/twitter.png' style='width:16px; height:16px;' alt='[Twitter] ' /></a> <a href='http://www.feedburner.com/fb/a/emailFlare?itemTitle=Tag Cloud of Data Mining Jobs&amp;uri=http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/&amp;loc=en_US' title='Email this to a friend' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/email.png' style='width:16px; height:16px;' alt='[Email] ' /></a>  <a title='See more bookmark and sharing options...' href='http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/#bookmarkify' rel='nofollow'><small>More&nbsp;&raquo;</small></a></div><div class='brand'><small><a href='http://www.bookmarkify.com/'>Powered by Bookmarkify&trade;</a></small></div></div>]]></content:encoded>
			<wfw:commentRss>http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data-Information Visualization</title>
		<link>http://nandeshwar.info/2009/05/21/data-information-visualization/</link>
		<comments>http://nandeshwar.info/2009/05/21/data-information-visualization/#comments</comments>
		<pubDate>Thu, 21 May 2009 12:34:56 +0000</pubDate>
		<dc:creator>a7n9</dc:creator>
				<category><![CDATA[Books]]></category>
		<category><![CDATA[charts]]></category>
		<category><![CDATA[dashboard]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://nandeshwar.info/?p=232</guid>
		<description><![CDATA[Ever since I read the book The Visual Display of Quantitative Information, by Edward R. Tufte, I am captivated by the idea of creating good design while doing data analysis or dashboard building. Although Excel 2007 charts are much nicer than its previous births, I have started disliking Excel charts. I am even developing an [...]]]></description>
			<content:encoded><![CDATA[<p>Ever since I read the book The Visual Display of Quantitative Information, by Edward R. Tufte, I am captivated by the idea of creating good design while doing data analysis or dashboard building. Although Excel 2007 charts are much nicer than its previous births, I have started disliking Excel charts. I am even developing an eye for picking out the bad information pixels. Apart from Tufte&#8217;s books, these books have helped me immensely:</p>
<ul>
<li>The Elements of Graphing Data, by William S. Cleveland</li>
<li>Information Dashboard Design, by Stephen Few</li>
</ul>
<p>Administrators/executives neither have the time nor the patience to understand complicated data mining algorithms and its results, and when they don&#8217;t understand them most probably they will never go in &#8220;production.&#8221; Simple, yet informative, designs and charts have better chances of going in production, which I am sure every data miner longs for.</p>
<p>I found a course web-site on Information Visualization: <a href="http://www.stat.auckland.ac.nz/~ihaka/120/lectures.html">http://www.stat.auckland.ac.nz/~ihaka/120/lectures.html</a></p>
<div class='bookmarkify'><a name='bookmarkify'></a><div class='title' title='Use these links to share this page with others'>Share</div><div class='linkbuttons'><a href='http://www.citeulike.org/posturl?url=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Save to CiteULike' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/citeulike.png' style='width:16px; height:16px;' alt='[CiteULike] ' /></a> <a href='http://del.icio.us/post?url=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Save to del.icio.us' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/delicious.png' style='width:16px; height:16px;' alt='[del.icio.us] ' /></a> <a href='http://digg.com/submit?phase=2&amp;url=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Digg It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/digg.png' style='width:16px; height:16px;' alt='[Digg] ' /></a> <a href='http://www.facebook.com/share.php?u=http://nandeshwar.info/2009/05/21/data-information-visualization/' title='Save to Facebook' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/facebook.png' style='width:16px; height:16px;' alt='[Facebook] ' /></a> <a href='http://www.furl.net/storeIt.jsp?u=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;t=Data-Information Visualization' title='Save to Furl' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/furl.png' style='width:16px; height:16px;' alt='[Furl] ' /></a> <a href='http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Save to Google Bookmarks' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/google.png' style='width:16px; height:16px;' alt='[Google] ' /></a> <a href='http://reddit.com/submit?url=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Reddit' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/reddit.png' style='width:16px; height:16px;' alt='[Reddit] ' /></a> <a href='http://slashdot.org/bookmark.pl?url=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Slashdot It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/slashdot.png' style='width:16px; height:16px;' alt='[Slashdot] ' /></a> <a href='http://www.stumbleupon.com/submit?url=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;title=Data-Information Visualization' title='Stumble It!' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/stumbleupon.png' style='width:16px; height:16px;' alt='[StumbleUpon] ' /></a> <a href='http://technorati.com/faves?add=http://nandeshwar.info/2009/05/21/data-information-visualization/' title='Add to my Technorati Favorites' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/technorati.png' style='width:16px; height:16px;' alt='[Technorati] ' /></a> <a href='http://twitter.com/home/?status=Data-Information Visualization+http://nandeshwar.info/2009/05/21/data-information-visualization/' title='Save to Twitter' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/twitter.png' style='width:16px; height:16px;' alt='[Twitter] ' /></a> <a href='http://www.feedburner.com/fb/a/emailFlare?itemTitle=Data-Information Visualization&amp;uri=http://nandeshwar.info/2009/05/21/data-information-visualization/&amp;loc=en_US' title='Email this to a friend' onclick='target="_blank";' rel='nofollow'><img src='http://nandeshwar.info/wp-content/plugins/bookmarkify/email.png' style='width:16px; height:16px;' alt='[Email] ' /></a>  <a title='See more bookmark and sharing options...' href='http://nandeshwar.info/2009/05/21/data-information-visualization/#bookmarkify' rel='nofollow'><small>More&nbsp;&raquo;</small></a></div><div class='brand'><small><a href='http://www.bookmarkify.com/'>Powered by Bookmarkify&trade;</a></small></div></div>]]></content:encoded>
			<wfw:commentRss>http://nandeshwar.info/2009/05/21/data-information-visualization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
