<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nandeshwar.info &#187; data mining</title>
	<atom:link href="http://nandeshwar.info/tag/data-mining/feed/" rel="self" type="application/rss+xml" />
	<link>http://nandeshwar.info</link>
	<description></description>
	<lastBuildDate>Mon, 12 Dec 2011 21:33:19 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Free Certificate in Data Mining/Analytics</title>
		<link>http://nandeshwar.info/2011/02/01/free-certificate-in-data-mininganalytics/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=free-certificate-in-data-mininganalytics</link>
		<comments>http://nandeshwar.info/2011/02/01/free-certificate-in-data-mininganalytics/#comments</comments>
		<pubDate>Wed, 02 Feb 2011 02:40:39 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Books]]></category>
		<category><![CDATA[Data mining]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[courses]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[free]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://nandeshwar.info/?p=328</guid>
		<description><![CDATA[Analytics or data science has following components: data mining/machine learning/statistics data visualization database management programming There are some free online courses that cover many of these areas, and these courses are usually part of a degree or a certificate program in data mining. Those who are new or interested in this field can learn a [...]]]></description>
			<content:encoded><![CDATA[<p>Analytics or data science has following components:</p>
<ul>
<li>data mining/machine learning/statistics</li>
<li>data visualization</li>
<li>database management</li>
<li>programming</li>
</ul>
<p>There are some free online courses that cover many of these areas, and these courses are usually part of a degree or a certificate program in data mining. Those who are new or interested in this field can learn a whole lot without paying a dime. Here is the list:</p>
<ul>
<li><a href="http://oli.web.cmu.edu/openlearning/forstudents/freecourses/statistics">Intro to Probability and Statistics</a> (Carnegie Mellon)</li>
<li><a href="http://machinelearning2010fall.pbworks.com/w/page/30032895/FrontPage">Machine Learning 101/102</a></li>
<li><a href="http://web.mit.edu/govdata/#Materials">GovData</a> (MIT/Harvard)</li>
<li><a href="http://www.stat.auckland.ac.nz/~ihaka/120/">STATS 120: Information Visualisation</a> (The University of Auckland)</li>
<li><a href="http://scc.stat.ucla.edu/mini-courses/">R Programming</a> (UCLA)</li>
<li><a href="http://see.stanford.edu/see/materials/aimlcs229/handouts.aspx">CS 229: Machine Learning</a> (Stanford) (<a href="http://www.youtube.com/view_play_list?p=A89DCFA6ADACE599">videos</a>)</li>
<li><a href="http://www9.georgetown.edu/faculty/mad87/06/420/syllabus.html">Linguistics  420: Statistical Natural Language Processing</a> (Georgetown)</li>
<li><a href="https://open.umich.edu/education/si/si508/fall2008">SI 508: Networks: Theory and Application </a>(University of Michigan)</li>
<li><a href="http://www.csee.wvu.edu/~timm/cs591o/old/">CS 591: Data Mining</a> (West Virginia University)</li>
<li><a href="http://www.stat.auckland.ac.nz/~dscott/782/index.php">STATS 782: Computing for Statisticians</a> (The University of Auckland)</li>
<li><a href="http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-867-machine-learning-fall-2006/index.htm">6.867: Machine Learning</a> (MIT)</li>
<li><a href="http://www.autonlab.org/tutorials/list.html">Andrew Moore&#8217;s Slides on Statistical Data Mining Tutorials</a></li>
<li><a href="http://dataminingtools.net/browsetutorials.php?tag=rdmt">Lots of tutorials</a> (Data Mining Tools)</li>
<li>Capstone project:  <a href="http://www.kaggle.com/index.php?option=com_taskmaster&amp;view=findcompetition&amp;viewtype=results">kaggle</a> or <a href="http://www.sigkdd.org/kddcup/index.php">kdd</a> (for a bigger list see <a href="http://www.kdnuggets.com/datasets/competitions.html">kdnuggets</a>)</li>
</ul>
<p>Some free text books:</p>
<ul>
<li><a href="http://www.stanford.edu/~hastie/Papers/ESLII.pdf">The Elements of Statistical Learning by Hastie, Tibshirani, and Friedman</a></li>
<li><a href="http://infolab.stanford.edu/~ullman/mmds/book.pdf">Mining of Massive Datasets by Rajaraman and Ullman</a></li>
</ul>
<p>In addition, there is an excellent thread on quora on <a href="http://www.quora.com/How-do-I-become-a-data-scientist">how to become a data scientist</a> that covers lot of things and is a very good resource on the practice of analytics.</p>
]]></content:encoded>
			<wfw:commentRss>http://nandeshwar.info/2011/02/01/free-certificate-in-data-mininganalytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tag Cloud of Data Mining Jobs</title>
		<link>http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=tag-cloud-of-data-mining-jobs</link>
		<comments>http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/#comments</comments>
		<pubDate>Thu, 20 Aug 2009 14:13:48 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[pipes]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[stemming]]></category>
		<category><![CDATA[tag cloud]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://nandeshwar.info/?p=260</guid>
		<description><![CDATA[Here&#8217;s what I did to get a cool looking tag cloud of data mining jobs: Used Yahoo Pipes (I created mine, but this one has more feeds)&#8211; this pipe aggregates feeds from different job web-sites, and gives the user unique job listing that you can subscribe via RSS: Job Feed Aggregator by Sean Dolan Subscribed [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s what I did to get a cool looking tag cloud of data mining jobs:</p>
<ol>
<li>Used Yahoo Pipes (I created mine, but this one has more feeds)&#8211; this pipe aggregates feeds from different job web-sites, and gives the user unique job listing that you can subscribe via RSS:  <a href="http://pipes.yahoo.com/pipes/pipe.info?_id=50bf0b7cbcf40213deb98f1314dedf51">Job Feed Aggregator by Sean Dolan </a></li>
<li>Subscribed to the RSS feed for the keyword &#8220;data mining&#8221;</li>
<li>Copied the job descriptions and requirements of many jobs, and saved the text file</li>
<li>Got the <a href="http://tartarus.org/~martin/PorterStemmer/index-old.html">python stemmer </a></li>
<li>Applied the python stemmer to the text file. Stemmer truncates words to their roots, so that we can combine variants of a word into a single word. (First or second step in text mining)</li>
<li>Created a tag cloud using the services of <a href="http://www.wordle.net/">http://www.wordle.net/</a> . They use &#8220;stop words,&#8221; so I didn&#8217;t have to apply those. Stop words are common words, which necessarily don&#8217;t add any value for categorization, of a language.</li>
</ol>
<div id="attachment_261" class="wp-caption aligncenter" style="width: 591px"><a href="http://nandeshwar.info/wp-content/uploads/2009/08/dmjobstagcloud.jpg"><img class="size-full wp-image-261 " title="Data Mining Jobs Tag Cloud" src="http://nandeshwar.info/wp-content/uploads/2009/08/dmjobstagcloud.jpg" alt="Data Mining Jobs Tag Cloud" width="581" height="249" /></a><p class="wp-caption-text">Data Mining Jobs Tag Cloud</p></div>
<p><!--adsensestart--><br />
The most frequent word is: experience. Companies want people with experience in different data mining techniques. You&#8217;ll see that some other big words are: SAS (stemmed as sa), Excel, SQL, analytical skills, statistics, and quantitative skills.</p>
<p>And how do you master these skills, you ask?</p>
<ol>
<li>Get a graduate degree in statistics, economics, mathematics, computer science, financial engineering, or industrial engineering with emphasis on databases, data mining, and marketing.</li>
<li>Successfully complete data mining projects using free, open-source data mining tools, such as Weka, R, Orange, Rapid-Miner.</li>
<li>Participate in data mining competitions. SAS&#8217;s data mining conference has a data mining competition every year.</li>
</ol>
<p>Have a look at a detailed study by Pejic Bach, M: Creating profile of data mining specialist</p>
]]></content:encoded>
			<wfw:commentRss>http://nandeshwar.info/2009/08/20/tag-cloud-of-data-mining-jobs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

