How can I learn about data mining with Hadoop?

rashidacse

« on: November 28, 2015, 12:18:14 PM »
Honestly, you shouldn't be looking to learn Data Mining 'with Hadoop' specifically. Data Mining is a broad subject, while Hadoop is a software framework that lets you use distributed computing to run algorithms (most commonly Data Mining algorithms) over massive data sets, something a single computer or server could not handle. So, assuming you want to learn the subject of Data Mining first and then move on to using Hadoop, here are a few guidelines:

First, you'll need to know the underlying theories and concepts of Data Mining. I would suggest the book Introduction to Data Mining.
It is used by many universities for their introductory Data Mining courses and is well structured for beginners to follow, too.
You can supplement this with one of the many open courses online. The following two should be good places to start (I haven't done them myself, but want to):
1) https://www.edx.org/course/calte...

2) https://www.coursera.org/course/...

Second, assuming you've gone through these resources, a very handy place to learn not just about Hadoop but about all the other tools, frameworks and software a data scientist uses is this brilliant course by Jeff Leek and co. at Johns Hopkins - https://class.coursera.org/datas...
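
To make the earlier point about Hadoop being a framework for distributing work a bit more concrete, here is a rough sketch of the map / shuffle / reduce pattern that Hadoop's MapReduce model is built around. It runs on one machine in plain Python and does not use Hadoop at all. The function names (map_phase, shuffle, reduce_phase, item_support) and the sample transactions are made up for illustration. The task, counting how many transactions contain each item, is a typical first step in frequent itemset mining.

```python
from collections import defaultdict
from itertools import chain


def map_phase(transaction):
    """Mapper: emit (item, 1) once for every distinct item in a transaction."""
    return [(item, 1) for item in set(transaction)]


def shuffle(pairs):
    """Group values by key, the job a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: add up the counts collected for one item."""
    return key, sum(values)


def item_support(transactions):
    """Run map, shuffle and reduce in sequence on a single machine."""
    mapped = chain.from_iterable(map_phase(t) for t in transactions)
    return dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())


# Hypothetical toy data; a real job would read far larger input from disk.
transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer"],
    ["milk", "diapers", "beer"],
    ["bread", "milk", "diapers"],
]
support = item_support(transactions)
print(support)
```

On a real cluster the mapper and reducer logic would stay roughly this simple; the framework takes care of splitting the input across machines and moving the grouped pairs between them, which is why it pays to understand the algorithm first and the tooling second.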