300x250 AD TOP

Powered by Blogger.

Thursday, 9 January 2014

Tagged under: , ,

Hadoop in Four Minutes

Hadoop

We are a group of four software engineers from Punjab University College of Information Technology, Lahore. When we faced the dilemma of what to do as our capstone project we decided to do a Decision Support System. We took our idea to a number of teachers and slowly grew our knowledge on it. Then we met Shuja-ur-Rehman and asked him to be our project advisor.

He told us about Big Data Analytics and Hadoop platform that was the first time we heard about Hadoop. After a very thorough research on Hadoop and Big Data Analytics we decided to go through with Shuja's idea for the project. During our working/development we faced so many difficulties in configuring and running Hadoop programs that we thought of writing our own blog for absolute beginners.

This is a sister blog for the company Data Molecule that four of us have entrepreneured. The services we provide are Real Time Analysis of huge data sets, Visualization in the form of graphs and Real Time Search. In the subsequent blogs we will write lay man installation and programming tutorials covering the following.


  • MapReduce Jobs
  • Apache Oozie
  • Apache Flume
  • Apache Sqoop
  • Apache HBase
  • Apache Tika
  • Apache Solr
In the next post we will talk about configuring running The Yellow Elephant on a virtual machine.