We usually build programs on open-source MapReduce frameworks such as Hadoop, Apache Pig, Apache Hive, and Spark to solve Big Data problems. In this post, I will use an example to describe what MapReduce is and how it works. I hope this makes it easier for you to learn Big Data technologies such as Hadoop, Pig, Hive, and Spark.
What is MapReduce?
MapReduce is one of the key ideas behind Big Data. It was introduced by Google, and it is at the heart of Hadoop.
It is a programming paradigm that allows engineers and scientists to build scalable systems that can run on hundreds or thousands of servers.
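To make the paradigm a bit more concrete, here is a minimal word-count sketch in plain Python. It runs on a single machine and is not Hadoop's API; the names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are made up for this illustration. It only shows the shape of the idea: a map step emits key/value pairs, the pairs are grouped by key, and a reduce step combines the values for each key.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key (a real framework does this between phases)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values seen for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # Hypothetical driver: in a real cluster, the map and reduce calls
    # would be spread across many servers instead of run in one loop.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big ideas"]))
```

Each call to `map_phase` depends only on its own line, and each call to `reduce_phase` depends only on its own key, so the work can be split across many machines without the pieces needing to talk to each other.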