MapReduce is a programming model for processing large data sets, and the name of an implementation of the model by Google. MapReduce is typically used to do distributed computing on clusters of computers.
Mapping :- Apply the logic(do the calculation) on subsets of the target data stored in one or more data nodes in a cluster.
Reducing :- This is a technique use to generate the final output by using the outputs of Mapping.
The simple idea behind this concept is, process the data at where the data located to provide more processing power to our databases.
No comments:
Post a Comment