Map Reduce

Data locality is the ability to move the computation closer to the machines where the data actually resides, rather than moving the data to the computation.

Map Reduce is a framework used mostly to process very large data sets in parallel across many processes. It consists of a map part, which distributes the computation across processes, and a reduce part, which aggregates the results from those distributed processes.
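To make the two parts concrete, here is a minimal single-machine sketch of a word count. The names `map_phase`, `reduce_phase`, and `word_count` are made up for this illustration and do not come from any particular framework; a real system would run the map calls on separate processes or servers.

```python
from collections import Counter
from functools import reduce


def map_phase(chunk: str) -> Counter:
    """Map: count the words in one chunk, independently of all other chunks."""
    return Counter(chunk.lower().split())


def reduce_phase(left: Counter, right: Counter) -> Counter:
    """Reduce: merge two partial counts into one."""
    return left + right


def word_count(chunks: list[str]) -> Counter:
    # Each map_phase call only needs its own chunk, so in a real
    # deployment these calls could run on different processes.
    partials = [map_phase(chunk) for chunk in chunks]
    # Aggregate the partial results into a single answer.
    return reduce(reduce_phase, partials, Counter())


chunks = ["the quick brown fox", "the lazy dog", "the fox"]
totals = word_count(chunks)
```

Because no map call depends on another, the chunks can be processed in any order, which is what makes the work easy to distribute.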

So a Map Reduce job performs the work simultaneously across all processes by dividing it among the servers, using the combined computing capacity of the distributed servers to reduce the computation time.
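The idea of dividing the work can be sketched as follows. This is only an illustration under stated assumptions: a thread pool stands in for the distributed servers, and `split`, `partial_sum_of_squares`, and `parallel_sum_of_squares` are hypothetical helper names. In a real cluster each slice would be sent to a separate process or machine.

```python
from concurrent.futures import ThreadPoolExecutor


def split(data, n_workers):
    """Divide data into at most n_workers slices of roughly equal size."""
    size = max(1, -(-len(data) // n_workers))  # ceiling division, at least 1
    return [data[i:i + size] for i in range(0, len(data), size)]


def partial_sum_of_squares(part):
    """Map step: the work one worker does on its own slice."""
    return sum(x * x for x in part)


def parallel_sum_of_squares(data, n_workers=4):
    parts = split(data, n_workers)
    # Assumption: threads stand in for servers here. They only
    # demonstrate the shape of the computation, not real speedup.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial_sum_of_squares, parts))
    # Reduce step: combine the partial results.
    return sum(partials)


result = parallel_sum_of_squares(list(range(10)))
```

Here `result` is 285, the same value a single loop over the whole list would give; the only difference is that each slice was handled by its own worker.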