Forum

This topic contains 0 replies, has 1 voice, and was last updated by  srikanth 1 year, 6 months ago.

Viewing 1 post (of 1 total)
  • Author
    Posts
  • #1986 Reply

    srikanth
    Participant

    Partitioning: It is the process of determining which reducer instance will receive which intermediate keys and values. Each mapper must determine for all of its output (key, value) pairs which reducer will receive them. It is necessary that for any key, regardless of which mapper instance generated it, the destination partition is the same.

    Shuffle: After the first map tasks have completed, the nodes may still be performing several more map tasks each. But they also begin exchanging the intermediate outputs from the map tasks to where they are required by the reducers. This process of moving map outputs to the reducers is known as shuffling.

    Sort: Each reduce task is responsible for reducing the values associated with several intermediate keys. The set of intermediate keys on a single node is automatically sorted by Hadoop before they are presented to the Reducer.

Viewing 1 post (of 1 total)
Reply To: WHAT IS SHUFFLING AND SORTING IN MAPREDUCE
Your information:




cf22

Your Name (required)

Your Email (required)

Subject

Phone No

Your Message

Cart

  • No products in the cart.