This topic contains 1 reply, has 2 voices, and was last updated by dipankar000 4 years ago.

Viewing 2 posts - 1 through 2 (of 2 total)


    Question: What replication factor (R.F.) is needed to complete a MapReduce job if the disk size per datanode is 20 and the number of nodes is 40?

    Answer: We have 80 TB of data to store and to run MapReduce on, with the datanode configuration given above.
    Simple formula: disk size per node = total amount of data * R.F. / total no. of nodes
    Disk size per node = 20 (in TB, the same unit as the data), no. of nodes = 40, total amount of data = 80 TB
    Formula rewritten as R.F. = disk size per node * total no. of nodes / total amount of data
    = 20 * 40 / 80
    R.F. = 10
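
The arithmetic above can be checked with a short Python sketch. The function and parameter names are made up for illustration and are not part of any Hadoop API, and the disk size is assumed to be in TB so that it matches the 80 TB of data.

```python
def replication_factor(disk_per_node_tb, num_nodes, total_data_tb):
    """Solve disk size = data * R.F. / nodes for R.F.

    All sizes are assumed to be in the same unit (TB here).
    """
    if total_data_tb <= 0:
        raise ValueError("total_data_tb must be positive")
    # Raw capacity of the cluster: per-node disk times number of nodes.
    raw_capacity_tb = disk_per_node_tb * num_nodes
    # R.F. is how many full copies of the data fit in that capacity.
    return raw_capacity_tb / total_data_tb


rf = replication_factor(disk_per_node_tb=20, num_nodes=40, total_data_tb=80)
print(rf)  # 10.0
```

Under this formula the result is the largest replication factor at which all copies of the data still fit on the cluster, assuming the whole disk on every node is available for storing the data.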



    How did you arrive at 80 TB of data?
