This is a manageable chunk (the unit of processing) of an input file read from HDFS. Each split is assigned to exactly one map task, which guarantees that the MR job processes the entire input file. The splitting mechanism depends on the input file format (4 such formats exist). The default format is Text Input Format. In Text Input Format, splitting is done on the newline character ('\n').
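To make the idea concrete, here is a minimal sketch in plain Python that simulates newline-based splitting. It is not Hadoop code: the function names `make_splits` and `read_records` and the `split_size` parameter are hypothetical, chosen only for illustration. The sketch assumes that splits are cut as byte ranges, and that a reader whose split does not start at offset 0 skips ahead to the next line start, while every reader finishes the line it is in the middle of. Under those assumptions each line is processed by exactly one "map task".

```python
def make_splits(data: bytes, split_size: int):
    """Cut the input into (start, length) byte ranges of at most split_size bytes."""
    return [(start, min(split_size, len(data) - start))
            for start in range(0, len(data), split_size)]


def read_records(data: bytes, start: int, length: int):
    """Return the complete lines that belong to the split (start, length)."""
    end = start + length
    pos = start
    if start != 0:
        # Assumption: a split that does not begin the file discards its first
        # line, because the previous split is responsible for reading it.
        newline = data.find(b"\n", start)
        if newline == -1:
            return []
        pos = newline + 1
    records = []
    # A line that starts at or before the split end belongs to this split,
    # even if reading it runs past the end of the byte range.
    while pos <= end and pos < len(data):
        newline = data.find(b"\n", pos)
        if newline == -1:
            newline = len(data)  # last line without a trailing newline
        records.append(data[pos:newline].decode())
        pos = newline + 1
    return records


if __name__ == "__main__":
    text = b"alpha\nbravo\ncharlie\ndelta\n"
    for start, length in make_splits(text, split_size=8):
        print((start, length), read_records(text, start, length))
```

Running the script prints one line per split, showing which records each simulated map task would receive. Some splits may receive no records at all when a long line has already been consumed by the previous split.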
Also, the splitting of input files depends on the following properties: