Job title: Big Data -Hadoop Developer + Storm
25-Oct-2018
Pune
15-Nov-2018
Job Responsibilities :
  • Loading data from different datasets and deciding on which file format is efficient for a task.
  • Understanding the requirements of input to output transformations.
  • Cleaning data as per business requirements using streaming API’s or user defined functions.
  • Defining Hadoop Job Flows.
  • Build distributed, reliable and scalable data pipelines to ingest and process data in real-time. With fetching impression streams, transaction behaviours, clickstream data and other unstructured data.
  • Managing Hadoop jobs using scheduler.
  • Reviewing and managing hadoop log files.
  • Design and implement column family schemas of Hive and HBase within HDFS,Assign schemas and create Hive tables.
  • Develop efficient pig and hive scripts with joins on datasets using various techniques.
  • Assess the quality of datasets for a hadoop data lake.
  • Apply different HDFS formats and structure
  • 0

    1. 0 ratings
    2. 0 reviews
  • 40% Complete (success)
  • 40% Complete (success)
  • 40% Complete (success)
  • 40% Complete (success)
  • 40% Complete (success)
 
Write Review