Hadoop Basic PIG Commands with Examples
This tutorial lists the basic Pig commands in Hadoop, with examples aimed at beginners. The commands below were originally put together by trainers at a Big Data training institute. They follow a typical Pig workflow: create the input files, move them into HDFS, load them into Pig, and then transform them with operators such as FOREACH, FILTER, ORDER BY, GROUP, JOIN, DISTINCT, and LIMIT.
Basic PIG commands in Hadoop
- Create the following input files (text format)
Database 1: studb
Database 2: empdb
Database 3: detaildb
Database 4: stu2db
Move the created files from the local file system (LFS) to HDFS
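The copy from the local file system to HDFS can also be done from inside the Grunt shell with its `fs` command, which passes its arguments to the Hadoop file system shell. The sketch below is only an illustration: the directory `/user/hypothetical/pigdata` and the `.txt` file names are assumptions, not paths taken from this tutorial.

```pig
-- Hypothetical paths and file names: adjust them to your own setup.
-- Create a target directory in HDFS (the -p flag also creates missing parents).
fs -mkdir -p /user/hypothetical/pigdata
-- Copy two of the local input files into that directory.
fs -put studb.txt /user/hypothetical/pigdata/studb.txt
fs -put empdb.txt /user/hypothetical/pigdata/empdb.txt
-- List the directory to confirm that the files arrived.
fs -ls /user/hypothetical/pigdata
```

The same lines can be typed one by one at the grunt> prompt.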
Run the pig command from the console (cluster mode: pig -x mapreduce)
Data input using Pig: Load data from HDFS into Pig
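A minimal sketch of a LOAD statement follows. It assumes, purely for illustration, that the employee file is comma-separated with three columns (id, name, salary) and sits at a hypothetical HDFS path; the real layout of the input files is not shown in this tutorial.

```pig
-- Assumed layout (hypothetical): one employee per line, e.g. id,name,salary
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')                        -- field delimiter
      AS (id:int, name:chararray, salary:double);  -- schema: field names and types

-- Print the schema that Pig has attached to the relation.
DESCRIBE Emp;
```

LOAD on its own does not read any data. Pig only runs a job when a later statement such as DUMP or STORE asks for output.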
Dump: This command displays the full contents of a relation on the console.
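As a small illustration, the sketch below loads a relation and dumps it. The path and the three-column schema are assumptions made for the example.

```pig
-- Hypothetical path and schema, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);

-- DUMP triggers execution and prints every tuple of Emp to the console.
DUMP Emp;
```

Because DUMP prints the whole relation, it is best suited to small test data.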
Foreach: This command transforms data column by column, generating a new relation from the fields of an existing one.
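Besides selecting columns, FOREACH ... GENERATE can compute new ones. The sketch below derives a bonus column from salary. The path, the schema, and the 10% rate are all hypothetical values chosen for the example.

```pig
-- Hypothetical path and schema, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);

-- Keep id and name, and add a derived column named bonus (10% of salary).
Emp_bonus = FOREACH Emp GENERATE id, name, salary * 0.10 AS bonus;

DUMP Emp_bonus;
```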
Filter: Select particular tuples from a relation based on a condition.
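A filter condition is a boolean expression over the fields of each tuple. The following sketch keeps only the employees at or above a salary threshold. The path, the schema, and the threshold of 50000 are assumptions for the example.

```pig
-- Hypothetical path and schema, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);

-- Keep tuples whose salary is at least 50000 and whose name is present.
Emp_high = FILTER Emp BY salary >= 50000.0 AND name IS NOT NULL;

DUMP Emp_high;
```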
Order By: Sort a relation based on one or more fields.
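The sketch below sorts on two fields, with a direction given for each. As in the other examples, the path and schema are assumed rather than taken from the tutorial's input files.

```pig
-- Hypothetical path and schema, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);

-- Highest salary first; ties are broken alphabetically by name.
Emp_sorted = ORDER Emp BY salary DESC, name ASC;

DUMP Emp_sorted;
```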
Store: Save results to the local file system or HDFS
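A minimal STORE sketch follows. The input path, the output directory, and the pipe delimiter are hypothetical choices. The output location is a directory that Pig creates, so it should not exist before the script runs.

```pig
-- Hypothetical path and schema, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);

-- Write the relation to a new output directory, using '|' as the field delimiter.
STORE Emp INTO '/user/hypothetical/pigdata/out/emp_pipe'
      USING PigStorage('|');
```

In cluster mode the path refers to HDFS. In local mode (pig -x local) the same statement writes to the local file system.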
Cogroup: This operator groups two or more relations on a common column.
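As a rough sketch, the statement below cogroups two relations on their id column. Both paths, both schemas, and the custdb.txt file name are assumptions made for the example.

```pig
-- Hypothetical paths and schemas, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);
Customer = LOAD '/user/hypothetical/pigdata/custdb.txt'
      USING PigStorage(',')
      AS (id:int, city:chararray);

-- One output tuple per distinct id: (group, {matching Emp tuples}, {matching Customer tuples})
Emp_cust = COGROUP Emp BY id, Customer BY id;

DESCRIBE Emp_cust;
DUMP Emp_cust;
```

Unlike JOIN, COGROUP keeps the tuples of each input in a separate bag, and an id found in only one relation still appears, with an empty bag for the other.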
Pig commands in Hadoop with examples
Join: This operator is used to join two or more tables.
Inner Join: Joins two tables that have a common column.
PIG Commands with Examples
GROUP OPERATOR:
GROUP is the simpler of Pig's two grouping operators (the other is COGROUP). Continuing with the same set of relations:
grunt> Emp_grouped = GROUP Emp BY salary;
grunt> DUMP Emp_grouped;
Group Output:
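A grouped relation is usually followed by an aggregation. The sketch below counts how many employees share each salary value. The path and schema are assumed for illustration, and the names Salary_counts and num_employees are invented for this example.

```pig
-- Hypothetical path and schema, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);

-- Each output tuple is (group, {bag of Emp tuples with that salary}).
Emp_grouped = GROUP Emp BY salary;

-- 'group' holds the grouping key; COUNT(Emp) counts the tuples in each bag.
Salary_counts = FOREACH Emp_grouped GENERATE group AS salary, COUNT(Emp) AS num_employees;

DUMP Salary_counts;
```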
FOREACH OPERATOR:
One of the key uses of Pig is data transformation. You can define a new relation based on the fields of an existing relation using the FOREACH command:
grunt> foreach_Emp = FOREACH Emp GENERATE id,name;
grunt> dump foreach_Emp;
Foreach Output:
JOIN OPERATOR:
JOIN combines tuples from two or more relations that share a key value, much like a join in SQL. For example, let's join Emp and Customer on the first column.
grunt> Emp_self = join Emp by id, Customer by id;
grunt> DUMP Emp_self;
Join Output:
By default, JOIN performs an inner join. Adding the left outer, right outer, or full outer keyword after the first relation's key turns it into the corresponding outer join.
grunt> Empcust_left = join Emp by id left outer, Customer by id;
grunt> dump Empcust_left;
Left Outer Join Output:
grunt> Empcust_right = join Emp by id right outer, Customer by id;
grunt> dump Empcust_right;
Right Outer Join Output:
grunt> Empcust_full = join Emp by id full outer, Customer by id;
grunt> dump Empcust_full;
Full Outer Join Output:
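After a join, every field carries the name of the relation it came from, so the two id columns are referred to as Emp::id and Customer::id. The sketch below shows one way to project a clean result from a joined relation. The paths, schemas, and the Customer city field are assumptions made for the example.

```pig
-- Hypothetical paths and schemas, used only for illustration.
Emp = LOAD '/user/hypothetical/pigdata/empdb.txt'
      USING PigStorage(',')
      AS (id:int, name:chararray, salary:double);
Customer = LOAD '/user/hypothetical/pigdata/custdb.txt'
      USING PigStorage(',')
      AS (id:int, city:chararray);

-- Keep all Emp tuples; Customer fields are null where no match exists.
Empcust_left = JOIN Emp BY id LEFT OUTER, Customer BY id;

-- Disambiguate fields with the relation::field syntax and rename them.
Emp_city = FOREACH Empcust_left GENERATE Emp::id AS id, Emp::name AS name, Customer::city AS city;

DUMP Emp_city;
```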
DISTINCT OPERATOR:
DISTINCT removes duplicate tuples from a relation.
grunt> Emp2_dist = DISTINCT Emp2;
grunt> dump Emp2_dist;
DISTINCT Output:
LIMIT OPERATOR:
LIMIT restricts a relation to at most the given number of tuples.
grunt> limit_data = LIMIT Emp 3;
grunt> dump limit_data;