{"id":8915,"date":"2021-05-30T05:41:44","date_gmt":"2021-05-30T05:41:44","guid":{"rendered":"https:\/\/prwatech.in\/blog\/?p=8915"},"modified":"2023-07-13T11:53:24","modified_gmt":"2023-07-13T11:53:24","slug":"working-with-dataproc","status":"publish","type":"post","link":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/","title":{"rendered":"Working with Dataproc"},"content":{"rendered":"\r\n<p><strong>Prerequisites<\/strong><\/p>\r\n\r\n\r\n\r\n<p><a href=\"https:\/\/www.prwatech.com\/course\/gcptraining\" target=\"_blank\" rel=\"noreferrer noopener\" data-type=\"URL\" data-id=\"https:\/\/www.prwatech.com\/course\/gcptraining\">GCP <\/a>account<\/p>\r\n\r\n\r\n\r\n<p>Open Cloud Console.<\/p>\r\n\r\n\r\n\r\n<p>Open Menu &gt; <a href=\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc-cluster-creation\/\" target=\"_blank\" rel=\"noreferrer noopener\" data-type=\"URL\" data-id=\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc-cluster-creation\/\">Dataproc<\/a> &gt; Clusters<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"456\" height=\"339\" class=\"wp-image-8916\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png 456w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281-300x223.png 300w\" sizes=\"auto, (max-width: 456px) 100vw, 456px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Click the Cluster.<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"502\" height=\"154\" class=\"wp-image-8917\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-282.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-282.png 502w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-282-300x92.png 300w\" sizes=\"auto, (max-width: 502px) 100vw, 502px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Click on VM Instances<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"88\" class=\"wp-image-8918\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-283.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-283.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-283-300x42.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Click on SSH of master node<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"572\" height=\"162\" class=\"wp-image-8919\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-284.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-284.png 572w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-284-300x85.png 300w\" sizes=\"auto, (max-width: 572px) 100vw, 572px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Check whether the components is already installed.<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 pyspark\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #opens pyspark<\/p>\r\n\r\n\r\n\r\n<p>To exit press ctrl +d<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"214\" class=\"wp-image-8920\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-285.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-285.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-285-300x102.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hive\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To check hive is available or not<\/p>\r\n\r\n\r\n\r\n<p>To exit press ctrl +d<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"61\" class=\"wp-image-8921\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-286.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-286.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-286-300x29.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 python \u2013V\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #to check python version<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"395\" height=\"66\" class=\"wp-image-8922\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-287.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-287.png 395w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-287-300x50.png 300w\" sizes=\"auto, (max-width: 395px) 100vw, 395px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 spark-shell\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 # opens spark shell<\/p>\r\n\r\n\r\n\r\n<p>To exit press ctrl +d<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"291\" class=\"wp-image-8923\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-288.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-288.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-288-300x139.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 pwd\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To get path<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 mkdir ratingscounter\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #making directory named ratingscounter<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd ratingscounter\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Change the directory into ratingscounter<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"499\" height=\"106\" class=\"wp-image-8924\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-289.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-289.png 499w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-289-300x64.png 300w\" sizes=\"auto, (max-width: 499px) 100vw, 499px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 wget https:\/\/s3.amazonaws.com\/sankethadoop\/u.data\u00a0 \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To get the data for dataproc<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 ls\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Display the contents in the directory<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"196\" class=\"wp-image-8925\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-290.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-290.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-290-300x94.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano u.data\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #open the u.data file.<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"573\" height=\"70\" class=\"wp-image-8926\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-291.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-291.png 573w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-291-300x37.png 300w\" sizes=\"auto, (max-width: 573px) 100vw, 573px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>It will display the content in u.data<\/p>\r\n\r\n\r\n\r\n<p>To exit press ctrl + x<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"336\" height=\"200\" class=\"wp-image-8927\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-292.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-292.png 336w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-292-300x179.png 300w\" sizes=\"auto, (max-width: 336px) 100vw, 336px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano ratingscounter.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Creates and opens file ratingscounter.py<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"49\" class=\"wp-image-8928\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-293.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-293.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-293-300x23.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Paste the below code into ratingscounter.py file<\/p>\r\n\r\n\r\n\r\n<p>from pyspark import SparkConf, SparkContext<\/p>\r\n\r\n\r\n\r\n<p>import collections<\/p>\r\n\r\n\r\n\r\n<p>conf = SparkConf().setMaster(&#8220;local&#8221;).setAppName(&#8220;Ratings&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>sc = SparkContext(conf = conf)<\/p>\r\n\r\n\r\n\r\n<p>lines = sc.textFile(&#8220;sparkdata\/u.data&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>ratings = lines.map(lambda x: x.split( )[2])<\/p>\r\n\r\n\r\n\r\n<p>result = ratings.countByValue()<\/p>\r\n\r\n\r\n\r\n<p>sortedResults = collections.OrderedDict(sorted(result.items()))<\/p>\r\n\r\n\r\n\r\n<p>for key, value in sortedResults.items():<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0 print(&#8220;%s %i&#8221; % (key, value))<\/p>\r\n\r\n\r\n\r\n<p>This code is to count the films in each ratings.<\/p>\r\n\r\n\r\n\r\n<p>NB : if you are changing the name of directory or file, you may have to change it in the file lolcation also.<\/p>\r\n\r\n\r\n\r\n<p>To exit press \u2018ctrl + x\u2019 then press \u2018y\u2019 to confirm then \u2018Enter\u2019<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"620\" height=\"286\" class=\"wp-image-8929\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-294.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-294.png 620w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-294-300x138.png 300w\" sizes=\"auto, (max-width: 620px) 100vw, 620px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Create Schema structure<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -mkdir \/user\/&lt;userid&gt;\/sparkdata\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To create directory named sparkdata<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -put u.data sparkdata\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To copy u.data file into sparkdata<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -ls sparkdata\u00a0\u00a0 \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 \u00a0# to check the file is saved or not<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"721\" height=\"99\" class=\"wp-image-8930\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-295.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-295.png 721w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-295-300x41.png 300w\" sizes=\"auto, (max-width: 721px) 100vw, 721px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 spark-submit ratingscounter.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #execute the ratingscounter.py file<\/p>\r\n\r\n\r\n\r\n<p>It will display the result.<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"216\" class=\"wp-image-8932\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-297.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-297.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-297-300x103.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #change directory<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 mkdir totalspendbycustomer\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #make directory<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd totalspendbycustomer\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #change directory to totalspendbycustomer<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"559\" height=\"89\" class=\"wp-image-8933\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-298.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-298.png 559w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-298-300x48.png 300w\" sizes=\"auto, (max-width: 559px) 100vw, 559px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 wget https:\/\/s3.amazonaws.com\/sankethadoop\/customer-orders.csv \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To copy file to disk<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"692\" height=\"206\" class=\"wp-image-8934\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-299.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-299.png 692w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-299-300x89.png 300w\" sizes=\"auto, (max-width: 692px) 100vw, 692px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 ls\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #list the contents in the directory<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano customer-orders.csv\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Open the file customer-orders.csv<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"62\" class=\"wp-image-8935\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-300.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-300.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-300-300x30.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>Opens the file.<\/p>\r\n\r\n\r\n\r\n<p>To exit press ctrl +x<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"206\" height=\"193\" class=\"wp-image-8936\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-301.png\" alt=\"\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 pwd\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Displays the path<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"544\" height=\"68\" class=\"wp-image-8937\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-302.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-302.png 544w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-302-300x38.png 300w\" sizes=\"auto, (max-width: 544px) 100vw, 544px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano totalspendbycustomer.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #creates and opens file totalspendbycustomer.py<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"34\" class=\"wp-image-8938\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-303.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-303.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-303-300x16.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>paste the below code<\/p>\r\n\r\n\r\n\r\n<p>from pyspark import SparkConf, SparkContext<\/p>\r\n\r\n\r\n\r\n<p>conf = SparkConf().setMaster(&#8220;local&#8221;).setAppName(&#8220;SpendByCustomer&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>sc = SparkContext(conf = conf)<\/p>\r\n\r\n\r\n\r\n<p>def extractCustomerPricePairs(line):<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0 fields = line.split(&#8216;,&#8217;)<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0 return (int(fields[0]), float(fields[2]))<\/p>\r\n\r\n\r\n\r\n<p>input = sc.textFile(&#8220;sparkdata\/customer-orders.csv&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>mappedInput = input.map(extractCustomerPricePairs)<\/p>\r\n\r\n\r\n\r\n<p>totalByCustomer = mappedInput.reduceByKey(lambda x, y: x + y)<\/p>\r\n\r\n\r\n\r\n<p>results = totalByCustomer.collect();<\/p>\r\n\r\n\r\n\r\n<p>for result in results:<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0 print(result)<\/p>\r\n\r\n\r\n\r\n<p>This code is to get the amount spent by the customers for movie<\/p>\r\n\r\n\r\n\r\n<p>NB : if you are changing the name of directory or file, you may have to change it in the file lolcation also.<\/p>\r\n\r\n\r\n\r\n<p>To exit press \u2018ctrl + x\u2019 then press \u2018y\u2019 to confirm then \u2018Enter\u2019<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"372\" class=\"wp-image-8939\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-304.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-304.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-304-300x178.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -put customer-orders.csv sparkdata\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #moves file customer-orders.csv into sparkdata<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -ls sparkdata\u00a0\u00a0\u00a0\u00a0 \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 # to check the file is saved or not<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 spark-submit totalspendbycustomer.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Execute the file totalspendbycustomer.py<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"628\" height=\"83\" class=\"wp-image-8940\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-305.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-305.png 628w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-305-300x40.png 300w\" sizes=\"auto, (max-width: 628px) 100vw, 628px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>It will display the customer ID and total amount spend by customer<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"312\" height=\"299\" class=\"wp-image-8941\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-306.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-306.png 312w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-306-300x288.png 300w\" sizes=\"auto, (max-width: 312px) 100vw, 312px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>To find popular movies<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To Change Directory<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 mkdir popularmovies\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Make directory named popularmovies<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd popularmovies\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To change directory into popularmovies<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"498\" height=\"71\" class=\"wp-image-8942\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-307.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-307.png 498w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-307-300x43.png 300w\" sizes=\"auto, (max-width: 498px) 100vw, 498px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 wget https:\/\/s3.amazonaws.com\/sankethadoop\/u.data \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To copy file to disk<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"703\" height=\"188\" class=\"wp-image-8943\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-308.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-308.png 703w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-308-300x80.png 300w\" sizes=\"auto, (max-width: 703px) 100vw, 703px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano popularmovies.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Open file popularmovies.py<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"662\" height=\"57\" class=\"wp-image-8944\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-309.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-309.png 662w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-309-300x26.png 300w\" sizes=\"auto, (max-width: 662px) 100vw, 662px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>In the python file paste the below code.<\/p>\r\n\r\n\r\n\r\n<p>from pyspark import SparkConf, SparkContext<\/p>\r\n\r\n\r\n\r\n<p>conf = SparkConf().setMaster(&#8220;local&#8221;).setAppName(&#8220;PopularMovies&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>sc = SparkContext(conf = conf)<\/p>\r\n\r\n\r\n\r\n<p>lines = sc.textFile(&#8220;sparkdata\/u.data&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>movies = lines.map(lambda x: (int(x.split()[1]), 1))<\/p>\r\n\r\n\r\n\r\n<p>movieCounts = movies.reduceByKey(lambda x, y: x + y)<\/p>\r\n\r\n\r\n\r\n<p>flipped = movieCounts.map( lambda xy: (xy[1],xy[0]) )<\/p>\r\n\r\n\r\n\r\n<p>sortedMovies = flipped.sortByKey()<\/p>\r\n\r\n\r\n\r\n<p>results = sortedMovies.collect()<\/p>\r\n\r\n\r\n\r\n<p>for result in results:<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0 print(result)<\/p>\r\n\r\n\r\n\r\n<p>To save and exit, Press \u2018Ctrl + x\u2019 then \u2018y\u2019 then \u2018Enter\u2019<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -put u.data sparkdata\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To copy u.data file into sparkdata<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 spark-submit popularmovies.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 # To execute popularmovies.py<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"638\" height=\"356\" class=\"wp-image-8945\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-310.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-310.png 638w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-310-300x167.png 300w\" sizes=\"auto, (max-width: 638px) 100vw, 638px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>It will show the most popular movie ID and most number of votes.<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"453\" height=\"193\" class=\"wp-image-8946\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-311.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-311.png 453w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-311-300x128.png 300w\" sizes=\"auto, (max-width: 453px) 100vw, 453px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>To find most 10 popular movies<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To Change Directory<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 mkdir 10popularmovies\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #Make directory named popularmovies<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 cd 10popularmovies\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To change directory into popularmovies<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"512\" height=\"85\" class=\"wp-image-8947\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-312.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-312.png 512w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-312-300x50.png 300w\" sizes=\"auto, (max-width: 512px) 100vw, 512px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 wget https:\/\/s3.amazonaws.com\/sankethadoop\/u.item<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 wget https:\/\/s3.amazonaws.com\/sankethadoop\/u.data<\/p>\r\n\r\n\r\n\r\n<p>It will copy the file into disk.<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"703\" height=\"352\" class=\"wp-image-8948\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-313.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-313.png 703w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-313-300x150.png 300w\" sizes=\"auto, (max-width: 703px) 100vw, 703px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano u.item\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To open the file content<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"576\" height=\"54\" class=\"wp-image-8951\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-316.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-316.png 576w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-316-300x28.png 300w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>To exit press ctrl+ x<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"497\" height=\"197\" class=\"wp-image-8949\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-314.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-314.png 497w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-314-300x119.png 300w\" sizes=\"auto, (max-width: 497px) 100vw, 497px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 nano 10popular.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Create and open the file 10popular.py<\/p>\r\n\r\n\r\n\r\n<p>Paste the below code.<\/p>\r\n\r\n\r\n\r\n<p>from pyspark.sql import SparkSession<\/p>\r\n\r\n\r\n\r\n<p>from pyspark.sql import Row<\/p>\r\n\r\n\r\n\r\n<p>from pyspark.sql import functions<\/p>\r\n\r\n\r\n\r\n<p>def loadMovieNames():<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0 movieNames = {}<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0 with open(&#8220;<strong>\/home\/&lt;userid&gt;\/10popularmovies\/u.item<\/strong>&#8220;, encoding=&#8221;ISO-8859-1&#8221;) as f:<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 for line in f:<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 fields = line.split(&#8216;|&#8217;)<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 movieNames[int(fields[0])] = fields [1]<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0 return movieNames<\/p>\r\n\r\n\r\n\r\n<p>spark = SparkSession.builder.appName(&#8220;PopularMovies&#8221;).getOrCreate()<\/p>\r\n\r\n\r\n\r\n<p>nameDict = loadMovieNames()<\/p>\r\n\r\n\r\n\r\n<p>lines = spark.sparkContext.textFile(&#8220;sparkdata\/u.data&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>movies = lines.map(lambda x: Row(movieID =int(x.split()[1])))<\/p>\r\n\r\n\r\n\r\n<p>movieDataset = spark.createDataFrame(movies)<\/p>\r\n\r\n\r\n\r\n<p>topMovieIDs = movieDataset.groupBy(&#8220;movieID&#8221;).count().orderBy(&#8220;count&#8221;,ascending = False).cache()<\/p>\r\n\r\n\r\n\r\n<p>topMovieIDs.show()<\/p>\r\n\r\n\r\n\r\n<p>top10 = topMovieIDs.take(10)<\/p>\r\n\r\n\r\n\r\n<p>print(&#8220;\\n&#8221;)<\/p>\r\n\r\n\r\n\r\n<p>for result in top10:<\/p>\r\n\r\n\r\n\r\n<p>\u00a0\u00a0\u00a0\u00a0 print(&#8220;%s: %d&#8221; % (nameDict[result[0]], result[1]))<\/p>\r\n\r\n\r\n\r\n<p>spark.stop()<\/p>\r\n\r\n\r\n\r\n<p>Change the highlighted area as your directory<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"703\" height=\"383\" class=\"wp-image-8952\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-317.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-317.png 703w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-317-300x163.png 300w\" sizes=\"auto, (max-width: 703px) 100vw, 703px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -put u.data sparkdata\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To copy u.data file into sparkdata<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hadoop fs -ls sparkdata\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To display the content<\/p>\r\n\r\n\r\n\r\n<p>$\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 spark-submit 10popular.py\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To execute the 10popular.py file<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"703\" height=\"49\" class=\"wp-image-8953\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-318.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-318.png 703w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-318-300x21.png 300w\" sizes=\"auto, (max-width: 703px) 100vw, 703px\" \/><\/figure>\r\n\r\n\r\n\r\n<p>It will display the most popular 10 movies.<\/p>\r\n\r\n\r\n\r\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"403\" height=\"731\" class=\"wp-image-8954\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-319.png\" alt=\"\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-319.png 403w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-319-165x300.png 165w\" sizes=\"auto, (max-width: 403px) 100vw, 403px\" \/><\/figure>\r\n","protected":false},"excerpt":{"rendered":"<p>Prerequisites GCP account Open Cloud Console. Open Menu &gt; Dataproc &gt; Clusters Click the Cluster. Click on VM Instances Click on SSH of master node Check whether the components is already installed. $\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 pyspark\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #opens pyspark To exit press ctrl +d $\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hive\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To check hive is available or not To exit press ctrl +d [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1634,1],"tags":[1415,1412,1413,1414,1411,605,699,700,617,683,684,685,611,1400,692],"class_list":["post-8915","post","type-post","status-publish","format-standard","hentry","category-dataproc","category-google-cloud-platform","tag-dataproc","tag-dataproc-cluster","tag-dataproc-cluster-creation","tag-dataproc-cluster-properties","tag-dataproc-in-gcp","tag-gcp","tag-gcp-certification","tag-gcp-cloud-console","tag-google-cloud","tag-google-cloud-certification","tag-google-cloud-console","tag-google-cloud-courses","tag-google-cloud-platform","tag-google-cloud-platform-tutorial","tag-google-cloud-training"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Working with Dataproc - Prwatech<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Working with Dataproc - Prwatech\" \/>\n<meta property=\"og:description\" content=\"Prerequisites GCP account Open Cloud Console. Open Menu &gt; Dataproc &gt; Clusters Click the Cluster. Click on VM Instances Click on SSH of master node Check whether the components is already installed. $\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 pyspark\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #opens pyspark To exit press ctrl +d $\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hive\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To check hive is available or not To exit press ctrl +d [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/\" \/>\n<meta property=\"og:site_name\" content=\"Prwatech\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/prwatech.in\/\" \/>\n<meta property=\"article:published_time\" content=\"2021-05-30T05:41:44+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-07-13T11:53:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png\" \/>\n<meta name=\"author\" content=\"Prwatech\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@Eduprwatech\" \/>\n<meta name=\"twitter:site\" content=\"@Eduprwatech\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Prwatech\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/\",\"url\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/\",\"name\":\"Working with Dataproc - Prwatech\",\"isPartOf\":{\"@id\":\"https:\/\/prwatech.in\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png\",\"datePublished\":\"2021-05-30T05:41:44+00:00\",\"dateModified\":\"2023-07-13T11:53:24+00:00\",\"author\":{\"@id\":\"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3\"},\"breadcrumb\":{\"@id\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#primaryimage\",\"url\":\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png\",\"contentUrl\":\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png\",\"width\":456,\"height\":339},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/prwatech.in\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Working with Dataproc\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/prwatech.in\/blog\/#website\",\"url\":\"https:\/\/prwatech.in\/blog\/\",\"name\":\"Prwatech\",\"description\":\"Share Ideas, Start Something Good.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/prwatech.in\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3\",\"name\":\"Prwatech\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/prwatech.in\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g\",\"caption\":\"Prwatech\"},\"url\":\"https:\/\/prwatech.in\/blog\/author\/prwatech123\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Working with Dataproc - Prwatech","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"Working with Dataproc - Prwatech","og_description":"Prerequisites GCP account Open Cloud Console. Open Menu &gt; Dataproc &gt; Clusters Click the Cluster. Click on VM Instances Click on SSH of master node Check whether the components is already installed. $\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 pyspark\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #opens pyspark To exit press ctrl +d $\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 hive\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 #To check hive is available or not To exit press ctrl +d [&hellip;]","og_url":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/","og_site_name":"Prwatech","article_publisher":"https:\/\/www.facebook.com\/prwatech.in\/","article_published_time":"2021-05-30T05:41:44+00:00","article_modified_time":"2023-07-13T11:53:24+00:00","og_image":[{"url":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png","type":"","width":"","height":""}],"author":"Prwatech","twitter_card":"summary_large_image","twitter_creator":"@Eduprwatech","twitter_site":"@Eduprwatech","twitter_misc":{"Written by":"Prwatech","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/","url":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/","name":"Working with Dataproc - Prwatech","isPartOf":{"@id":"https:\/\/prwatech.in\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#primaryimage"},"image":{"@id":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#primaryimage"},"thumbnailUrl":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png","datePublished":"2021-05-30T05:41:44+00:00","dateModified":"2023-07-13T11:53:24+00:00","author":{"@id":"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3"},"breadcrumb":{"@id":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#primaryimage","url":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png","contentUrl":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2021\/05\/image-281.png","width":456,"height":339},{"@type":"BreadcrumbList","@id":"https:\/\/prwatech.in\/blog\/google-cloud-platform\/dataproc\/working-with-dataproc\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/prwatech.in\/blog\/"},{"@type":"ListItem","position":2,"name":"Working with Dataproc"}]},{"@type":"WebSite","@id":"https:\/\/prwatech.in\/blog\/#website","url":"https:\/\/prwatech.in\/blog\/","name":"Prwatech","description":"Share Ideas, Start Something Good.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/prwatech.in\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3","name":"Prwatech","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/prwatech.in\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g","caption":"Prwatech"},"url":"https:\/\/prwatech.in\/blog\/author\/prwatech123\/"}]}},"_links":{"self":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts\/8915","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/comments?post=8915"}],"version-history":[{"count":2,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts\/8915\/revisions"}],"predecessor-version":[{"id":10084,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts\/8915\/revisions\/10084"}],"wp:attachment":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/media?parent=8915"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/categories?post=8915"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/tags?post=8915"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}