Prerequisites
Hardware requirements:
Local machine
RAM: 8 GB or above
Software requirements:
VMware Workstation, version 16 Pro
Download the software from this link: https://prwatech.in/blog/software-installation/vmware-workstation-installation/
Ubuntu, version 18.04: https://prwatech.in/blog/software-installation/vmware-workstation-installation/
Command-line skills: Linux
Learn Linux: https://prwatech.in/blog/linux/linux-architecture/
1. Install Java
Command: $ sudo apt-get install openjdk-8-jdk
2. Check the Java version to confirm that it is installed
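The check in step 2 can be done with java -version. The following is a minimal sketch, assuming a POSIX shell. JAVA_STATUS is only an illustrative variable name, not part of Java or Spark. For OpenJDK 8, the reported version string starts with 1.8.

```shell
# java -version writes to standard error, so redirect it to standard output.
if command -v java >/dev/null 2>&1; then
    java -version 2>&1 | head -n 1
    JAVA_STATUS="installed"   # illustrative variable, not required by Spark
else
    echo "java not found; repeat step 1"
    JAVA_STATUS="missing"
fi
```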
3. Download Spark in the terminal using the following command:
wget https://archive.apache.org/dist/spark/spark-2.4.0/spark-2.4.0-bin-hadoop2.7.tgz
4. Extract the archive using the following command:
tar -xvf spark-2.4.0-bin-hadoop2.7.tgz
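Before changing any configuration, it can help to confirm that the archive unpacked where expected. The sketch below assumes the tar command was run in the current directory. EXTRACT_STATUS is an illustrative variable name.

```shell
# Directory created by the tar command in step 4 (assumed to be in the current directory).
SPARK_DIR="spark-2.4.0-bin-hadoop2.7"

# After extraction, the spark-shell launcher should exist and be executable.
if [ -x "$SPARK_DIR/bin/spark-shell" ]; then
    echo "Found $SPARK_DIR/bin/spark-shell"
    EXTRACT_STATUS="ok"        # illustrative variable
else
    echo "Spark launcher not found in $(pwd)/$SPARK_DIR; repeat steps 3 and 4"
    EXTRACT_STATUS="missing"
fi
```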
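5. Open the bashrc file (~/.bashrc) in a text editor
6. Add the following lines to the bashrc file (replace /home/user with the directory where you extracted Spark)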
export SPARK_HOME=/home/user/spark-2.4.0-bin-hadoop2.7
export PATH=$PATH:$SPARK_HOME/bin
7. Save the bashrc file, then reload it with source ~/.bashrc or open a new terminal so the changes take effect
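Steps 5 to 7 can also be done without an editor. This is a sketch under two assumptions: Spark was extracted into the current user's home directory, and the shell is bash. SPARK_DIR and BASHRC are hypothetical helper variables introduced here for illustration.

```shell
# Assumption: Spark was extracted into the current user's home directory.
SPARK_DIR="$HOME/spark-2.4.0-bin-hadoop2.7"
# File to edit; set BASHRC beforehand to point at a different file.
BASHRC="${BASHRC:-$HOME/.bashrc}"

# Append the two export lines only if SPARK_HOME is not already defined in the file.
if ! grep -q '^export SPARK_HOME=' "$BASHRC" 2>/dev/null; then
    {
        echo "export SPARK_HOME=$SPARK_DIR"
        # Single quotes keep $PATH and $SPARK_HOME literal so they expand at login time.
        echo 'export PATH=$PATH:$SPARK_HOME/bin'
    } >> "$BASHRC"
fi

# Show the Spark-related line now present in the file.
grep '^export SPARK_HOME=' "$BASHRC"
```

Pointing BASHRC at a scratch file first is one way to preview the result before touching the real file.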
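8. Run spark-shell from the bin directory of Spark (once the PATH entry from step 6 is active, the spark-shell command also works from any directory)
9. The Spark installation is now complete, and you can run queries in the Spark shell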
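As a first query, the sketch below feeds one Scala statement to spark-shell from the terminal. It assumes spark-shell is on the PATH and relies on the spark session object that the shell creates at startup. SHELL_STATUS is an illustrative variable name. The statement counts the numbers 1 through 5, so the shell should report a count of 5.

```shell
# Run one statement in spark-shell if it is on the PATH; otherwise report that it is missing.
if command -v spark-shell >/dev/null 2>&1; then
    # spark-shell reads Scala statements from standard input and exits at end of input.
    spark-shell <<'EOF'
spark.range(1, 6).count()
EOF
    SHELL_STATUS="ran"         # illustrative variable
else
    echo "spark-shell not found; reload the bashrc file or check SPARK_HOME"
    SHELL_STATUS="missing"
fi
```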