Posts

Showing posts from September, 2016

Integration Spark with SQL, Hive In MapR

1. Choose the Spark master node, Worker nodes and history server nodes. 2. Install mapr-spark on all the worker nodes and spark-history server in any of the node 3. Install spark-master on the Master node 4. Run configure.sh -R script on all the nodes 5. Set the password less ssh between the master node and all the worker nodes ssh-keygen -t rsa ssh-copy-id <RSA Public key Path> <hostname> 6. Edit the node configuration file under /opt/mapr/spark/spark-1.6.1/conf directory , rename the slaves.template to slaves and add the list of worker nodes hostnames or IP addresses 7. Once you done the configure.sh -R, Spark master and spark history server will be started automatically by the warden service 8. Start the worker nodes by using below commands /opt/mapr/spark/spark-1.6.1/sbin/start-slaves.sh This script will start the worker daemons in all the nodes, configured under the slaves file Checking the spark functionality Running Sample...