- Run all commands as a user with sudo privileges
- Run `initial.sh`
- Change the path where the GitHub project is downloaded in the `docker-compose.yml` file to the one on your local machine
- Access Portainer-Web at http://localhost:9000
- Create a stack on Portainer from `docker-compose.yml` and start the stack: https://docs.portainer.io/v/ce-2.9/user/docker/stacks/add
- Run `start-project.sh`
- Run `run-wordcount.sh` to run the Word Count job
- Enjoy!
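
The path change in `docker-compose.yml` can also be scripted. The following is only a minimal sketch: it assumes the compose file contains the original checkout path as a literal string. `set_project_path` is a helper name made up for illustration, and `/old/path/to/project` is a hypothetical placeholder, not a value taken from this repository.

```shell
# Hypothetical helper: replace every occurrence of OLD_PATH with NEW_PATH
# in a compose file. A backup copy is kept as FILE.bak.
set_project_path() {
  old_path="$1"
  new_path="$2"
  file="$3"
  cp "$file" "$file.bak"
  # '|' is the sed delimiter, so '/' in paths needs no escaping.
  # OLD_PATH is treated as a basic regex ('.' matches any character),
  # and NEW_PATH must not contain '|' or '&'.
  sed "s|${old_path}|${new_path}|g" "$file.bak" > "$file"
}

# Example with placeholder paths (adjust both to your machine):
# set_project_path /old/path/to/project "$HOME/project" docker-compose.yml
```

Review the result (for example with `diff docker-compose.yml.bak docker-compose.yml`) before creating the stack.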
- Have Ansible installed
- Enter the ansible folder
- Set your hosts in the `hosts` file in the ansible folder
- Run `./install-ssh-keys.sh hosts`, passing the hosts file as the argument after the script name, to set up the SSH key on your remote hosts
- Run `ansible-playbook playbook.yml`
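
Before distributing SSH keys, it can help to see which hosts the inventory contains. The sketch below is not the content of `install-ssh-keys.sh` (that script was not inspected). It assumes the `hosts` file uses Ansible's INI inventory format, and the function names are made up for illustration. It prints the `ssh-copy-id` commands instead of running them.

```shell
# List host entries from an INI-style inventory, in file order, once each.
# Sections such as [group:vars] and [group:children] hold no hosts and
# are skipped, as are blank lines and comments.
list_inventory_hosts() {
  awk '
    { sub(/^[ \t]+/, "") }                              # trim leading whitespace
    $0 == "" || /^[#;]/ { next }                        # blank line or comment
    /^\[/ { skip = ($0 ~ /:(vars|children)\]/); next }  # section header
    !skip && !seen[$1]++ { print $1 }                   # first field is the host
  ' "$1"
}

# Dry run: print one ssh-copy-id command per host.
# If an entry is an alias that relies on ansible_host or ansible_user,
# adjust the printed target by hand.
print_ssh_copy_commands() {
  list_inventory_hosts "$1" | while read -r host; do
    echo "ssh-copy-id ${host}"
  done
}

# Example:
# print_ssh_copy_commands hosts
```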
- This cluster consists of a master node and two slaves by default
- You might have to change the resource configs. The current config uses 4 cores and 4 GB of RAM
- Enter directory /data/big-data-storage
- Create a text file or any other file
- Access the HDFS Web UI at http://localhost:9870
- Navigate to "Browse the file system"
- Open the /flume folder
- Check that the file you just created on the host machine appears there
- Enjoy!
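
The same check can be done from a terminal through the WebHDFS REST API instead of the browser. This is a sketch under assumptions: it presumes WebHDFS is reachable on the same NameNode port as the web UI (the Hadoop default, not verified against this project's configuration). `flume-test.txt` is a hypothetical file name, and `webhdfs_list_url` is a helper made up for illustration.

```shell
# Build the WebHDFS URL that lists a directory (op=LISTSTATUS).
# $1 = NameNode web address, $2 = absolute HDFS path
webhdfs_list_url() {
  printf '%s/webhdfs/v1%s?op=LISTSTATUS\n' "$1" "$2"
}

# Create a sample file on the host, then list the /flume folder.
# The response is a JSON document describing the files in the folder.
# echo "hello flume" | sudo tee /data/big-data-storage/flume-test.txt
# curl -s "$(webhdfs_list_url http://localhost:9870 /flume)"
```

The name under which the file appears in `/flume` depends on how Flume is configured, so look for a new entry rather than a specific file name.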
To open one of the web UIs, use the corresponding address and port:
- http://localhost:9870 (HDFS Web UI)
- http://localhost:8088 (YARN Web UI)
- http://localhost:19888 (MapReduce JobHistory Web UI)
- http://localhost:10002 (HiveServer2 Web UI)
- http://localhost:9000 (Portainer Web UI)
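
To see quickly which of these web UIs are reachable, a small probe loop can help. This is a minimal sketch, assuming `curl` is installed and the stack runs on the local machine. The function names are made up for illustration, and "up" only means the port answered an HTTP request, not that the service is healthy.

```shell
# Print "port name" pairs for the web UIs listed above.
web_uis() {
  cat <<'EOF'
9870 HDFS
8088 YARN
19888 MapReduce-JobHistory
10002 HiveServer2
9000 Portainer
EOF
}

# Probe each UI and report whether it answers over HTTP within 3 seconds.
check_web_uis() {
  web_uis | while read -r port name; do
    if curl -s -o /dev/null --max-time 3 "http://localhost:${port}"; then
      echo "up    ${name} (http://localhost:${port})"
    else
      echo "down  ${name} (http://localhost:${port})"
    fi
  done
}

# Example:
# check_web_uis
```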