(Okay, so you can be up and running quicker if you have a better internet connection than me.)
Want to get your hands dirty with Hadoop related technologies but don’t have time to waste? I’ve spent way too much time trying to get HBase, for example, running on my Macbook with Brew and wish I had just tried this Virtualbox approach before.
In this short post I show how easy it was for me to get an NFS share mounted on OSX – so I could transparently and simply copy files on HDFS without needing any special tools. Here are the details…
Hortonworks Sandbox is an excellent virtual machine file, that’s primed and loaded with all the tools you would want to try out. Best of all, it’s free and a simple (meh) 2.5GB download. Virtualbox, VMWare and Hyper-V versions are all available.
Once I launched it, I had access to a simple web console, showing me all the varied apps I have running.
Here I enabled Ambari and was able to access that dashboard right away. (Some tutorials say you have to reboot the VM, but I didn’t have to.)
The first thing I wanted to do was enable file access via NFS. You read that right – I had heard earlier today that Hortonworks Data Platform (HDP) supported NFS right out of the box. To be fair, I don’t know what others are doing, so maybe this is old news, but it was good news for me.
So using various configuration windows via Ambari, I went through a tutorial for configuring NFS capabilities. However, to my joy, I found that all the configuration changes I needed to make were already made by default! So, NFS by default sounded good to me.
After switching Virtualbox from NAT networking to Bridged mode, I then told OSX to mount the drive. Et voila!
So I’m just getting started with using this Sandbox environment but it was definitely a timesaver for setting up a development environment.
I’ll want to jump into testing HBase with some other work I’m doing, but I may also get tempted into trying various other apps they’ve packaged neatly into this distribution.. when I have some spare time that is. 😉
- Learnings from TigerGraph and Expero webinar - April 1, 2020
- 4 Webinars This Week – GPU, 5G, graph analytics, cloud - March 30, 2020
- Diving into #NoSQL from the SQL Empire … - February 28, 2017
- VID: Solving Performance Problems on Hadoop - July 5, 2016
- Storing Zeppelin Notebooks in AWS S3 Buckets - June 7, 2016
- VirtualBox extension pack update on OS X - April 11, 2016
- Zeppelin Notebook Quick Start on OSX v0.5.6 - April 4, 2016
- Spark Analysis of Global Place Names (GeoNames) - January 20, 2016
- Serverspec checks settings on a Hadoop cluster - December 8, 2015
- Hadoop Options for SQL Databases - October 15, 2015