Getting Started with Big Data Technologies Using an Oracle VM
If you have been wanting to get started with Big Data and felt overwhelmed with all the lingo and technologies, I wanted to share some information that I think you might find helpful. The Oracle Big Data Lite VMs are a smaller version of Oracle’s Big Data Appliance and they come pre-installed with all the software one would need to get started.
The following components are included on Oracle Big Data Lite 4.11:
Oracle Enterprise Linux 6.9
Oracle Database 12c Release 1 Enterprise Edition (126.96.36.199) – including Oracle Big Data SQL-enabled external tables, Oracle Multitenant, Oracle Advanced Analytics, Oracle OLAP, Oracle Partitioning, Oracle Spatial and Graph, and more.
Cloudera Distribution including Apache Hadoop (CDH5.13.1)
Cloudera Manager (5.13.1)
Oracle Big Data Spatial and Graph 2.4
Oracle Big Data Connectors 4.11
Oracle SQL Connector for HDFS 3.8.1
Oracle Loader for Hadoop 3.9.1
Oracle Data Integrator 12c (188.8.131.52.0)
Oracle R Advanced Analytics for Hadoop 2.7.1
Oracle XQuery for Hadoop 4.9.1
Oracle Data Source for Apache Hadoop 1.2.1
Oracle Shell for Hadoop Loaders 1.3.1
Oracle NoSQL Database Enterprise Edition 12cR1 (4.5.12)
Oracle JDeveloper 12c (184.108.40.206.0)
Oracle SQL Developer and Data Modeler 17.3.1 with Oracle REST Data Services 3.0.7
Oracle Data Integrator 12cR1 (220.127.116.11.0)
Oracle GoldenGate 12c (18.104.22.168.2)
Oracle R Distribution 3.3.0
Oracle Perfect Balance 2.10.0
To get started, do a Google search for oracle big data lite and you will get a choice to download either the more stable version (4.2.1) or the latest version (4.11).
Once you get to the main page, accept the license agreement then download the Deployment Guide, the zip files for the VM and Oracle VirtualBox. Also, make sure your PC/Laptop meets the technical requirements. The key steps to follow are listed under the ‘To get started’ section.
Don’t forget to check out the Getting Started section on the main page, it is packed with useful information (YouTube videos, pdf files, other hands-on-labs,…). The files needed for the hands-on-labs (HOLs) are already copied to the VM, all you need to do is follow the instructions.
As a Data Integration enthusiast, I first began my journey with Tame Big Data using Oracle Data Integration HOL. This HOL will walk you through the use of Oracle Data Integrator (ODI) with Big Data Connectors, which supports native code generation for Pig Latin, Spark, and Oozie standards.
Occasionally, you may run into a few issues. If you do, you should create a new thread on GitHub with all the pertinent information.