apache hadoop client #6
Description
Hadoop is a software platform that lets you easily write and run applications that process vast amounts of data. This charm manages a dedicated client node as a place to run MapReduce jobs.
Overview
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model.
This charm deploys a client node running Apache Hadoop 2.4.1 from which workloads can be manually run.
Usage
This charm is intended to be deployed as a part of the core bundle:
juju quickstart apache-core-batch-processing
This deploys the Apache Hadoop platform with a single client unit. From there, you can manually load and run MapReduce jobs:
juju scp my-job.jar client/0:
juju ssh client/0
hadoop jar my-job.jar
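Many jobs read their input from HDFS and write their results back to it, so a run often involves a few more steps than the three commands above. The following is a minimal sketch of such a run, intended for use on the client unit. It rests on assumptions that do not come from this charm: the helper names `run` and `run_job` are hypothetical, the job is assumed to take an input directory and an output directory as its two arguments, and your user is assumed to have a writable HDFS home directory so that the relative paths `input` and `output` resolve.

```shell
# Hypothetical helper: print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical helper: run_job JAR MAIN_CLASS LOCAL_INPUT
# Stages LOCAL_INPUT in HDFS, runs the job, then prints the job output.
# Assumes the job takes <input dir> <output dir> as its arguments.
run_job() {
    jar=$1
    main_class=$2
    local_input=$3

    # Create the HDFS input directory (relative to the HDFS home directory).
    run hadoop fs -mkdir -p input &&
    # Copy the local input file into HDFS.
    run hadoop fs -put "$local_input" input/ &&
    # Run the job; the main class is only needed if the jar manifest lacks one.
    run hadoop jar "$jar" "$main_class" input output &&
    # Print the result files; the glob is quoted so HDFS expands it, not the shell.
    run hadoop fs -cat 'output/part-*'
}
```

After `juju ssh client/0`, a call might look like `run_job my-job.jar com.example.MyJob input.txt`, where the class name and input file are placeholders. Setting `DRY_RUN=1` first prints the four `hadoop` commands without running them, which is a cheap way to check the paths. Jobs that use the standard file output formats typically refuse to start if the output directory already exists, so remove it with `hadoop fs -rm -r output` before re-running.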