anssr data engine #1
Overview
The Anssr Data Engine bundle is our reference implementation of all the components we offer with the Anssr Data Platform.
All the components are aware of their relations and configure themselves automatically on any of the leading Cloud platforms.
Bundle Composition
- Apache Hadoop Client
- Ganglia
- Ganglia Node
- Namenode
- Apache Hadoop Plugin
- Apache Hadoop Resource Manager
- RSyslog
- RSyslog Forwarder
- Hadoop Slave
- Apache Spark
- Apache Zookeeper
- Apache Drill
- Apache Zeppelin
- Pig
- Hive
- HBase
- Mahout
- Flume HDFS
- Flume Kafka
- Apache Kafka
Deploying
To deploy this stack, you can simply press the deploy button at the top of the page or run:
juju deploy cs:~spiculecharms/bundle/anssr-data-engine
This will spin up 9 machines and deploy the 20 components to their respective machines. Deployment time varies with cloud and network performance, but it usually takes about 20 minutes to reach a fully operational and scalable Hadoop platform.
Verifying
To check that all the components have deployed successfully, open the Status tab in the Juju GUI or run:
juju status
Then ensure that none of the units report an error state.
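The check above can be scripted. The following is a minimal sketch, not part of the bundle: `report_error_units` is a hypothetical helper name, and it assumes that a unit in an error state shows the word "error" somewhere on its line in the `juju status` text output.

```shell
# Hypothetical helper (not shipped with the bundle).
# Reads `juju status` text on stdin and prints every line that contains
# the whole word "error" (case-insensitive).
# Returns 0 when no such line is found, 1 otherwise.
report_error_units() {
    if grep -iw 'error'; then
        return 1
    fi
    return 0
}

# Usage (requires a bootstrapped Juju controller with the bundle deployed):
#   juju status | report_error_units && echo "no units report an error"
```

Because the helper only reads standard input, it can also be run against saved status output when investigating a failed deployment.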
Monitoring
Scaling
To scale an application, select its charm in the GUI, then select the units in the menu on the left and enter the number of extra units you require. Alternatively, run:
juju add-unit -n 1 <charm name>
Where 1 is the number of new units you want and <charm name> is the charm you want to scale.
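As a rough sketch of how the scaling command could be wrapped in a script, the helper below validates its arguments and prints the `juju add-unit` command instead of running it, so the result can be reviewed first. The name `add_unit_cmd` and the application name `example-app` are placeholders, not names from this bundle; use `juju status` to find the real application names.

```shell
# Hypothetical helper (not shipped with the bundle).
# Prints the add-unit command for COUNT extra units of APP.
# Rejects a missing, zero, or non-numeric COUNT and a missing APP.
add_unit_cmd() {
    count=${1:-}
    app=${2:-}
    case $count in
        ''|0|*[!0-9]*)
            echo "count must be a positive integer" >&2
            return 1
            ;;
    esac
    if [ -z "$app" ]; then
        echo "application name is required" >&2
        return 1
    fi
    printf 'juju add-unit -n %s %s\n' "$count" "$app"
}

# Usage: print the command, review it, then run it by hand.
#   add_unit_cmd 2 example-app
```

Printing rather than executing keeps the sketch safe to try on a machine without a Juju controller.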
Issues
Contact Information
You can get help and support for this bundle from:
Resources
- Spicule’s solutions can solve your Big Data challenges
- Supported analytics
- Streaming data platforms