Juju solutions for big data

Setting up and configuring Big Data tools can be complex and daunting. Juju frees you to explore, test and evaluate Big Data solutions, then choose the one that works best for you.

Why use Juju for big data?


Reduce the time to deploy Hadoop and other solutions from days to minutes.


Experiment with different configurations and solutions to choose what works for you.


Port your solution from one infrastructure to another quickly and seamlessly.


Charms encapsulate best practice, allowing you to focus on your work.

Core bundles

Anssr Data-engine

The reference implementation of the Anssr Data Platform, developed by Juju Experts at Spicule.

Hadoop Processing

Apache Hadoop is a software framework that supports distributed storage and processing of vast amounts of data. This bundle provides a core set of proven Hadoop components from Apache Bigtop, coupled with monitoring and logging software to enable cluster observability.

Spark Processing

Apache Spark is a fast engine for large-scale data processing. This bundle includes components from Apache Bigtop to provide Spark in standalone HA mode. Ganglia and rsyslog are included to monitor cluster health and syslog activity.

Hadoop Spark

This bundle combines the capabilities of the Hadoop Processing and Spark Processing bundles above. It gives users a flexible solution consisting of HDFS, MapReduce, and Spark that can process a wide variety of workloads.

Key technologies

Hadoop Kafka

Apache Kafka is an open-source message broker that aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds. Combine Kafka with Hadoop for a powerful stream/batch solution.

Key charms included in the bundle:

Hadoop HBase

Apache HBase is known as the Hadoop database. Combined, HBase and Hadoop can process enormous tables (billions of rows by millions of columns) atop clusters of commodity hardware.

Key charms included in the bundle

Hadoop Flume Zeppelin

An end-to-end Big Data solution that enables ingestion, processing, and visualization of log data. The ingestion component highlighted here is the Apache Flume service.

Key charms included in the bundle

Use the bundles above, alter and extend them, or create your own.