Hillview: Distributed Data Visualization

version 1.0 — March 18, 2021

Summary

Hillview is a simple cloud-based spreadsheet program for browsing large data collections. The data is read-only. Users can sort, find, filter, transform, query, zoom in and out, and chart data. All operations are performed by direct manipulation in the GUI. Hillview is designed to work on very large data sets (billions of rows). It can import data from a variety of sources: CSV files, ORC files, Parquet files, databases, and parallel databases; new connectors can be added with relatively little effort. Hillview uses all the cores of the worker machines to produce visualizations quickly.

Hillview is a distributed system, composed of two pieces:

  • A distributed set of one or many workers, which should be installed close to the data (e.g., on the machines that host the data).
  • A front-end service that runs a web server and aggregates data from all workers.
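The split between workers and the front end can be illustrated with a small sketch. This is not Hillview's actual implementation: the function names, the histogram summary, and the use of threads in place of remote workers are all assumptions made for illustration. The idea shown is only that each worker summarizes the rows it holds locally, and the front end merges those small summaries instead of moving the raw data.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def worker_histogram(rows, bucket_width):
    """Runs on a worker: summarize only the rows stored locally."""
    return Counter((value // bucket_width) * bucket_width for value in rows)


def aggregate(partials):
    """Runs on the front end: merge the small per-worker summaries."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def distributed_histogram(shards, bucket_width=10):
    # Threads stand in for remote workers here (an assumption for the sketch).
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(lambda s: worker_histogram(s, bucket_width), shards))
    return aggregate(partials)


# Hypothetical data: each inner list is the data held by one worker.
shards = [[1, 4, 12, 25], [3, 14, 15, 29], [7, 21, 22]]
histogram = distributed_histogram(shards)
print(sorted(histogram.items()))  # [(0, 4), (10, 3), (20, 4)]
```

Only the per-bucket counts cross the network in this model, which is why installing workers close to the data matters.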

The source code of Hillview is available as an open-source project under the Apache 2.0 license in Hillview's GitHub repository. For questions, feature requests, or bug reports, please file an issue on GitHub.

Requirements

  • Java 8 on all machines involved.
  • Python 2 or 3, for the installation scripts used when deploying on a cluster or set of machines.
  • Windows Subsystem for Linux (WSL), if using Windows.
  • A modern web browser.
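A quick local check of the Java and Python requirements might look like the sketch below. It is not part of the Hillview distribution; the function names are hypothetical, and the script assumes Python 3.7 or later (for `capture_output`) even though the installation scripts themselves also accept Python 2. It only reports what it finds on the machine where it runs.

```python
import re
import shutil
import subprocess
import sys


def parse_java_major(version_output):
    """Extract the major Java version from `java -version` output.

    Handles the legacy scheme ("1.8.0_292" means Java 8) and the newer
    scheme ("11.0.2" means Java 11). Returns None if no version is found.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major = int(match.group(1))
    if major == 1 and match.group(2) is not None:
        return int(match.group(2))
    return major


def check_local_requirements():
    """Report the local Python version and Java major version as a dict."""
    report = {"python": sys.version_info[:2], "java": None}
    if shutil.which("java") is not None:
        # `java -version` normally writes to stderr; read both streams to be safe.
        result = subprocess.run(["java", "-version"], capture_output=True, text=True)
        report["java"] = parse_java_major(result.stderr + result.stdout)
    return report


if __name__ == "__main__":
    print(check_local_requirements())
```

On a cluster, the same check would have to be run on every machine that will host a worker or the web server.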

Instructions

Hillview is designed to be deployed as a set of service workers on the machines hosting the data, plus a front-end web server. The web server can run on one of the worker machines; the smallest valid cluster is a single machine hosting both a worker and the web server.

Instructions for installation on a cluster

  1. Please read the requirements.
  2. Install Java 8 on all cluster machines where the service will run.
  3. Choose a local user account for Hillview; the service runs with the permissions of that user.
  4. Enable password-less ssh access to all machines for the user account used by Hillview.
  5. Download and extract the zip archive.
  6. The next commands assume a bash shell. On Windows, start bash from a command prompt. (You should have installed WSL as part of the requirements.)
  7. cd bin
  8. Edit the Hillview configuration file
    config.json
    This file describes the machines where Hillview will be installed. The comments in the file should serve as a guide.
  9. Run the deployment script to install Hillview on the cluster machines.
    ./deploy.py config.json
  10. To start the services, run
    ./start.py config.json
  11. To use Hillview, open a web browser and connect to the web server you have configured, using port 8080.
  12. Try loading the Hillview logs: Load/Hillview logs.
  13. When loading data, remember that the data itself must generally reside on the worker machines.
  14. To stop the services, run
    ./stop.py config.json
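The deploy and start steps above can be wrapped in a small driver, sketched below under several assumptions. The script names, `config.json`, and port 8080 come from the steps above; everything else (the function names, the `hosts` and `webserver` arguments, the timeouts) is hypothetical. The sketch does not read `config.json`, because its schema is not described here, so the caller must pass the same host names that appear in that file. The `BatchMode=yes` option makes ssh fail instead of prompting, which is a simple way to confirm password-less access before deploying.

```python
import socket
import subprocess
import time


def lifecycle_commands(config="config.json"):
    """Commands from the steps above, in order: deploy, then start."""
    return [["./deploy.py", config], ["./start.py", config]]


def ssh_check_command(host):
    """Command that succeeds only if password-less ssh to `host` works."""
    return ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=5", host, "true"]


def wait_for_port(host, port=8080, timeout=60.0, interval=1.0):
    """Poll until a TCP connection to host:port succeeds or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            time.sleep(interval)
    return False


def deploy_and_start(hosts, webserver, config="config.json"):
    """Run from the bin directory of the extracted archive.

    `hosts` and `webserver` are supplied by the caller and must match
    the machines listed in the configuration file.
    """
    for host in hosts:
        subprocess.run(ssh_check_command(host), check=True)
    for command in lifecycle_commands(config):
        subprocess.run(command, check=True)
    # True once the web server accepts connections on port 8080.
    return wait_for_port(webserver)
```

A call such as `deploy_and_start(["worker1", "worker2"], "worker1")` (host names invented for the example) would stop at the first failing step, because `check=True` raises an exception on a non-zero exit code.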

Additional simplified instructions are available for running Hillview on a single machine (macOS, Windows, or Linux).

The user manual is online.
