Machine-Learning Java Data Analysis / Data Visualization

Back

1. Flink

Open source platform for distributed stream and batch data processing.

2. Hadoop

Hadoop/HDFS.

3. Onyx

Distributed, masterless, high performance, fault tolerant data processing. Written entirely in Clojure.

4. Spark

Spark is a fast and general engine for large-scale data processing.

5. Storm

Storm is a distributed realtime computation system.

6. Impala

Real-time Query for Hadoop.

7. DataMelt

Mathematics software for numeric computation, statistics, symbolic calculations, data analysis and data visualization.