Machine-Learning Scala Data Analysis / Data Visualization


1. NDScala

N-dimensional arrays in Scala 3. Think NumPy ndarray, but with compile-time type-checking/inference over shapes, tensor/axis labels & numeric data types

2. MLlib in Apache Spark

Distributed machine learning library in Spark

3. Hydrosphere Mist

A service for deploying Apache Spark MLlib machine learning models as real-time, batch, or reactive web services.

4. Scalding

A Scala API for Cascading.

5. Summingbird

Streaming MapReduce with Scalding and Storm.

6. Algebird

Abstract Algebra for Scala.

7. PredictionIO

A machine learning server for software developers and data engineers.

8. BIDMat

CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.

9. Flink

Open source platform for distributed stream and batch data processing.

10. Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.