Home  CV  Contact

Apache Spark Logo Apache Flink Logo

Big Data Scripts (repository)

Summary
💾 A collection of Apache Spark scripts used to get familiar with the basics of batch processing of big data and a collection of Apache Flink scripts used to get familiar with the basics of stream processing of big data.

Features

The Apache Spark scripts cover a range of topics such as:

The Apache Flink scripts cover a range of topics such as:

Tools

PurposeName
Programming languageScala
Cluster computing frameworkApache Spark, Apache Flink

Installation Process

It is assumed that both a Java JDK and an IDE such as IntelliJ are installed and that the users operating system is Windows.

Licence

These Big Data scripts are published under the MIT licence, which can be found in the LICENSE file. For this repository, the terms laid out there shall not apply to any individual that is currently enrolled at a higher education institution as a student. Those individuals shall not interact with any other part of this repository besides this README in any way by, for example cloning it or looking at its source code or have someone else interact with this repository in any way.

References

The Apache Spark logo was taken from Wikipedia and the Apache Flink logo from .