Skip to content

Latest commit

 

History

History
21 lines (18 loc) · 714 Bytes

README.md

File metadata and controls

21 lines (18 loc) · 714 Bytes

Apache Spark demo example (Python)

This is a starter app for the Apache Spark Python template.

Running the application on a Spark standalone cluster via Docker

To run the application, execute the following steps:

  1. Setup a Spark cluster as described on http://github.com/big-data-europe/docker-spark by just running:
    git https://github.com/big-data-europe/docker-spark.git
    cd docker-spark
    docker-compose up -d
  2. Build the Docker image:
    bash build.sh python-example examples/python
  3. Run the Docker container:
    docker run --rm --network dockerspark_default --name pyspark-example bde2020/spark-python-example:3.3.0-hadoop3.3