Skip to content

Commit

Permalink
Merge branch 'master' into Spark3.5.0
Browse files Browse the repository at this point in the history
  • Loading branch information
ghislainfourny authored Oct 24, 2024
2 parents 3ce80c1 + c1bd37e commit eeba107
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 6 deletions.
10 changes: 6 additions & 4 deletions docs/Getting started.md
Original file line number Diff line number Diff line change
Expand Up @@ -78,11 +78,11 @@ If you use Linux, Florian Kellner also kindly contributed an [installation scrip

RumbleDB requires an Apache Spark installation on Linux, Mac or Windows.

It is straightforward to directly [download it](https://spark.apache.org/downloads.html), unpack it and put it at a location of your choosing. We recommend to pick Spark 3.2.2. Let us call this location SPARK_HOME (it is a good idea, in fact to also define an environment variable SPARK_HOME pointing to the absolute path of this location).
It is straightforward to directly [download it](https://spark.apache.org/downloads.html), unpack it and put it at a location of your choosing. We recommend to pick Spark 3.4.3. Let us call this location SPARK_HOME (it is a good idea, in fact to also define an environment variable SPARK_HOME pointing to the absolute path of this location).

What you need to do then is to add the subdirectory "bin" within the unpacked directory to the PATH variable. On macOS this is done by adding

export SPARK_HOME=/path/to/spark-3.2.2-bin-hadoop3.2
export SPARK_HOME=/path/to/spark-3.4.3-bin-hadoop3.2
export PATH=$SPARK_HOME/bin:$PATH

(with SPARK_HOME appropriately set to match your unzipped Spark directory) to the file .zshrc in your home directory, then making sure to force the change with
Expand Down Expand Up @@ -111,9 +111,11 @@ Like Spark, RumbleDB is just a download and no installation is required.

In order to run RumbleDB, you simply need to download one of the small .jar files from the [download page](https://github.com/RumbleDB/rumble/releases) and put it in a directory of your choice, for example, right besides your data.

If you use Spark 3.2+, use rumbledb-1.22.0-for-spark-3.2.jar.
If you use Spark 3.4+, use rumbledb-1.22.0-for-spark-3.4.jar.

If you use Spark 3.3+, use rumbledb-1.22.0-for-spark-3.3.jar.
If you use Spark 3.5+, use rumbledb-1.22.0-for-spark-3.5.jar.

If you use Spark 4.0+ (preview), use rumbledb-1.22.0-for-spark-4.0.jar.

These jars do not embed Spark, since you chose to set it up separately. They will work with your Spark installation with the spark-submit command.

Expand Down
4 changes: 2 additions & 2 deletions docs/install.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,9 +7,9 @@ We show here how to install RumbleDB from the github repository if you wish to d
The following software is required:

- [Java SE](http://www.oracle.com/technetwork/java/javase/downloads/index.html) 8 (last tested on OpenJDK 8u251). The version of Java is important, as Spark only works with Java 8 or java 11.
- [Spark](https://spark.apache.org/), version 3.1.2 (for example)
- [Spark](https://spark.apache.org/), version 3.4.3 (for example)
- [Ant](http://www.ant.org/), version 1.11.1
- [ANTLR](http://www.ant.org/), version 4.8 (supplied in our repository)
- [ANTLR](http://www.ant.org/), version 4.9.3 (supplied in our repository)
- [Maven](https://maven.apache.org/) 3.6.0

Important: the ANTLR version varies with the Spark version, because Spark is also shipped with an ANTLR runtime (example: Spark 3.0 and 3.1 is with ANTLR 4.7, Spark 3.2 with ANTLR 4.8). The ANTLR runtime MUST match the ANTLR generator used to generate the RumbleDB jar file.
Expand Down

0 comments on commit eeba107

Please sign in to comment.