-
Notifications
You must be signed in to change notification settings - Fork 221
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Custom Spark 3.5.3 Kernel #1392
Comments
I did some changes in the launch_ipykernel.py file. I removed all codes related with background thread in |
Latency in starting spark notebook is relating to web socket timeout. I found the error shown below in EnterpriseGateway logs:
The error doesn't affect spark notebook, notebook works as expected. So I tried to find how to decrease timeout seconds. I found a environment variable described as below in documentation:
I set EG_WS_PING_INTERVAL_SECS to 1, then timeout seconds decreased to 30 seconds. When I tried 0 second, It waited 90 seconds again. The variable configures tornado web application parameters and doesn't work like described in documentation. Finally I changed related parameters directly on Parameters related to websocket timeout:
|
Hello everyone. I am using Jupyter Enterprise Gateway with PySpark sessions on Kubernetes. The elyra/kernel-spark-py:3.2.3 image works as expected.
I modified the image and rebuilt it to upgrade the Spark version to 3.5.3. When I start this kernel through JEG, the Spark driver and executor pods are created and run as expected. However, within the notebook, the spark variable is stuck in the
WaitingForSparkSessionToBeInitialized
value. If I redefine it withspark=SparkSession.builder.getOrCreate()
, it doesn't give an error and works.The text was updated successfully, but these errors were encountered: