You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Here is the pyspark job, but I think any kind of job will meet this issue:
from pyspark.sql import SparkSession
spark = SparkSession.builder.appName("Test").getOrCreate()
df = spark.createDataFrame([("a",)], ["a"])
df.write.parquet("TEST")
df = spark.read.parquet("TEST")
df.selectExpr("ascii(a) AS ascii_value").show()
but the spark-submit hangs on
24/12/13 16:45:59 INFO DAGScheduler: Job 2 finished: showString at NativeMethodAccessorImpl.java:0, took 0.081151 s
24/12/13 16:45:59 INFO CodeGenerator: Code generated in 13.786791 ms
+-----------+
|ascii_value|
+-----------+
| 97|
+-----------+
Then control-C:
^C24/12/13 16:46:22 INFO RapidsBufferCatalog: Closing storage
24/12/13 16:46:24 WARN ProfileWriter: Profiling completed, output written to file:/home/haoyangl/rapids-nsys/[email protected]
24/12/13 16:46:24 WARN ProfilerOnDriver: Profiling: Executor driver ended profiling, profile written to file:/home/haoyangl/rapids-nsys/[email protected]
24/12/13 16:46:24 INFO AwsStorageExecutorPlugin: Shutting down S3 Plugin ...
24/12/13 16:46:24 INFO SparkContext: Invoking stop() from shutdown hook
24/12/13 16:46:24 INFO SparkContext: SparkContext is stopping with exitCode 0.
24/12/13 16:46:24 INFO SparkUI: Stopped Spark web UI at http://spark-haoyang:4040
and everything works fine in the next steps.
However, when adding a stage limit, it does not hang:
Describe the bug
I would like to profile a pyspark job with self-profiler, here's the command I use:
Here is the pyspark job, but I think any kind of job will meet this issue:
but the spark-submit hangs on
Then
control-C
:and everything works fine in the next steps.
However, when adding a stage limit, it does not hang:
The text was updated successfully, but these errors were encountered: