-
Notifications
You must be signed in to change notification settings - Fork 29k
Pull requests: apache/spark
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[SPARK-55038][SQL] Fix wrong results for array_agg(DISTINCT) with AQE…
SQL
#54021
opened Jan 28, 2026 by
anirudh83
Loading…
[SPARK-55246][SS] Add Test for Pyspark TWS and TWSInPandas and Fix StatePartitionAllColumnFamiliesWriter Bug
PYTHON
SQL
STRUCTURED STREAMING
#54019
opened Jan 28, 2026 by
zifeif2
Loading…
[SPARK-55245][PYTHON][PS][TESTS] Fix all timestamp freq usage from M to ME
PANDAS API ON SPARK
PYTHON
#54018
opened Jan 28, 2026 by
gaogaotiantian
Loading…
[SPARK-55225][PYTHON][PS] Restore to the original dtype for Datetime
PANDAS API ON SPARK
PYTHON
#54017
opened Jan 28, 2026 by
gaogaotiantian
Loading…
[SPARK-55243][CONNECT] Allow setting binary headers via the -bin suffix in the Scala Connect client
CONNECT
SQL
#54016
opened Jan 27, 2026 by
dillitz
Loading…
[SPARK-55244][PYTHON][PS] Use np.nan as default value for pandas string types
PANDAS API ON SPARK
PYTHON
#54015
opened Jan 27, 2026 by
gaogaotiantian
Loading…
[SPARK-55228][SPARK-55230][SQL] Implement Dataset.zipWithIndex in Scala API
CONNECT
SQL
#54014
opened Jan 27, 2026 by
fangchenli
Loading…
Upgrade Spark 4.1 + Affirm Specific Change
BUILD
CORE
KUBERNETES
PYTHON
#54012
opened Jan 27, 2026 by
li-isabella
•
Draft
[SPARK-55031][SQL] Add vector avg/sum aggregation function expressions
SQL
#54011
opened Jan 27, 2026 by
zhidongqu-db
Loading…
[SPARK-54943][PYTHON][TESTS] Add test coverage for
pa.Array.cast with safe=False
CORE
PYTHON
#54010
opened Jan 27, 2026 by
Yicong-Huang
Loading…
[SPARK-46167][PS] Add axis implementation to DataFrame.rank
PANDAS API ON SPARK
PYTHON
#54009
opened Jan 27, 2026 by
devin-petersohn
Loading…
[SPARK-54887] Add previously removed legacy error class back in
CONNECT
SQL
#54008
opened Jan 27, 2026 by
garlandz-db
Loading…
[SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception
CORE
#54007
opened Jan 27, 2026 by
cloud-fan
Loading…
[SPARK-55239][CONNECT][YARN] Allow to launch SparkConnectServer in YARN cluster mode
CORE
#54004
opened Jan 27, 2026 by
sarutak
Loading…
[SPARK-55238][Geo][SQL] Move Geo SRS mapping logic from
main/scala to main/java
SQL
#54003
opened Jan 27, 2026 by
uros-db
Loading…
[SPARK-55237][SQL] Suppress annoying messages when looking up nonexistent DBs
#54002
opened Jan 27, 2026 by
pan3793
Loading…
[SPARK-55236][CORE] Address unexpected exception in some CoarseGrainedExecutorBackendSuite test cases
CORE
#54001
opened Jan 27, 2026 by
ChuckLin2025
Loading…
[SPARK-47996][PS] support cross merge in pandas API
PANDAS API ON SPARK
PYTHON
#54000
opened Jan 27, 2026 by
fangchenli
Loading…
[SPARK-55235][PYTHON][TESTS] Refactor tests for pandas udf input type coercion
BUILD
DOCS
PYTHON
SQL
#53999
opened Jan 27, 2026 by
zhengruifeng
Loading…
[SPARK-55224][PYTHON] Use Spark DataType as ground truth in Pandas-Arrow serialization
CORE
PYTHON
SQL
#53992
opened Jan 27, 2026 by
Yicong-Huang
Loading…
[WIP][SPARK-55221][PYTHON] Add
to_arrow transformer and remove _create_struct_array
CORE
PYTHON
SQL
#53989
opened Jan 26, 2026 by
Yicong-Huang
•
Draft
[MINOR][PYTHON][TESTS] Consolidate DataStreamReader.name() tests into test_streaming.py
BUILD
PYTHON
STRUCTURED STREAMING
#53988
opened Jan 26, 2026 by
ericm-db
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-12-27.