[SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception #54007

cloud-fan · 2026-01-27T12:46:02Z

What changes were proposed in this pull request?

This PR refactors how doTryWithCallerStacktrace and getTryWithCallerStacktrace handle stacktrace stitching. Instead of adding an OriginalTryStackTraceException as a suppressed exception on the original exception, we now wrap the original exception inside OriginalTryStackTraceException.

Key changes:

OriginalTryStackTraceException now wraps the original exception (as its cause) and stores the pre-computed "below" stacktrace portion
doTryWithCallerStacktrace returns Failure(wrapper) instead of modifying the original exception's suppressed list
getTryWithCallerStacktrace unwraps the original exception, stitches the stacktrace, and throws the original (not the wrapper)

Why are the changes needed?

no suppressed exceptions visible to users

Does this PR introduce any user-facing change?

Yes, exceptions thrown from LazyTry.get will no longer have a suppressed OriginalTryStackTraceException. The stacktrace stitching behavior remains the same.

How was this patch tested?

Existing unit tests in UtilsSuite and LazyTrySuite updated and passing.

Was this patch authored or co-authored using generative AI tooling?

Yes.

github-actions · 2026-01-27T12:46:13Z

JIRA Issue Information

=== Improvement SPARK-55240 ===
Summary: Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception
Assignee: None
Status: Open
Affected: ["4.2.0"]

This comment was automatically generated by GitHub Actions

cloud-fan · 2026-01-27T12:48:08Z

This is kind of followup of #48882

cc @juliuszsompolski @yaooqinn

juliuszsompolski · 2026-01-27T14:19:47Z

core/src/main/scala/org/apache/spark/util/Utils.scala

+      // Stitch the stacktrace: keep the "below" part, replace "above" with current caller
+      originalEx.setStackTrace(


IIUC, at this point, the original above stack trace is lost, and also it's indistinguishable for the user, whether the exception happened here, or was only retrieved and stitched from an earlier call?
I do find it useful for debugging to know if the exception really happened on the stack trace of the current call, or it happened in a different call and it was only stitched to pretend as happening here, while in fact it was retrieved from a Try....

I like the wrapper change that makes it cleaner than searching for OriginalTryStackTraceException among the supressed, but I personally would prefer to preserve and show the information about the original place the exception was thrown... However, I can not think of a different way to add it than again adding it as a suppressed Exception, which would then have both a wrapper and a suppressed exception, so I don't think it would be an improvement to the current state...

The suppressed Exception is not ideal, because it's more the "cause", but we cannot make it the cause, because the original exception might have already been the cause... Having it as suppressed exception was the only way I could find to tag it along without disrupting the original exception...

The more common case is people see the error once and fix the query. It's better to optimize for the common case (no suppressed error is needed).

I agree that it's useful to know if the error was caused previously. Shall we insert a special frame in the error stacktrace to indicate it? e.g.

original error stacktrace ... doTryWithCallerStacktrace (only present if it's not the first invocation) ... current caller stacktrace

They would still see the error message on top and their call stacktrace, and probably ignore the suppressed listing below.
In practical cases where I rely on it I would most likely be connected with a debugger anyway and then can get the original caller trace anyway... But if it's a bug report with no repro but only the log, the suppressed part can be useful in providing some triage... From user perspective it is indeed a longer and uglier error dump, something that @yaooqinn complained about in the past.
For me the suppressed full original trace is useful, and only a small ugliness for a casual end user, but I don't insist.

juliuszsompolski · 2026-01-27T14:24:17Z

core/src/main/scala/org/apache/spark/util/Utils.scala

+      val originalEx = wrapper.getCause
+      // Stitch the stacktrace: keep the "below" part, replace "above" with current caller
+      originalEx.setStackTrace(
+        wrapper.belowStacktrace ++ Thread.currentThread().getStackTrace.drop(1))


if going ahead with it, you don't need to store belowStacktrace at all, only the prefix length. The below part is already in originalEx, and is not changing, only the suffix of the current caller changes.

So

val belowStacktraceLength = origStackTrace.size - commonSuffixLen

in doTryWithCallerStacktrace and here

originalEx.setStackTrace( orignalEx.take(belowStacktraceLength) ++ Thread.currentThread().getStackTrace.drop(1))

should work.

juliuszsompolski · 2026-01-28T10:16:41Z

core/src/main/scala/org/apache/spark/util/Utils.scala

-      // We are modifying and throwing the original exception. It would be better if we could
-      // return a copy, but we can't easily clone it and preserve. If this is accessed from
-      // multiple threads that then look at the stack trace, this could break.


I would preserve this code comment next to stack stitching because it still applies.

…er instead of suppressed exception This PR refactors how `doTryWithCallerStacktrace` and `getTryWithCallerStacktrace` handle stacktrace stitching. Instead of adding an `OriginalTryStackTraceException` as a suppressed exception on the original exception, we now wrap the original exception inside `OriginalTryStackTraceException`. Key changes: - `OriginalTryStackTraceException` now wraps the original exception (as its cause) and stores the pre-computed "below" stacktrace portion - `doTryWithCallerStacktrace` returns `Failure(wrapper)` instead of modifying the original exception's suppressed list - `getTryWithCallerStacktrace` unwraps the original exception, stitches the stacktrace, and throws the original (not the wrapper) Benefits: - Cleaner exception output - no suppressed exceptions visible to users - Simpler code - no need to track first access or manage suppressed exception lifecycle - Original exception type is preserved when thrown

juliuszsompolski

LGTM, small nit.

juliuszsompolski · 2026-01-28T10:28:43Z

core/src/main/scala/org/apache/spark/util/LazyTry.scala

+    val isFirstAccess = !materialized
+    materialized = true
+    Utils.getTryWithCallerStacktrace(tryT, isFirstAccess)


nit: I don't think it matters much as there are some atomicity problems with resetting the trace anyway, but maybe use AtomicBoolean and just call Utils.getTryWithCallerStacktrace(tryT, !materialized.getAndSet(true))

it's not thread-safe anyway, and AtomicBoolean takes more memory.

github-actions bot added the CORE label Jan 27, 2026

cloud-fan changed the title ~~[SPARK-XXXXX][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception~~ [SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception Jan 27, 2026

juliuszsompolski reviewed Jan 27, 2026

View reviewed changes

cloud-fan force-pushed the error branch from d697c6f to 2056caa Compare January 28, 2026 10:11

juliuszsompolski reviewed Jan 28, 2026

View reviewed changes

cloud-fan force-pushed the error branch from 2056caa to e07244f Compare January 28, 2026 10:20

cloud-fan force-pushed the error branch from e07244f to 82c8c11 Compare January 28, 2026 10:22

juliuszsompolski approved these changes Jan 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception #54007

[SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception #54007

cloud-fan commented Jan 27, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 27, 2026 •

edited

Loading

Uh oh!

cloud-fan commented Jan 27, 2026

Uh oh!

juliuszsompolski Jan 27, 2026

Uh oh!

juliuszsompolski Jan 27, 2026

Uh oh!

cloud-fan Jan 28, 2026 •

edited

Loading

Uh oh!

juliuszsompolski Jan 28, 2026

Uh oh!

juliuszsompolski Jan 27, 2026

Uh oh!

juliuszsompolski Jan 28, 2026

Uh oh!

juliuszsompolski left a comment

Uh oh!

juliuszsompolski Jan 28, 2026

Uh oh!

cloud-fan Jan 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		// Stitch the stacktrace: keep the "below" part, replace "above" with current caller
		originalEx.setStackTrace(

[SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception #54007

Are you sure you want to change the base?

[SPARK-55240][CORE] Refactor LazyTry stacktrace handling to use wrapper instead of suppressed exception #54007

Conversation

cloud-fan commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

github-actions bot commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

JIRA Issue Information

Uh oh!

cloud-fan commented Jan 27, 2026

Uh oh!

juliuszsompolski Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

juliuszsompolski Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

cloud-fan Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

juliuszsompolski Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

juliuszsompolski Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

juliuszsompolski Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

juliuszsompolski left a comment

Choose a reason for hiding this comment

Uh oh!

juliuszsompolski Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

cloud-fan Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cloud-fan commented Jan 27, 2026 •

edited

Loading

github-actions bot commented Jan 27, 2026 •

edited

Loading

cloud-fan Jan 28, 2026 •

edited

Loading