perf: inline numeric fast path in array comparison by He-Pin · Pull Request #693 · databricks/sjsonnet

He-Pin · 2026-04-05T12:54:45Z

Motivation

Array comparison in Jsonnet (e.g., long_array + [1] < long_array + [2]) iterates element-by-element through a while loop, calling compare() recursively for each element. For numeric arrays with 1M+ elements, this means 1M recursive method calls, each going through a 5-branch pattern match (Null, Num, Str, Bool, Arr). This polymorphic dispatch overhead is unnecessary when both elements are the common Val.Num type.

Key Design Decision

Inline type check: Add a Val.Num type match inside the array comparison loop before falling back to recursive compare(), eliminating dispatch overhead for numeric elements
Uses asDouble (not raw extraction): Preserves NaN error behavior — std.log(-1) etc. can produce NaN in Val.Num, and asDouble correctly throws "not a number", matching the official C++ jsonnet behavior. This preserves correct behavior for edge cases like std.log(-1) which creates Val.Num(NaN) via MathModule
java.lang.Double.compare(): Used instead of compareTo() to avoid autoboxing overhead

Modification

sjsonnet/src/sjsonnet/Evaluator.scala — compare() method, Val.Arr branch:

Added inline Val.Num type check before recursive compare() call
Falls back to full compare() for non-numeric elements (strings, booleans, nested arrays)

Benchmark Results

JMH (same-session A/B, single-fork)

Benchmark	Master (ms/op)	Optimized (ms/op)	Change
comparison (1M numeric array)	22.9	20.3	-11.2% ✅
comparison2 (scalar, unaffected)	71.0	~71.0	~0%
bench.02	46.1	44.0	-4.5%
realistic2	70.5	69.3	-1.7%

Hyperfine (Scala Native vs jrsonnet, comparison.jsonnet)

Binary	Mean (ms)	vs jrsonnet
sjsonnet master	29.1	2.13× slower
sjsonnet optimized	30.1	2.21× slower
jrsonnet v0.5.0-pre98	13.6	1.00×

Native shows neutral results — the optimization primarily benefits JVM where JIT dispatch overhead is more significant than in AOT-compiled code.

Analysis

The -11.2% JMH improvement on the comparison benchmark (1M-element numeric array comparison) is consistent across multiple runs (-8.6% to -11.2% range). The optimization works by:

Eliminating recursive dispatch: Each element no longer requires a full method call + 5-branch pattern match
Enabling JIT specialization: The inner loop becomes monomorphic for numeric arrays, allowing better branch prediction
Direct comparison: java.lang.Double.compare() is a single JVM intrinsic

The comparison2 benchmark (scalar i < j in comprehensions) is unaffected because it uses the top-level Val.Num case in compare(), not the array branch.

References

Upstream: jit branch commit 62437d8f

Result

Targeted optimization for numeric array comparison with no regressions. Reduces the gap with jrsonnet on the comparison benchmark by avoiding unnecessary polymorphic dispatch in the hot loop.

Inline Val.Num type check in the array comparison while loop to avoid polymorphic recursive compare() dispatch per element. For numeric array comparisons (e.g. 1M elements), this eliminates 1M recursive method calls with 5-branch pattern matching overhead. Uses asDouble (not raw extraction) to preserve NaN error behavior — std.log(-1) etc. can produce NaN in Val.Num, and asDouble correctly throws 'not a number', matching the official C++ jsonnet behavior. Uses java.lang.Double.compare() instead of compareTo() to avoid autoboxing overhead. Upstream: jit branch commit 62437d8

He-Pin marked this pull request as ready for review April 5, 2026 13:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: inline numeric fast path in array comparison#693

perf: inline numeric fast path in array comparison#693
He-Pin wants to merge 1 commit intodatabricks:masterfrom
He-Pin:perf/inline-numeric-array-compare

He-Pin commented Apr 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

He-Pin commented Apr 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Key Design Decision

Modification

Benchmark Results

JMH (same-session A/B, single-fork)

Hyperfine (Scala Native vs jrsonnet, comparison.jsonnet)

Analysis

References

Result

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

He-Pin commented Apr 5, 2026 •

edited

Loading