Unify the validation pipeline for full and partial data columns by aarshkshah1992 · Pull Request #16465 · OffchainLabs/prysm

aarshkshah1992 · 2026-03-04T14:02:52Z

What type of PR is this?

Feature

What does this PR do? Why is it needed?

This PR unifies the validation pipelines for full data columns and partial data columns so that both satisfy the same set of validation requirements once the partial data column is full. It also ensures that a verified data column can be obtained from a partial data column only through the verification package, and only after all the validation requirements have been satisfied.

We account for the fact that partial data column headers and cells arrive separately and in incremental parts.
Also, cells that we read from the EL are trusted and do not need to be verified.

Acknowledgements

I have read CONTRIBUTING.md.
I have included a uniquely named changelog fragment file.
I have added a description with sufficient context for reviewers to understand this PR.
I have tested that my changes work as expected and I added a testing plan to the PR description (if applicable).

aarshkshah1992 · 2026-03-05T14:08:07Z

@kasey This is now ready for review.

kasey · 2026-03-05T20:34:24Z

consensus-types/blocks/partialdatacolumn.go

-	return NewVerifiedRODataColumn(rodc), true
+// IsComplete returns true if all cells are now present in this column.
+func (p *PartialDataColumn) IsComplete() bool {
+	return uint64(len(p.KzgCommitments)) == p.Included.Count()


I would reverse these casts to len(p.KzgCommitments) == int(p.Included.Count()). It's weird for Count to return a uint64 in the first place; int is the standard for counting things.

In fact I just made a note to self to modify Count to return an int - looking at all current usages, it is either compared to another instance of Count, or to an int that has to be cast to uint64 to accommodate this odd choice.
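A minimal sketch of the suggested cast direction, for illustration only. The `bitmap` type is a hypothetical stand-in for whatever type `p.Included` actually has (it simply counts set bits in a byte slice), and `partialDataColumn` keeps only the two fields the comparison needs.

```go
package main

import (
	"fmt"
	"math/bits"
)

// bitmap is a hypothetical stand-in for the type of PartialDataColumn.Included.
type bitmap []byte

// Count returns the number of set bits as a uint64, mirroring the return
// type discussed in the comment above.
func (b bitmap) Count() uint64 {
	var n uint64
	for _, x := range b {
		n += uint64(bits.OnesCount8(x))
	}
	return n
}

// partialDataColumn keeps only the fields used by IsComplete.
type partialDataColumn struct {
	KzgCommitments [][]byte
	Included       bitmap
}

// IsComplete casts the count to int so it compares directly against len,
// instead of casting len to uint64.
func (p *partialDataColumn) IsComplete() bool {
	return len(p.KzgCommitments) == int(p.Included.Count())
}

func main() {
	p := &partialDataColumn{
		KzgCommitments: make([][]byte, 3),
		Included:       bitmap{0b00000011},
	}
	fmt.Println(p.IsComplete()) // two of three cells are marked present
	p.Included[0] |= 0b00000100
	fmt.Println(p.IsComplete()) // all three cells are marked present
}
```

If `Count` itself returned an `int`, as the note above proposes, the cast would disappear entirely.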

kasey · 2026-03-05T20:44:32Z

beacon-chain/verification/error.go

 	// errSidecarParentNotSeen means RequireSidecarParentSeen failed.
 	errSidecarParentNotSeen = errors.New("parent root has not been seen")
+	// ErrSidecarParentSlotUnavailable means that looking up a sidecar parent's slot failed.
+	ErrSidecarParentSlotUnavailable = errors.New("parent slot unavailable")


The wording here is confusing - there's no case where we know the parent but not the slot - the root cause of not knowing the slot is that the parent isn't in forkchoice. I would change this to ErrSidecarParentUnknown = errors.New("parent not found in forkchoice").

kasey · 2026-03-05T20:58:34Z

beacon-chain/verification/data_column.go

+type PartialColumnVerifier struct {
+	DataColumnsVerifier
+	Column              *blocks.PartialDataColumn
+	verifiedCellByIndex map[uint64]bool


Instead of tracking this with a separate map, can we use pv.Column.Included? That would avoid extra conversion back and forth and the requirement to explicitly call MarkIncludedCellsVerified. Everything is simpler if we can maintain these invariants:

only verified cells are set in pv.Column

pv.Column.Included is updated when those cells are set, so that it also represents the set of verified cells.

I think this suggestion is half-baked because I'm not working through the untrusted cell path yet. One thought is for PartialColumnVerifier to have separate *blocks.PartialDataColumn fields for verified (and/or trusted) and unverified cells. When we verify an unverified PartialDataColumn, we swap the reference so they both point to the same thing.

kasey · 2026-03-06T21:34:59Z

beacon-chain/p2p/partialdatacolumnbroadcaster/partial.go

 	var shouldRepublish bool

-	if ourDataColumn == nil && hasMessage {
+	if ourVerifier == nil && hasMessage {


The logical flow of this method is clunky, and it's very long, due to these giant if statements. I would like to see large portions of this method refactored into a set of smaller methods, with early returns used to telegraph the flow more clearly.

kasey · 2026-03-06T21:41:35Z

beacon-chain/p2p/partialdatacolumnbroadcaster/partial.go

+				log.WithError(err).WithFields(logrus.Fields{
+					"topic":          topicID,
+					"columnIndex":    columnIndex,
+					"numCommitments": len(header.KzgCommitments),


With these big multi-line log statements, I find it helpful to add a logging helper method, as typically the same log fields are used across different cases. Actually I think we could make some changes to rpcWithFrom to make it more ergonomic in multiple ways - I think the type name could also be improved.
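One possible shape for such a helper, as a rough sketch. The `partialColumnLogContext` type and its `logFields` method are hypothetical, and a plain map type stands in for `logrus.Fields` so the example needs no third-party import; only the three field names are taken from the diff.

```go
package main

import "fmt"

// fields stands in for logrus.Fields, which is a map[string]interface{}.
type fields map[string]any

// partialColumnLogContext is a hypothetical holder for the values that the
// log statements in this handler keep repeating.
type partialColumnLogContext struct {
	topicID        string
	columnIndex    uint64
	numCommitments int
}

// logFields builds the shared field set once, so each call site only adds
// what is specific to its own case.
func (c partialColumnLogContext) logFields() fields {
	return fields{
		"topic":          c.topicID,
		"columnIndex":    c.columnIndex,
		"numCommitments": c.numCommitments,
	}
}

func main() {
	ctx := partialColumnLogContext{topicID: "example-topic", columnIndex: 5, numCommitments: 6}
	// With logrus this would be passed to log.WithError(err).WithFields(...).
	fmt.Println(ctx.logFields())
}
```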

aarshkshah1992 added 3 commits March 4, 2026 13:29

bazel fixes and validation pipeline

43cf256

fixes and tests

eadfb59

more logging and bazel fixes

bde83a2

aarshkshah1992 changed the title ~~unify the validation pipeline for full and partial data columns~~ [WIP] Unify the validation pipeline for full and partial data columns Mar 4, 2026

aarshkshah1992 added 4 commits March 4, 2026 18:22

Merge remote-tracking branch 'origin/tests/tests-for-partial-broadcas…

0434040

…ter' into fix/unify-validation-pipeline

changes based on self review

4ad1f87

add docs

707b6da

fix bazel build

9199377

aarshkshah1992 changed the title ~~[WIP] Unify the validation pipeline for full and partial data columns~~ Unify the validation pipeline for full and partial data columns Mar 5, 2026

aarshkshah1992 requested a review from kasey March 5, 2026 12:46

kasey reviewed Mar 5, 2026

kasey reviewed Mar 6, 2026

aarshkshah1992 wants to merge 7 commits into tests/tests-for-partial-broadcaster from fix/unify-validation-pipeline

2 participants
