feat(loadbalancer): Add LoadBalancerType Client Side Weighted Round Robin by altaiezior · Pull Request #7407 · envoyproxy/gateway

altaiezior · 2025-11-02T03:31:56Z

What type of PR is this?

What this PR does / why we need it:
This PR adds a new load balancer type, client-side weighted round robin. This is a load balancing extension available since Envoy 1.32.

https://www.envoyproxy.io/docs/envoy/latest/api-v3/extensions/load_balancing_policies/client_side_weighted_round_robin/v3/client_side_weighted_round_robin.proto

Which issue(s) this PR fixes:

Fixes #7305

Release Notes: Yes/No

altaiezior · 2025-11-02T03:34:49Z

@jukie I have added the implementation and also tested it on my local setup

PS: the repo is so easy to contribute to; everything just works with the docs given on the site :)

altaiezior · 2025-11-02T03:37:28Z

Also, I wanted to know whether I should include slow start in client WRR.

I have submitted the proposal in grpc-xds (grpc/proposal#498), and I have also got the proto updated in Envoy.

It is not implemented yet, but I am trying to pick it up this month if my time allows.

altaiezior · 2025-11-02T03:40:44Z

Also, I am unsure how to test this e2e, so I have just included an AI-generated e2e test suite.

The challenge is that we need multiple replicas, with each server responding with a specific header containing rps and cpu_utilisation; traffic is then distributed according to the calculated weight (rps / cpu).

I don't know what the current e2e tests allow, or whether this type of test case is feasible to write.
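To make the weight calculation concrete, here is a minimal sketch of the rps / cpu idea and how it turns into traffic shares. The `LoadReport` type, `Weight`, and `Shares` are hypothetical names for illustration only and are not part of this PR or of Envoy. The optional error penalty term follows the general shape described for weighted round robin in gRPC's gRFC A58, as I understand it, and is an assumption here.

```go
package main

import "fmt"

// LoadReport is a hypothetical per-backend sample; the field names are
// illustrative and not taken from the PR.
type LoadReport struct {
	RPS            float64 // requests per second reported by the backend
	EPS            float64 // errors per second reported by the backend
	CPUUtilization float64 // CPU utilization, typically in [0, 1]
}

// Weight returns rps / cpu. When errorPenalty > 0, the error rate
// (eps / rps) scaled by the penalty is added to the utilization first.
// It returns 0 when a usable weight cannot be computed.
func Weight(r LoadReport, errorPenalty float64) float64 {
	if r.RPS <= 0 {
		return 0
	}
	utilization := r.CPUUtilization
	if errorPenalty > 0 && r.EPS > 0 {
		utilization += errorPenalty * r.EPS / r.RPS
	}
	if utilization <= 0 {
		return 0
	}
	return r.RPS / utilization
}

// Shares normalizes the weights into the fraction of traffic each backend
// would receive. Backends without a usable weight get a zero share here; a
// real balancer would fall back to some default weight instead.
func Shares(reports []LoadReport, errorPenalty float64) []float64 {
	weights := make([]float64, len(reports))
	total := 0.0
	for i, r := range reports {
		weights[i] = Weight(r, errorPenalty)
		total += weights[i]
	}
	shares := make([]float64, len(reports))
	if total == 0 {
		return shares
	}
	for i, w := range weights {
		shares[i] = w / total
	}
	return shares
}

func main() {
	reports := []LoadReport{
		{RPS: 100, CPUUtilization: 0.50}, // busier backend
		{RPS: 100, CPUUtilization: 0.25}, // less loaded backend
	}
	for i, s := range Shares(reports, 0) {
		fmt.Printf("backend %d share: %.2f\n", i, s)
	}
}
```

With equal rps, the backend reporting half the CPU utilization gets twice the weight, so an e2e test along these lines would expect roughly two thirds of requests to land on it.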

api/v1alpha1/loadbalancer_types.go

codecov · 2025-11-04T00:45:46Z

Codecov Report

❌ Patch coverage is 73.49398% with 22 lines in your changes missing coverage. Please review.
✅ Project coverage is 73.81%. Comparing base (5c2075b) to head (53f1473).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| internal/xds/translator/cluster.go | 58.53% | 16 Missing and 1 partial ⚠️ |
| internal/gatewayapi/backendtrafficpolicy.go | 75.00% | 2 Missing and 1 partial ⚠️ |
| internal/gatewayapi/clustersettings.go | 92.30% | 1 Missing and 1 partial ⚠️ |

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #7407   +/-   ##
=======================================
  Coverage   73.81%   73.81%           
=======================================
  Files         241      241           
  Lines       36608    36688   +80     
=======================================
+ Hits        27021    27082   +61     
- Misses       7681     7698   +17     
- Partials     1906     1908    +2

altaiezior · 2025-11-04T14:39:41Z

Also, I wanted to know whether I should include slow start in client WRR.

I have submitted the proposal in grpc-xds (grpc/proposal#498), and I have also got the proto updated in Envoy.

It is not implemented yet, but I am trying to pick it up this month if my time allows.

I have started the implementation of slow_start_config and locality LB config with WRR as well; if possible, I would also like to include them in the gateway implementation.

envoyproxy/envoy#41841

jukie · 2025-11-13T17:09:33Z

We wait to add features here until they've made it into a full envoy release. The flow would be getting this lb support added for 1.7 and if your envoy changes get merged we can add that support to gateway in 1.8.

Let's keep the scope of this PR to what's currently available and we can always include additional features in a follow-up. Are you able to make the suggested changes or can you join the contributors call next week to discuss?

altaiezior · 2025-11-14T06:19:38Z

Sure. I had just paused because the other changes were also approved, but I understand; I will make the changes as suggested and try to complete them by today or tomorrow, @jukie.

altaiezior · 2025-11-17T00:15:38Z

@jukie I have made the respective changes

internal/ir/xds.go

api/v1alpha1/loadbalancer_types.go

internal/ir/xds_test.go

jukie · 2025-11-27T01:38:55Z

Overall looks good, just a few more comments @anuragagarwal561994! Thanks for adding this and make sure to run gen-check before pushing.

Sorry for the delayed review on this. I'll prioritize helping you with this next week.

api/v1alpha1/loadbalancer_types.go

api/v1alpha1/loadbalancer_types.go

arkodg · 2026-02-03T04:40:04Z

test/e2e/tests/load_balancing.go

+
+		gwAddr := kubernetes.GatewayAndRoutesMustBeAccepted(t, suite.Client, suite.TimeoutConfig, suite.ControllerName, kubernetes.NewGatewayRef(gwNN), &gwapiv1.HTTPRoute{}, false, routeNN)
+
+		t.Run("traffic should be split roughly evenly (defaults to equal weights without ORCA)", func(t *testing.T) {


Can we test the feature in the e2e? I.e., have the backend craft an endpoint-load-metrics response header and use that in LB decision making.

@arkodg we discussed this in an earlier comment: it will take some time to build and can be done separately, because it also requires changes to the current echo application.

#7407 (comment)

Thanks. IMO, let's track this with a GH issue.
Some inspiration:

cat >"${WORKDIR}/backend.py" <<'PY'
#!/usr/bin/env python3
import os
import time
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlparse, parse_qs

BACKEND_ID = os.environ.get("ORCA_ID", "backend")
DEFAULT_MEM = os.environ.get("ORCA_MEM_UTIL", "0.5")
DEFAULT_CPU = os.environ.get("ORCA_CPU_UTIL", "0.1")
DEFAULT_EPS = os.environ.get("ORCA_EPS", "0.0")
DEFAULT_RPS_FRACTIONAL = os.environ.get("ORCA_RPS_FRACTIONAL", "1.0")


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        qs = parse_qs(urlparse(self.path).query)
        mem = qs.get("mem", [DEFAULT_MEM])[0]
        cpu = qs.get("cpu", [DEFAULT_CPU])[0]
        eps = qs.get("eps", [DEFAULT_EPS])[0]
        rps_fractional = qs.get("rps_fractional", [DEFAULT_RPS_FRACTIONAL])[0]

        # ORCA native HTTP text encoding.
        orca_header = (
            "TEXT "
            f"cpu_utilization={cpu}, "
            f"mem_utilization={mem}, "
            f"eps={eps}, "
            f"rps_fractional={rps_fractional}"
        )
        body = (
            f"{BACKEND_ID} mem={mem} cpu={cpu} eps={eps} "
            f"rps_fractional={rps_fractional}\n"
        )

        self.send_response(200)
        self.send_header("content-type", "text/plain")
        # This is the ORCA header Envoy consumes.
        self.send_header("endpoint-load-metrics", orca_header)
        self.end_headers()
        self.wfile.write(body.encode("utf-8"))

    def log_message(self, fmt, *args):
        # Keep logs readable in the demo.
        now = time.strftime("%H:%M:%S")
        print(f"[{now}] {BACKEND_ID} " + fmt % args)


if __name__ == "__main__":
    port = int(os.environ.get("PORT", "18080"))
    print(f"starting {BACKEND_ID} on {port}")
    HTTPServer(("127.0.0.1", port), Handler).serve_forever()
PY
chmod +x "${WORKDIR}/backend.py"
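Since the e2e backends in this repo are Go, the same idea could be sketched as a Go handler. This is only an illustration under stated assumptions: `orcaHeaderValue`, `queryOr`, and `handler` are hypothetical names, nothing here is part of the existing echo application, and the header value simply mirrors the `TEXT key=value, ...` format used by the Python script above.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// orcaHeaderValue formats the metrics in the same "TEXT key=value, ..." form
// that the Python script above sends in the endpoint-load-metrics header.
func orcaHeaderValue(cpu, mem, eps, rpsFractional string) string {
	return fmt.Sprintf(
		"TEXT cpu_utilization=%s, mem_utilization=%s, eps=%s, rps_fractional=%s",
		cpu, mem, eps, rpsFractional,
	)
}

// queryOr returns the query parameter named key, or def when it is absent.
func queryOr(r *http.Request, key, def string) string {
	if v := r.URL.Query().Get(key); v != "" {
		return v
	}
	return def
}

// handler is a hypothetical backend endpoint: each request may override the
// reported metrics through query parameters, otherwise defaults are used.
func handler(w http.ResponseWriter, r *http.Request) {
	value := orcaHeaderValue(
		queryOr(r, "cpu", "0.1"),
		queryOr(r, "mem", "0.5"),
		queryOr(r, "eps", "0.0"),
		queryOr(r, "rps_fractional", "1.0"),
	)
	w.Header().Set("content-type", "text/plain")
	w.Header().Set("endpoint-load-metrics", value)
	fmt.Fprintln(w, "ok")
}

func main() {
	// Exercise the handler in-process instead of opening a real port.
	rec := httptest.NewRecorder()
	req := httptest.NewRequest(http.MethodGet, "/?cpu=0.9", nil)
	handler(rec, req)
	fmt.Println(rec.Header().Get("endpoint-load-metrics"))
}
```

Running several replicas with different default CPU values would give the load balancer distinct weights to act on, which is what the tracked e2e follow-up would need.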

api/v1alpha1/loadbalancer_types.go

arkodg · 2026-02-08T18:14:34Z

Hey @altaiezior, thanks for patiently addressing all the comments; the PR looks good!
Can you address the final lint issue? Then we should be good to get this in.

altaiezior · 2026-02-09T07:02:41Z

@zirain I am also not able to run the e2e test case locally because of the above issues. I have made the changes to include the e2e test case for WRR as well, but these seem to be issues with the master branch itself. Let me know once this is fixed so I can pull the latest version and fix things locally too.

arkodg · 2026-02-15T20:04:33Z

hey @altaiezior can you rebase again, looks like there's another conflict :(
@envoyproxy/gateway-maintainers can we prioritize getting this PR in to avoid more conflicts

jukie · 2026-02-15T20:16:59Z

@altaiezior could you add a release note?

jukie · 2026-02-15T20:18:52Z

internal/gatewayapi/clustersettings.go

+		backendUtilization := policy.LoadBalancer.BackendUtilization
+		if backendUtilization != nil {
+			if backendUtilization.BlackoutPeriod != nil {
+				if d, err := time.ParseDuration(string(*backendUtilization.BlackoutPeriod)); err == nil {


Can you add full error handling for this and the other options?
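One possible shape for this, as a rough sketch only: parse each optional duration through a small helper that names the offending field, and collect the errors instead of silently skipping invalid values. The `settings` type, `parseOptionalDuration`, `buildSettings`, and the `weightUpdatePeriod` field name are hypothetical and used for illustration; only `blackoutPeriod` appears in the diff above.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// settings is a hypothetical stand-in for the translated IR fields.
type settings struct {
	BlackoutPeriod     *time.Duration
	WeightUpdatePeriod *time.Duration // hypothetical second option
}

// parseOptionalDuration returns (nil, nil) when the field is unset, and an
// error that names the field when the value cannot be parsed.
func parseOptionalDuration(field string, raw *string) (*time.Duration, error) {
	if raw == nil {
		return nil, nil
	}
	d, err := time.ParseDuration(*raw)
	if err != nil {
		return nil, fmt.Errorf("invalid %s %q: %w", field, *raw, err)
	}
	return &d, nil
}

// buildSettings parses every option and reports all failures together, so an
// invalid value is surfaced to the caller rather than dropped.
func buildSettings(blackout, weightUpdate *string) (*settings, error) {
	var errs error

	b, err := parseOptionalDuration("blackoutPeriod", blackout)
	errs = errors.Join(errs, err)

	w, err := parseOptionalDuration("weightUpdatePeriod", weightUpdate)
	errs = errors.Join(errs, err)

	if errs != nil {
		return nil, errs
	}
	return &settings{BlackoutPeriod: b, WeightUpdatePeriod: w}, nil
}

func main() {
	bad := "ten seconds"
	if _, err := buildSettings(&bad, nil); err != nil {
		fmt.Println("error:", err)
	}

	good := "10s"
	s, err := buildSettings(&good, nil)
	if err == nil {
		fmt.Println("blackout:", *s.BlackoutPeriod)
	}
}
```

`errors.Join` ignores nil values and returns nil when every argument is nil, so the happy path needs no special casing.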

jukie · 2026-02-15T20:20:21Z

internal/xds/translator/cluster.go

 		}
+	case args.loadBalancer.BackendUtilization != nil:
+		cswrr := &cswrrv3.ClientSideWeightedRoundRobin{}
+		if v := args.loadBalancer.BackendUtilization; v != nil {


v is already guaranteed to be non-nil due to the case check
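A small sketch of the point, using hypothetical stand-in types rather than the real translator structs: inside a `switch` case whose condition is the nil check, the pointer can be used directly without a second guard.

```go
package main

import "fmt"

// backendUtilization and loadBalancer are hypothetical stand-ins for the IR
// types referenced in the diff above.
type backendUtilization struct {
	BlackoutPeriod string
}

type loadBalancer struct {
	BackendUtilization *backendUtilization
}

// describe shows that the case condition already proves the pointer is
// non-nil, so the body needs no additional nil check.
func describe(lb *loadBalancer) string {
	switch {
	case lb == nil:
		return "default"
	case lb.BackendUtilization != nil:
		v := lb.BackendUtilization // safe: guarded by the case condition
		return "backend-utilization blackout=" + v.BlackoutPeriod
	default:
		return "other"
	}
}

func main() {
	lb := &loadBalancer{BackendUtilization: &backendUtilization{BlackoutPeriod: "10s"}}
	fmt.Println(describe(lb))
	fmt.Println(describe(&loadBalancer{}))
}
```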

jukie · 2026-02-15T20:21:29Z

api/v1alpha1/loadbalancer_types.go

+	// Note: In the internal IR/XDS configuration this value is converted back to a
+	// floating point multiplier (value / 100.0).


nit: can probably remove this note

jukie · 2026-02-15T20:25:10Z

api/v1alpha1/loadbalancer_types.go

+	// Defaults to false.
+	// +optional
+	// +kubebuilder:default=false
+	KeepResponseHeaders *bool `json:"keepResponseHeaders,omitempty"`


Is this implemented? I don't see handling for it. Not opposed to adding this but there's already fields for header management so it could be good enough to mention or include in the docs follow-up.

Hey, I had requested this, to avoid having the user do this manually.

altaiezior · 2026-02-16T10:32:23Z

Sure @jukie @arkodg let me try and close them by today / tomorrow

altaiezior · 2026-03-04T01:12:58Z

@jukie I will only be able to pick up these changes next week, as I am travelling and have been catching up with my work lately.

altaiezior requested a review from a team as a code owner November 2, 2025 03:31

altaiezior force-pushed the client-wrr branch from 9a814bd to 573a483 Compare November 2, 2025 03:33

jukie reviewed Nov 2, 2025

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

jukie reviewed Nov 2, 2025

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

zirain reviewed Nov 3, 2025

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

altaiezior force-pushed the client-wrr branch from 5f3d64b to ff2a353 Compare November 3, 2025 19:23

jukie reviewed Nov 3, 2025

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

altaiezior force-pushed the client-wrr branch from f926f90 to c7ecca3 Compare November 16, 2025 12:07

anurag.ag added 6 commits November 17, 2025 05:48

Add support for ClientSideWeightedRoundRobin load balancer policy in Gateway CRDs, ensuring configurable parameters and validation rules are integrated. Includes e2e test for validation.

971c3f1

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

Adds / Updates test cases for client wrr

a9d01f0

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

Fix: gen-check ci, coverage-check

63e8b2f

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

Remove enableOOBLoadReport and oobReportingPeriod fields from gateway CRDs and related configurations. Update associated test data and documentation.

522534b

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

Remove enableOOBLoadReport and oobReportingPeriod fields from ClientSideWeightedRoundRobin configuration, update affected tests and CRDs.

4512f9d

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

Update: Change ErrorUtilizationPenalty type from float to integer across Gateway CRDs, configuration files, and related tests. Adjust documentation to reflect percentage-based representation.

dd5d285

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

altaiezior force-pushed the client-wrr branch from a0471e0 to dd5d285 Compare November 17, 2025 00:18

Anurag Agarwal and others added 2 commits November 26, 2025 11:36

Merge branch 'main' into client-wrr

1101c1d

Merge branch 'main' into client-wrr

14d1d59

jukie reviewed Nov 27, 2025

internal/ir/xds.go (outdated, resolved)

jukie reviewed Nov 27, 2025

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

jukie reviewed Nov 27, 2025

internal/ir/xds_test.go (outdated, resolved)

arkodg reviewed Dec 2, 2025

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

anurag.ag added 2 commits January 29, 2026 05:41

Update: Add detailed documentation and configuration examples for Backend Utilization (ORCA) load balancing in Gateway CRDs and related docs. Refine header handling and metric formats.

7d7528f

Signed-off-by: anurag.ag <anuragagarwal561994@users.noreply.github.com>

Merge branch 'main' into client-wrr

870c52c

arkodg reviewed Jan 30, 2026

api/v1alpha1/loadbalancer_types.go (resolved)

arkodg reviewed Jan 30, 2026

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

Merge remote-tracking branch 'upstream/main' into client-wrr

1fa3ff3

altaiezior force-pushed the client-wrr branch from 848b92f to 9a8f647 Compare February 2, 2026 22:49

altaiezior added 2 commits February 3, 2026 04:22

Update: Rename ErrorUtilizationPenalty to `ErrorUtilizationPenaltyPercent` across API, tests, and internal logic for clarity and precision. Adjust related documentation and validations.

c3e06cb

Signed-off-by: anurag.ag <6075379+altaiezior@users.noreply.github.com>

Update: Add support for removeResponseHeaders in `BackendUtilization`. Adjust logic, tests, and documentation to highlight default value and ORCA header removal.

9c9dc20

Signed-off-by: anurag.ag <6075379+altaiezior@users.noreply.github.com>

altaiezior force-pushed the client-wrr branch from 621d26d to 9c9dc20 Compare February 2, 2026 22:53

arkodg reviewed Feb 3, 2026

arkodg reviewed Feb 4, 2026

api/v1alpha1/loadbalancer_types.go (outdated, resolved)

altaiezior added 2 commits February 5, 2026 01:35

Merge branch 'main' into client-wrr

77a5c4b

Update: Replace removeResponseHeaders with keepResponseHeaders across API, internal logic, and templates. Adjust defaults, documentation, and validations to reflect behavior change.

ac1d840

Signed-off-by: anurag.ag <6075379+altaiezior@users.noreply.github.com>

altaiezior force-pushed the client-wrr branch from 7d50f1b to ac1d840 Compare February 4, 2026 21:13

Merge branch 'main' into client-wrr

aa37fcb

Signed-off-by: Arko Dasgupta <arkodg@users.noreply.github.com>

arkodg added this to the v1.8.0-rc.1 Release milestone Feb 8, 2026

zirain previously approved these changes Feb 9, 2026

Refactor: Reorganize import order in cluster_test.go for consistency.

53f1473

Signed-off-by: anurag.ag <6075379+altaiezior@users.noreply.github.com>

altaiezior dismissed zirain’s stale review via 53f1473 February 9, 2026 05:54

altaiezior force-pushed the client-wrr branch from f9d0cbd to 53f1473 Compare February 9, 2026 05:54

Merge branch 'main' into client-wrr

5cb433d

jukie reviewed Feb 15, 2026

jukie requested changes Feb 15, 2026

jukie mentioned this pull request Feb 16, 2026

feat: Add WeightedZones to PreferLocalZones #7251

Merged


5 participants