AER-4020 Added configuration option to set max number of workers available#118
Conversation
When scaling workers, the maxCapacityUse value can limit the dispatching of tasks: because the capacity a specific queue may use is based on the actual number of workers, tasks might never be sent to workers, and therefore the load on the workers might never exceed the trigger value for adding more workers. By setting maxWorkersAvailable to the value the system can scale to, tasks will be scheduled as if the full capacity were available. So even when a low number of workers is available, more tasks would still be scheduled, which will trigger the scaling threshold.
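As a sketch of how this option might be set, assuming a properties-style configuration file (the exact key name and format are assumptions, not taken from this PR):

```properties
# Hypothetical configuration: scheduling capacity checks are based on the
# scaled-out maximum of 40 workers instead of the number currently running.
broker.maxWorkersAvailable = 40
```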
SerhatG
left a comment
Ah, I didn't get this directly... am I correct to assume this would be perfect for queues (chunkers come to mind) where the maxCapacityUse is lower than the scaling threshold? By setting this you can get it to scale out, even though only this relatively small queue is 'full'.
Example scenario to see if I understand the change
connect_gml_export is set to 40% by default.
For a production environment where it starts with 16 chunkers in the morning, 40% would be 6.4 chunkers. But this will never trigger a scale out if this is the only type of chunkers running at that moment, as the threshold for starting a scale out is 50% for chunkers.
Instead if say, the maximum amount of chunkers is fictitiously 100, you could set maxWorkersAvailable to 40, which is what it is allowed to use for a fully scaled out environment.
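The arithmetic of the scenario above can be checked with a small sketch. All values come from the example (16 running chunkers, 40% `maxCapacityUse`, 50% scale-out threshold, `maxWorkersAvailable` of 40); the class and method names are illustrative, not the PR's actual code:

```java
public class ScalingExample {
    // Values from the example scenario above; names are illustrative only.
    static final double MAX_CAPACITY_USE = 0.40;     // connect_gml_export queue cap
    static final double SCALE_OUT_THRESHOLD = 0.50;  // chunker scale-out trigger
    static final int RUNNING_WORKERS = 16;
    static final int MAX_WORKERS_AVAILABLE = 40;

    // Maximum tasks the queue may claim, given the worker count used as the capacity basis.
    static int maxTasks(int capacityBasis) {
        return (int) Math.floor(capacityBasis * MAX_CAPACITY_USE);
    }

    public static void main(String[] args) {
        // Capacity based on actual workers: 40% of 16 = 6 tasks, so utilisation
        // tops out at 6/16 = 37.5% and the 50% threshold is never crossed.
        int oldCap = maxTasks(RUNNING_WORKERS);
        System.out.println(oldCap / (double) RUNNING_WORKERS >= SCALE_OUT_THRESHOLD); // false

        // Capacity based on maxWorkersAvailable: 40% of 40 = 16 tasks, so all 16
        // running workers can be busy (100%) and the scale-out triggers.
        int newCap = maxTasks(MAX_WORKERS_AVAILABLE);
        System.out.println(newCap / (double) RUNNING_WORKERS >= SCALE_OUT_THRESHOLD); // true
    }
}
```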
```java
try {
  final JsonNode jsonObject = getJsonResultFromApi(apiPath);

  LOG.info("Making RabbitMQ api call: {}", apiPath);
```
Probably not meant to be .info, or was it to debug while developing?
Ah, it seems I forgot to push my last commit, which had already removed this log line (it should indeed not be here).
Force-pushed from 3a63d33 to e37392d
@SerhatG Yes, that is exactly the intended usage.
BertScholten
left a comment
Not exactly sure how this is going to be used in practice. Don't think we have automatic configuration files per environment (yet)? Or do you want to configure it based on the (rather minimal) dev environment?
```java
 * Returns the number of workers that are available or the number of configured maximum numbers that are potential available. Which ever number
 * is the highest. This number can be used to see if tasks can still be scheduled. If there is still room for more workers to be started
 * (i.e. max is not reached) than a task should be able to be scheduled it it doesn't have reach it maximum relative to the total potential
 * workers, not the actual number of workers running.
```
```suggestion
 * Returns the number of workers that are available, or the configured maximum that are potentially available,
 * whichever number is the highest.
 * This number can be used to see if tasks can still be scheduled.
 * If there is still room for more workers to be started (i.e. the max is not reached yet) then a task can be scheduled.
 * This then serves as a preload mechanism while workers are starting/scaled up.
```
Had a hard time understanding this comment. Code was simpler.
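The behaviour the Javadoc describes could be sketched as follows. The class and method names are hypothetical, not the PR's actual code; the point is the "whichever is highest" choice of capacity basis:

```java
// Hypothetical sketch of the capacity basis described above: scheduling decisions
// are made against the potential (configured maximum) worker count rather than
// the actual one, whichever is higher.
public class WorkerCapacity {
    static int numberOfWorkersAvailable(int runningWorkers, int maxWorkersAvailable) {
        // While the system is still allowed to scale up, report the configured
        // maximum, so tasks keep being scheduled and the scale-out trigger can fire.
        return Math.max(runningWorkers, maxWorkersAvailable);
    }

    public static void main(String[] args) {
        System.out.println(numberOfWorkersAvailable(16, 40)); // configured max wins while scaled down
        System.out.println(numberOfWorkersAvailable(50, 40)); // actual count wins once above the max
    }
}
```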
> For example if worker scaling is based on the percentage workers being used it can mean the system will never scale up because the `maxCapacityUse` for certain input queues will never exceed the scaling threshold percentage.
> Especially when the system runs with a low number of workers the `maxCapacityUse` is easily reached before the scaling threshold is reached.
> By setting `maxWorkersAvailable` the scheduler will determine if tasks already have reached the `maxCapacityUse` value based on the `maxWorkersAvailable` value and not the actual number of workers running.
> It will than schedule tasks beyond the `maxCapacityUse` assuming this would trigger the system to scale up if needed,
```suggestion
It will then schedule tasks beyond the `maxCapacityUse`, assuming this will trigger the system to scale up as needed,
```
> Especially when the system runs with a low number of workers the `maxCapacityUse` is easily reached before the scaling threshold is reached.
> By setting `maxWorkersAvailable` the scheduler will determine if tasks already have reached the `maxCapacityUse` value based on the `maxWorkersAvailable` value and not the actual number of workers running.
> It will than schedule tasks beyond the `maxCapacityUse` assuming this would trigger the system to scale up if needed,
> and only for a short amount of time claim more resources for a specific queue than would be allowed by the `maxCapacityUse` percentage.
Only for a short time if the maxWorkersAvailable matches the actual system max scaling. If that isn't configured correctly, it might end up claiming more for longer periods (bit nitpicky perhaps).
Something to discuss at the technical meeting?