ComputeLite

ComputeLite is a Kubernetes-inspired compute scheduler and control-plane simulator written in Go. It models nodes, jobs, heartbeats, and controller-based reconciliation to demonstrate how modern compute platforms schedule, monitor, and reschedule workloads under failure and churn.

The system runs as a long-lived daemon, converging cluster state over time rather than executing a single scheduling pass.

High-Level Architecture

ComputeLite is structured as a control plane + node agents system.

Core Components

ClusterState

Single source of truth
Thread-safe (RWMutex)
Owns all nodes, jobs, and resource accounting
Enforces valid job state transitions

Controllers (Reconciliation Loops)

Run continuously in goroutines
Observe cluster state via snapshots
Mutate state only through ClusterState APIs

Controllers include:

SchedulerController – assigns pending jobs to healthy nodes using a pluggable policy
JobController – advances jobs through Assigned → Running → Succeeded
HealthController – evaluates node health based on heartbeat freshness
ReschedulerController – evicts and requeues jobs from unhealthy nodes

Node Agents (Simulated)

Periodically emit heartbeats
Represent node-local agents (similar to kubelet)
Failure is modeled by stopping heartbeats

Execution Model

ComputeLite runs as a daemon, not a script.

ClusterState is initialized
Controllers start and reconcile continuously
Node agents emit heartbeats
A scenario injects nodes, jobs, and failures
State converges over time
The system shuts down gracefully on SIGINT/SIGTERM

There is no “end” condition—only convergence.

Project Structure

computelite/
├── cmd/
│   └── computelite/
│       ├── main.go          # minimal entrypoint
│       └── app/             # binary-specific runtime logic
│           ├── run.go
│           ├── controllers.go
│           ├── reporting.go
│           ├── scenario.go
│           └── node_agent.go
├── pkg/
│   ├── api/                 # core types (Job, Node, Resource, states)
│   ├── cluster/             # ClusterState + snapshot reporting
│   ├── controller/          # control-plane controllers
│   └── scheduler/           # scheduling policies (e.g. BestFit)

This layout mirrors real infrastructure projects.

Scheduling Policy

ComputeLite supports pluggable scheduling policies via a simple interface.

Current implementation:

FirstFit - selects the first healthy node that can satisfy the job’s resource requirements
BestFit – selects the node that minimizes leftover resources after placement
RoundRobin - distributes jobs evenly across healthy nodes in cyclic order

Policies are injected into the scheduler controller and can be swapped without changing core logic.

Failure & Rescheduling

Nodes emit heartbeats at fixed intervals
HealthController marks nodes unhealthy when heartbeats exceed a timeout
ReschedulerController evicts jobs from unhealthy nodes
Evicted jobs are requeued and rescheduled on healthy nodes

Failures emerge naturally from time—not explicit conditionals.

Observability

The system periodically prints a cluster snapshot:

job counts by state
node resource utilization
node health and heartbeat status

This provides a live view of system convergence while running.

Running the Project

From the repository root:

go run ./cmd/computelite

The system will:

start controllers
start node agents
run the default scenario
print periodic cluster snapshots

To stop:

Ctrl + C

Shutdown is graceful and deterministic.

Design Principles

Controller-based reconciliation
Clear ownership of shared state
No direct shared-memory access outside ClusterState
Time-based failure detection
Idempotent, race-safe controllers
Minimal main.go, logic lives in app/

Future Extensions (Optional)

ComputeLite is intentionally extensible. Possible next steps include:

graceful node draining
job retry backoff
event-driven scheduler wakeups
multiple scheduling policies
metrics instead of logs
CLI flags for scenario selection

Summary

ComputeLite demonstrates how a real compute control plane behaves:

always running
always converging
resilient to failure
cleanly structured

👨‍💻 Author

Adnan T. — @adnant1

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
cmd/computelite		cmd/computelite
pkg		pkg
README.md		README.md
go.mod		go.mod

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComputeLite

High-Level Architecture

Core Components

Execution Model

Project Structure

Scheduling Policy

Failure & Rescheduling

Observability

Running the Project

Design Principles

Future Extensions (Optional)

Summary

👨‍💻 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ComputeLite

High-Level Architecture

Core Components

Execution Model

Project Structure

Scheduling Policy

Failure & Rescheduling

Observability

Running the Project

Design Principles

Future Extensions (Optional)

Summary

👨‍💻 Author

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages