AQPy

Repository for scripts and files to read the PMS5003 air quality index sensor and the BME280 temperature/pressure/humidity sensor from a Raspberry Pi. systemctl is used to manage the read_sensors.py python script. The data is stored into a postgresql database with the timescaledb extension. From there, Grafana is used to plot the data over time.

Operational Docs

SENSORS.md: sensor overview, key specs, and primary reference links
TROUBLESHOOTING.md: end-to-end troubleshooting (SSH, DB auth/ownership, schema, sensor serial, Grafana)
REFLASH_AND_LOGIN.md: reflash + first SSH login checklist
grafana.md: Grafana background notes

Hardware

PMS5003

The PMS5003 is assumed to be connected over serial in the /dev/serial0 position. See the PMS5003 manual for wiring diagram of the PMS5003. The pinout command on the Raspberry Pi OS will show the function of the GPIO pins.

PMS Wire No.	Raspberry Pi Pin No.
VCC (1)	2
GND (2)	6
SET (3)	unused
RX (4)	8
TX (5)	10
RESET (6)	unused

BME280

The BME280 is assumed to be connected with I2C.

BME280 Terminal	Raspberry Pi Pin No.
3V3	1
GND	9
SCL	5
SDA	3

Installation

install python dependencies:
- python3 -m pip install -r requirements.txt
copy .env.example to .env and set database credentials:
- cp .env.example .env
copy aqi.service to /etc/systemd/system with sudo cp aqi.service /etc/systemd/system
run sudo systemctl daemon-reload
run sudo systemctl enable aqi to start aqi.service at boot
either sudo reboot or sudo systemctl start aqi to start the service
make sure its running with systemctl status aqi. It will say "active (running)" if things are working properly.

Grafana (Turnkey Provisioning)

This repo can provision Grafana automatically with:

datasource AQPy BME (database bme)
datasource AQPy PMS (database pms)
dashboard AQPy Edge Sensors + Forecasts (uid=aqpy-overview)

From Pi:

cd /home/pi/AQPy
sudo ./scripts/provision_grafana.sh

Open:

http://<pi-ip>:3000/d/aqpy-overview

Raw sensors dashboard:

http://<pi-ip>:3000/d/aqpy-raw

Notes:

scripts/provision_grafana.sh reads DB credentials from .env
make sure .env has real AQPY_DB_PASSWORD (not change_me)
first login is typically admin / admin and Grafana prompts password reset

Configuration

The script reads configuration from environment variables (typically from .env when run with aqi.service):

AQPY_DB_USER, AQPY_DB_PASSWORD, AQPY_DB_HOST, AQPY_DB_PORT
AQPY_DB_NAME_PMS, AQPY_DB_NAME_BME
AQPY_SERIAL_PORT, AQPY_SERIAL_BAUD
AQPY_PMS_STARTUP_DELAY, AQPY_PMS_AVG_TIME, AQPY_SLEEP_SECONDS
AQPY_BME_I2C_PORT, AQPY_BME_I2C_ADDR
AQPY_LOG_LEVEL
AQPY_RETENTION_DAYS, AQPY_RETENTION_SAFETY_HOURS
AQPY_RETENTION_DAYS_RAW, AQPY_RETENTION_SAFETY_HOURS_RAW
AQPY_RETENTION_DAYS_PREDICTIONS, AQPY_RETENTION_SAFETY_HOURS_PREDICTIONS

Ingestion Architecture

Sensor ingestion is separated into its own package:

aqpy/ingest/config.py: ingestion runtime config from environment
aqpy/ingest/interfaces.py: ingestion contracts (sensor + repository protocols)
aqpy/ingest/pms5003.py: PMS5003 sensor protocol implementation
aqpy/ingest/repository.py: SQL insert logic for PMS/BME readings
aqpy/ingest/service.py: ingestion orchestration loop and lifecycle
read_sensors.py: thin entrypoint that configures logging and runs ingestion

Service Hardening

aqi.service includes a sandboxing profile (NoNewPrivileges, ProtectSystem, ProtectHome, namespace and syscall restrictions, private temp/mounts, and tight UMask) to reduce blast radius.

After updating the unit file:

run sudo systemctl daemon-reload
run sudo systemctl restart aqi
verify with systemctl status aqi and journalctl -u aqi -n 100

If systemd reports an unknown lvalue, comment out only the unsupported directive in aqi.service and reload/restart again.

Edge ML Forecasting

This repo includes a modular edge-ML forecasting pipeline.

Edge ML Layout

read_sensors.py: ingestion service (sensor read + DB writes only)
aqpy/common/db.py: shared DB connection logic
aqpy/forecast/features.py: feature engineering
aqpy/forecast/model.py: model fit/predict logic
aqpy/forecast/nn_model.py: small neural network model (MLP) for online updates
aqpy/forecast/adaptive_ar.py: adaptive autoregressive model (RLS with forgetting)
aqpy/forecast/rnn_lite.py: lightweight GRU-style latent model with trained linear head
aqpy/forecast/repository.py: SQL data access for forecast pipeline
aqpy/forecast/training.py: orchestration for training and artifact export
aqpy/forecast/inference.py: orchestration for forecast generation and inserts
aqpy/forecast/online_repository.py: training-state, holdout metrics, and retention run logs
aqpy/forecast/online_training.py: online retraining step with holdout evaluation logging
aqpy/forecast/retention.py: training-aware retention policy
aqpy/forecast/specs.py: model spec loader/filter for multi-sensor orchestration
train_forecast_model.py: thin CLI wrapper for training
run_forecast_inference.py: thin CLI wrapper for inference
run_online_training.py: thin CLI wrapper for online retraining across model types
run_data_retention.py: thin CLI wrapper for retention
run_online_training_batch.py: batch retraining from configs/model_specs.json
run_forecast_batch.py: batch inference from configs/model_specs.json
run_data_retention_batch.py: modular retention for raw (pi) and predictions tables; derived/view sources are skipped
run_backfill_batch.py: idempotent historical one-step backfill from model artifacts
configs/model_specs.json: declarative model list (both bme and pms targets)
validate_model_specs.py: CLI validator for spec integrity before deployment
sql/forecast_schema.sql: schema for predictions and model_registry
sql/online_learning_schema.sql: schema for online training state and holdout metrics
sql/derived_schema_pms.sql: derived AQI view from PMS raw PM2.5/PM10
aqi-train-online.service + aqi-train-online.timer: scheduled batch retraining across all configured models
aqi-forecast.service + aqi-forecast.timer: scheduled batch inference across all configured models
aqi-retention.service + aqi-retention.timer: scheduled data retention pruning

Initialize Forecast Tables

Run once per database used for forecasting:

psql bme -f sql/raw_schema_bme.sql
psql bme -f sql/forecast_schema.sql
psql bme -f sql/online_learning_schema.sql
psql pms -f sql/raw_schema_pms.sql
psql pms -f sql/derived_schema_pms.sql
psql pms -f sql/forecast_schema.sql
psql pms -f sql/online_learning_schema.sql

Derived AQI (PM)

AQPy computes a PM-based AQI from PMS raw data using U.S. EPA breakpoint interpolation:

Inputs: pm25_st and pm10_st from pms.pi
Truncation before interpolation:
- PM2.5 truncated to 0.1 ug/m3
- PM10 truncated to 1 ug/m3
AQI result is max(subindex_pm25, subindex_pm10) in range [0, 500]

Implementation choice:

AQI is derived in SQL view derived.pms_aqi (and convenience view pms_aqi), not stored back into raw pi.
This is non-destructive and automatically backfills all historical rows.

Tradeoff:

View-based derivation needs no ETL timer and is always up to date, but computes at query time.
ETL/materialized-table approach can be faster for heavy query loads, but adds operational complexity (refresh/backfill jobs, timer/cron, lag handling).

Retention note:

aqi_pm models use source table pms_aqi (a view).
Retention job skips non-raw tables and only prunes raw pi tables.

Train Model (offline or on Pi)

Example for temperature forecast from the bme.pi table:

python3 train_forecast_model.py \
  --database bme \
  --table pi \
  --time-col t \
  --target temperature \
  --history-hours 336 \
  --lags 1,2,3,6,12 \
  --model-path models/bme_temperature_model.json \
  --register

Validate Model Specs (Recommended Before Deploy)

python3 validate_model_specs.py --spec-file configs/model_specs.json

Run One Inference Pass

python3 run_forecast_inference.py \
  --model-path models/bme_temperature_nn.json \
  --horizon-steps 12

Adaptive AR inference uses the same command with AR artifact path:

python3 run_forecast_inference.py \
  --model-path models/bme_temperature_ar.json \
  --horizon-steps 12

GRU-lite inference uses:

python3 run_forecast_inference.py \
  --model-path models/bme_temperature_rnn.json \
  --horizon-steps 12

Run One Online NN Retraining Step

python3 run_online_training.py \
  --database bme \
  --table pi \
  --time-col t \
  --target temperature \
  --model-name aqpy_nn_temperature \
  --model-path models/bme_temperature_nn.json \
  --model-type nn_mlp \
  --history-hours 336 \
  --burn-in-rows 200 \
  --max-train-rows 5000 \
  --lags 1,2,3,6,12 \
  --holdout-ratio 0.2 \
  --min-new-rows 30 \
  --learning-rate 0.01 \
  --epochs 40 \
  --batch-size 64 \
  --hidden-dim 8

Run One Adaptive AR Retraining Step

python3 run_online_training.py \
  --database bme \
  --table pi \
  --time-col t \
  --target temperature \
  --model-name aqpy_ar_temperature \
  --model-path models/bme_temperature_ar.json \
  --model-type adaptive_ar \
  --history-hours 336 \
  --burn-in-rows 200 \
  --max-train-rows 5000 \
  --lags 1,2,3,6,12 \
  --holdout-ratio 0.2 \
  --min-new-rows 30 \
  --forgetting-factor 0.995 \
  --ar-delta 100.0

Run One GRU-lite Retraining Step

python3 run_online_training.py \
  --database bme \
  --table pi \
  --time-col t \
  --target temperature \
  --model-name aqpy_rnn_temperature \
  --model-path models/bme_temperature_rnn.json \
  --model-type rnn_lite_gru \
  --history-hours 336 \
  --burn-in-rows 200 \
  --max-train-rows 5000 \
  --seq-len 24 \
  --holdout-ratio 0.2 \
  --min-new-rows 30 \
  --hidden-dim 8 \
  --rnn-ridge 0.001 \
  --random-seed 42

Each retraining step logs holdout metrics into online_training_metrics, including:

holdout_mae, holdout_rmse
baseline_mae, baseline_rmse
mae_improvement_pct, rmse_improvement_pct
training hyperparameters and new rows processed

Parameterization notes:

--history-hours controls database read window.
--max-train-rows caps memory/compute by trimming to the most recent rows in that window.
--burn-in-rows blocks model updates until enough data is accumulated.
--min-new-rows gates how often retraining runs; if new rows are below threshold, run result is skipped.
For AR/NN lag models use --lags; for GRU-lite use --seq-len.
Maximum effective lookback is bounded by what exists in the database and these caps.

Run One Retention Step (Training-Aware)

python3 run_data_retention.py \
  --database bme \
  --table pi \
  --time-col t \
  --retention-days 180 \
  --safety-hours 24

Retention cutoff is:

min(now() - retention_days, min(last_seen_ts) - safety_hours)

This prevents deleting records that have not been incorporated into online training.

Modular Retention Defaults (Batch)

run_data_retention_batch.py supports separate policies:

Raw tables (pi): training-watermark aware
Predictions table (predictions): time-window retention without training watermark

Defaults are now:

raw retention: 180 days, 24 safety hours
predictions retention: 180 days, 0 safety hours

Configure in .env:

AQPY_RETENTION_DAYS=180
AQPY_RETENTION_SAFETY_HOURS=24
AQPY_RETENTION_DAYS_RAW=180
AQPY_RETENTION_SAFETY_HOURS_RAW=24
AQPY_RETENTION_DAYS_PREDICTIONS=180
AQPY_RETENTION_SAFETY_HOURS_PREDICTIONS=0

Run Timers On Pi

sudo cp aqi-train-online.service /etc/systemd/system/aqi-train-online.service
sudo cp aqi-train-online.timer /etc/systemd/system/aqi-train-online.timer
sudo cp aqi-forecast.service /etc/systemd/system/aqi-forecast.service
sudo cp aqi-forecast.timer /etc/systemd/system/aqi-forecast.timer
sudo cp aqi-retention.service /etc/systemd/system/aqi-retention.service
sudo cp aqi-retention.timer /etc/systemd/system/aqi-retention.timer
sudo systemctl daemon-reload
sudo systemctl enable --now aqi-train-online.timer
sudo systemctl enable --now aqi-forecast.timer
sudo systemctl enable --now aqi-retention.timer
systemctl status aqi-train-online.timer
systemctl status aqi-forecast.timer
systemctl status aqi-retention.timer
journalctl -u aqi-train-online.service -n 100 --no-pager
journalctl -u aqi-forecast.timer -n 20 --no-pager
journalctl -u aqi-forecast.service -n 100 --no-pager
journalctl -u aqi-retention.service -n 100 --no-pager

One-Script Bring-Up (Recommended)

If the Pi already has /home/pi/AQPy and .venv set up:

cd /home/pi/AQPy
sudo ./scripts/bringup_edge_stack.sh

If network/DB/systemd readiness is delayed at boot, use retry mode:

cd /home/pi/AQPy
sudo ./scripts/bringup_edge_stack.sh --wait

To also run a one-shot bootstrap (train all configured models immediately and write initial predictions):

cd /home/pi/AQPy
sudo ./scripts/bringup_edge_stack.sh --with-bootstrap

To bootstrap later without reinstalling systemd units:

cd /home/pi/AQPy
./scripts/bootstrap_models.sh

Turnkey Fresh-Clone Install

From a newly cloned repo on Raspberry Pi:

cd /home/pi/AQPy
sudo ./scripts/install_from_fresh_clone.sh --with-bootstrap

To also install Grafana in the same run:

cd /home/pi/AQPy
sudo ./scripts/install_from_fresh_clone.sh --with-bootstrap --with-grafana

This installer:

installs OS dependencies
enables I2C + serial hardware (best effort)
creates .venv and installs Python dependencies
creates .env from template if missing
ensures Postgres databases exist
runs idempotent bring-up and optional model bootstrap
applies DB ownership/privileges for app role and prepares writable models/ artifacts directory
optional Grafana install and service enable (--with-grafana)
optional Grafana datasource + dashboard provisioning (--with-grafana)

After first run:

verify .env credentials/settings
reboot once if interface settings changed (sudo reboot)

First Verification Checklist

systemctl status aqi --no-pager
systemctl status aqi-train-online.timer --no-pager
systemctl status aqi-forecast.timer --no-pager
systemctl status aqi-retention.timer --no-pager

journalctl -u aqi -n 80 --no-pager
journalctl -u aqi-train-online.service -n 80 --no-pager
journalctl -u aqi-forecast.service -n 80 --no-pager

PGPASSWORD='<your_db_password>' psql -h localhost -U pi -d bme -c "select count(*), max(t) from pi;"
PGPASSWORD='<your_db_password>' psql -h localhost -U pi -d bme -c "select model_name, count(*) from predictions group by 1 order by 1;"

SSH Profiling Scripts

Use these when connected to Pi over SSH for quick health/profiling checks.

One-shot snapshot:

cd /home/pi/AQPy
./scripts/profile_snapshot.sh

Include recent logs + serial probe:

./scripts/profile_snapshot.sh --with-logs --serial-probe

Standalone PMS serial probe:

./scripts/probe_pms_serial.sh --iterations 30

Continuous watch (refresh every 30s):

./scripts/profile_watch.sh --interval 30

Run Batch Jobs Manually (No venv/source needed)

Use wrapper script to run immediate train/forecast from SSH shell:

cd /home/pi/AQPy
./scripts/run_edge_jobs_now.sh

Examples:

./scripts/run_edge_jobs_now.sh --databases bme
./scripts/run_edge_jobs_now.sh --train-only --databases bme
./scripts/run_edge_jobs_now.sh --forecast-only --databases bme
./scripts/run_edge_jobs_now.sh --train-only --families rnn --targets temperature,humidity,pressure --databases bme
./scripts/run_edge_jobs_now.sh --with-retention
./scripts/run_edge_jobs_now.sh --with-backfill --backfill-hours 48 --databases bme

Backfill behavior:

re-scores historical windows using currently saved model artifacts
writes one-step predictions (horizon_step=1) at historical timestamps
idempotent by default: existing rows for the same model/version/window are replaced
selective filters (--models, --databases, --targets, --families) apply uniformly to train/forecast/backfill
online_training_metrics are written only for the filtered training specs (so metrics stay in sync with selected runs)

Grafana Metrics Queries (Examples)

Holdout MAE trend:

SELECT recorded_at AS time, holdout_mae
FROM online_training_metrics
WHERE model_name = 'aqpy_nn_temperature'
ORDER BY recorded_at;

Model vs baseline improvement:

SELECT recorded_at AS time, mae_improvement_pct, rmse_improvement_pct
FROM online_training_metrics
WHERE model_name = 'aqpy_nn_temperature'
ORDER BY recorded_at;

Run Tests

Run the unit tests from repo root:

python3 -m unittest discover -s tests -p "test_*.py"

Maintenance

systemctl stores logs that can be accessed through journalctl -u aqi. journalctl uses the less linux utility to show the logs. A brief summary of aqi.service can be obtained by running systemctl status aqi. If the sensors stop working (or I didn't code things robustly enough) the python runtime errors will be recorded by systemctl. If the read_sensors.py script fails, systemctl will automatically restart it however if it fails too many times it will wait longer and longer between retries.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
aqpy		aqpy
configs		configs
grafana/dashboards		grafana/dashboards
models		models
scripts		scripts
sql		sql
tests		tests
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
REFLASH_AND_LOGIN.md		REFLASH_AND_LOGIN.md
SENSORS.md		SENSORS.md
TROUBLESHOOTING.md		TROUBLESHOOTING.md
aqi-forecast-ar.service		aqi-forecast-ar.service
aqi-forecast-ar.timer		aqi-forecast-ar.timer
aqi-forecast-rnn.service		aqi-forecast-rnn.service
aqi-forecast-rnn.timer		aqi-forecast-rnn.timer
aqi-forecast.service		aqi-forecast.service
aqi-forecast.timer		aqi-forecast.timer
aqi-retention.service		aqi-retention.service
aqi-retention.timer		aqi-retention.timer
aqi-train-online-ar.service		aqi-train-online-ar.service
aqi-train-online-ar.timer		aqi-train-online-ar.timer
aqi-train-online-rnn.service		aqi-train-online-rnn.service
aqi-train-online-rnn.timer		aqi-train-online-rnn.timer
aqi-train-online.service		aqi-train-online.service
aqi-train-online.timer		aqi-train-online.timer
aqi.service		aqi.service
grafana.md		grafana.md
psql.md		psql.md
read_sensors.py		read_sensors.py
requirements.txt		requirements.txt
run_backfill_batch.py		run_backfill_batch.py
run_data_retention.py		run_data_retention.py
run_data_retention_batch.py		run_data_retention_batch.py
run_forecast_batch.py		run_forecast_batch.py
run_forecast_inference.py		run_forecast_inference.py
run_online_training.py		run_online_training.py
run_online_training_batch.py		run_online_training_batch.py
train_forecast_model.py		train_forecast_model.py
validate_model_specs.py		validate_model_specs.py

Folders and files

Latest commit

History

Repository files navigation

AQPy

Operational Docs

Hardware

PMS5003

BME280

Installation

Grafana (Turnkey Provisioning)

Configuration

Ingestion Architecture

Service Hardening

Edge ML Forecasting

Edge ML Layout

Initialize Forecast Tables

Derived AQI (PM)

Train Model (offline or on Pi)

Validate Model Specs (Recommended Before Deploy)

Run One Inference Pass

Run One Online NN Retraining Step

Run One Adaptive AR Retraining Step

Run One GRU-lite Retraining Step

Run One Retention Step (Training-Aware)

Modular Retention Defaults (Batch)

Run Timers On Pi

One-Script Bring-Up (Recommended)

Turnkey Fresh-Clone Install

First Verification Checklist

SSH Profiling Scripts

Run Batch Jobs Manually (No venv/source needed)

Grafana Metrics Queries (Examples)

Run Tests

Maintenance

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages