
LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction

arXiv Project Page

Tianye Ding1*, Yiming Xie1*, Yiqing Liang2*, Moitreya Chatterjee3, Pedro Miraldo3, Huaizu Jiang1
1 Northeastern University, 2 Independent Researcher, 3 Mitsubishi Electric Research Laboratories
* Equal Contribution

📢 Updates

  • [2026-03-12] Loop-closure module released with robustness fix.
  • [2026-02-21] Paper accepted by CVPR 2026.
  • [2025-12-15] ArXiv preprint released.

📝 To-Do List

  • Release framework codebase
  • Release inference code
  • Add data preparation instruction
  • Release evaluation code
  • Add Viser integration
  • Release loop-closure demo

💡 Abstract

We propose LASER, a training-free framework that converts an offline reconstruction model into a streaming system by aligning predictions across consecutive temporal windows. We observe that simple similarity transformation (Sim(3)) alignment fails due to layer depth misalignment: monocular scale ambiguity causes relative depth scales of different scene layers to vary inconsistently between windows. To address this, we introduce layer-wise scale alignment, which segments depth predictions into discrete layers, computes per-layer scale factors, and propagates them across both adjacent windows and timestamps.
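As a rough illustration of the core idea (not the released implementation), the sketch below segments a depth prediction into quantile-based layers and estimates one robust scale factor per layer from the overlap with the previous window. The function name, the quantile layering rule, and the median-ratio estimator are all assumptions made for illustration:

```python
import numpy as np

def layerwise_scale_align(depth_prev, depth_curr, num_layers=4):
    """Toy sketch of layer-wise scale alignment: segment the previous
    window's depth into discrete quantile layers, then estimate one
    robust scale factor per layer from the ratio of overlapping depths."""
    # Layer boundaries from depth quantiles of the previous window
    edges = np.quantile(depth_prev, np.linspace(0, 1, num_layers + 1))
    edges[-1] += 1e-6  # make the last bin inclusive
    layer_ids = np.clip(np.digitize(depth_prev, edges) - 1, 0, num_layers - 1)

    aligned = depth_curr.copy()
    scales = np.ones(num_layers)
    for k in range(num_layers):
        mask = layer_ids == k
        if mask.any():
            # Robust per-layer scale: median ratio over the overlap
            scales[k] = np.median(depth_prev[mask] / depth_curr[mask])
            aligned[mask] = depth_curr[mask] * scales[k]
    return aligned, scales
```

In the actual setting each layer would get its own factor because monocular scale drifts differently per layer; a single global Sim(3) scale cannot correct that.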

🛠️ Installation

# 1. Clone the repository
git clone --recursive git@github.com:neu-vi/LASER.git
cd LASER

# 2. Create environment
conda create -n laser -y python=3.11
conda activate laser

# 3. Install dependencies
pip install -r requirements.txt

# 4. Compile Cython modules
python setup.py build_ext --inplace

# 5. Install Viser
pip install -e viser

(Optional) Download checkpoints needed for loop-closure inference

bash ./scripts/download_weights.sh

🚀 Usage

Inference

To run the inference code, you can use the following command:

export PYTHONPATH="./":$PYTHONPATH

python demo.py \
    --data_path DATA_PATH \
    --output_path "./viser_results" \
    --cache_path "./cache" \
    --sample_interval SAMPLE_INTERVAL \
    --window_size WINDOW_SIZE \
    --overlap OVERLAP \
    --depth_refine

# example inference script
python demo.py \
    --data_path "examples/titanic" \
    --output_path "./viser_results" \
    --cache_path "./cache" \
    --sample_interval 1 \
    --window_size 30 \
    --overlap 10 \
    --depth_refine

The results will be saved in the viser_results/SEQ_NAME directory for later visualization.
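For intuition about the --window_size and --overlap flags, the hypothetical helper below shows one plausible way a frame sequence could be chunked into overlapping temporal windows; demo.py's actual chunking logic may differ:

```python
def make_windows(num_frames, window_size, overlap, sample_interval=1):
    """Split frame indices into overlapping temporal windows.
    Consecutive windows share `overlap` frames, which is what gives the
    alignment step an overlap region to match scales across windows."""
    frames = list(range(0, num_frames, sample_interval))
    stride = window_size - overlap
    windows, start = [], 0
    while start < len(frames):
        windows.append(frames[start:start + window_size])
        if start + window_size >= len(frames):
            break
        start += stride
    return windows
```

With the example settings above (window_size=30, overlap=10), each new window re-predicts the last 10 frames of the previous one, and those shared frames anchor the cross-window alignment.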

Loop-closure inference

Loop-closure inference additionally depends on the faiss package, which can be installed with:

pip install faiss-gpu-cu12 numpy==1.26.4

Run loop-closure inference on kilometer-scale sequences with the following command:

python demo_lc.py \
    --config_path "configs/loop_config.yaml" \
    --data_path DATA_PATH \
    --output_path "./viser_results" \
    --cache_path "./cache" \
    --sample_interval SAMPLE_INTERVAL \
    --window_size WINDOW_SIZE \
    --overlap OVERLAP

# clear the cached intermediate results afterwards
rm -r cache/
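Conceptually, loop closure starts by retrieving earlier frames that look similar to the current one. The sketch below does that with plain NumPy cosine similarity over per-frame global descriptors; loop_candidates and min_gap are illustrative names, and the released code uses faiss to make this nearest-neighbour search scale to kilometer-length sequences:

```python
import numpy as np

def loop_candidates(descs, query_idx, top_k=3, min_gap=30):
    """Return indices of earlier frames most similar to frame query_idx,
    excluding temporal neighbours within min_gap frames (which are
    trivially similar and not true loop closures)."""
    d = descs / np.linalg.norm(descs, axis=1, keepdims=True)
    sims = d[:query_idx] @ d[query_idx]            # cosine similarity
    sims[max(0, query_idx - min_gap):] = -np.inf   # mask recent frames
    order = np.argsort(sims)[::-1][:top_k]
    return [i for i in order if np.isfinite(sims[i])]
```

A retrieved candidate pair would then be verified geometrically before the trajectory is corrected; this sketch covers only the retrieval step.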

Visualization

To visualize the interactive 4D results, you can use the following command:

python viser/visualizer_monst3r.py --data viser_results/SEQ_NAME

# example visualization script
python viser/visualizer_monst3r.py --data viser_results/titanic

Evaluation

Please refer to MonST3R for dataset setup details.

Put all datasets in data/.

Video Depth

Sintel

export PYTHONPATH="./":$PYTHONPATH

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
    --mode=eval_pose \
    --model=streaming_pi3 \
    --eval_dataset=sintel \
    --output_dir="outputs/video_depth/sintel_depth" \
    --full_seq \
    --no_crop

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 depth_metric.py \
    --eval_dataset=sintel \
    --result_dir="outputs/video_depth/sintel_depth" \
    --output_dir="outputs/video_depth"

Bonn

export PYTHONPATH="./":$PYTHONPATH

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
    --mode=eval_pose \
    --model=streaming_pi3 \
    --eval_dataset=bonn \
    --output_dir="outputs/video_depth/bonn_depth" \
    --no_crop

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 depth_metric.py \
    --eval_dataset=bonn \
    --result_dir="outputs/video_depth/bonn_depth" \
    --output_dir="outputs/video_depth"

KITTI

export PYTHONPATH="./":$PYTHONPATH

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
    --mode=eval_pose \
    --model=streaming_pi3 \
    --eval_dataset=kitti \
    --output_dir="outputs/video_depth/kitti_depth" \
    --no_crop \
    --flow_loss_weight 0 \
    --translation_weight 1e-3

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 depth_metric.py \
    --eval_dataset=kitti \
    --result_dir="outputs/video_depth/kitti_depth" \
    --output_dir="outputs/video_depth"
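For reference, a common video-depth metric in this setting is AbsRel computed after per-sequence median scale alignment. The sketch below is an assumed, simplified version of what depth_metric.py reports; the exact protocol follows MonST3R:

```python
import numpy as np

def abs_rel(pred, gt):
    """AbsRel after median scale alignment: scale the prediction so its
    median matches the ground truth, then average |pred - gt| / gt."""
    mask = gt > 0                                    # valid ground truth only
    pred, gt = pred[mask], gt[mask]
    pred = pred * (np.median(gt) / np.median(pred))  # median scale alignment
    return float(np.mean(np.abs(pred - gt) / gt))
```

Median alignment removes the global monocular scale ambiguity, so the metric measures depth structure rather than absolute scale.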

Camera Pose

Sintel

export PYTHONPATH="./":$PYTHONPATH

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
    --mode=eval_pose \
    --model=streaming_pi3 \
    --eval_dataset=sintel \
    --output_dir="outputs/cam_pose/sintel_pose"

ScanNet

export PYTHONPATH="./":$PYTHONPATH

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
    --mode=eval_pose \
    --model=streaming_pi3 \
    --eval_dataset=scannet \
    --output_dir="outputs/cam_pose/scannet_pose"

TUM

export PYTHONPATH="./":$PYTHONPATH

CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
    --mode=eval_pose \
    --model=streaming_pi3 \
    --eval_dataset=tum \
    --output_dir="outputs/cam_pose/tum_pose"
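Camera-pose benchmarks of this kind typically report ATE-RMSE after Sim(3) (Umeyama) alignment of the predicted camera centres to ground truth. The function below is an illustrative sketch of that metric, not the evaluation code shipped in this repository:

```python
import numpy as np

def ate_rmse(pred, gt):
    """Absolute Trajectory Error (RMSE) after Sim(3) alignment via the
    Umeyama method. pred, gt: (N, 3) arrays of camera centres."""
    mu_p, mu_g = pred.mean(0), gt.mean(0)
    P, G = pred - mu_p, gt - mu_g                  # centred trajectories
    cov = G.T @ P / len(pred)                      # cross-covariance
    U, S, Vt = np.linalg.svd(cov)
    D = np.eye(3)
    if np.linalg.det(U) * np.linalg.det(Vt) < 0:   # guard against reflection
        D[2, 2] = -1.0
    R = U @ D @ Vt                                 # optimal rotation
    var_p = (P ** 2).sum() / len(pred)             # variance of predictions
    s = np.trace(np.diag(S) @ D) / var_p           # optimal scale
    err = G - s * P @ R.T                          # residuals after alignment
    return float(np.sqrt((err ** 2).sum(1).mean()))
```

Aligning with a full Sim(3) before scoring means the metric is insensitive to the arbitrary global scale, rotation, and origin of a monocular reconstruction.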

Citation

If you find this repository useful in your research, please consider giving it a star ⭐ and a citation:

@article{ding2025laser,
  title={LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction},
  author={Ding, Tianye and Xie, Yiming and Liang, Yiqing and Chatterjee, Moitreya and Miraldo, Pedro and Jiang, Huaizu},
  year={2025}
}

Acknowledgements

We would like to thank the authors for the following excellent open source projects: VGGT, π3, MonST3R, CUT3R, VGGT-Long and many other inspiring works in the community.
