Skip to content

SahilB2k/ViewMyRecords

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VMR Crawler Stability Refinement

This project aims to finalize and stabilize the VMR (ViewMyRecords) crawler for production.

Features

  • SPA-aware navigation (breadcrumbs, back-button, ".." folder)
  • Session conflict handling ("Login Here" button)
  • Resilient folder and file detection
  • Recursive discovery with duplicate prevention
  • Metadata extraction (sidecar JSON files)
  • Batch processing and error recovery

Prerequisites

  • Python 3.8+
  • Playwright (pip install playwright then playwright install)

Usage

  1. Set environment variables: VMR_CORPORATE_ID, VMR_USERNAME, VMR_PASSWORD.
  2. Run the script: python production_migration_engine.py.

Docker Usage (Recommended)

This project is fully dockerized to avoid version conflicts. See DOCKER_INSTRUCTIONS.md for details.

  1. Create a .env file (see above).
  2. Run docker-compose up -d.
  3. Execute the downloader: docker-compose exec vmr-migration python production_migration_engine_new.py.

About

A robust, production-ready automation engine to recursively crawl, download files, and extract metadata from the ViewMyRecords (VMR) platform using Playwright.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors