Global Development Indicators Analyzer

Subject & Course Information

Faculty: Faculty of Engineering and Technology

Course: PRG2104: Object Oriented Programming

Programme: BSc (Hons) Computer Science

Year: Year 2

Semester: Academic Session 2025, Semester 3

Assignment: Assignment 2

Project Overview

The Global Development Indicators Analyzer is a comprehensive Scala application that analyzes global development data from 2000 to 2020. The project demonstrates core object-oriented programming principles, polymorphism, and sophisticated collection operations to answer three questions about global development trends.

Demonstration

Project Showcase

This Global Development Indicators Analyzer demonstrates excellence in object-oriented programming through several key implementations:

1. GUI Application Demo

  • JavaFX Interface: Modern, user-friendly graphical interface built with JavaFX
  • FXML Layout: Separation of UI design and logic using FXML files
  • Interactive Controls: Dynamic filtering and query execution through GUI components
  • Real-time Results: Immediate display of analytical results with formatted output

Figure 1: JavaFX GUI Application Interface - Professional data analysis dashboard with interactive controls

2. CLI Application Demo

  • Interactive Command-Line Interface: Professional CLI with guided user input and colored output
  • Automatic Basic Analysis: Displays answers to all three assignment questions immediately upon startup
  • Enhanced User Experience: Features colorized text output with red highlighting for answers
  • Advanced Interactive Mode: Optional detailed exploration with custom filtering options
  • Data Validation: Input verification for years, countries, and analysis parameters
  • Formatted Output: Clean, readable results presentation with professional styling

Comprehensive Analysis Overview

Complete Basic Analysis (All Three Questions):
Figure 2: Complete CLI Analysis - Automatic display of all three assignment questions with filtered answers

Detailed Interactive Features

Question 1: Life Expectancy Analysis with Custom Filtering:
Figure 3: CLI Life Expectancy Query - Interactive filtering by country and year with user-guided prompts

Question 2: Health & Education Excellence (3-Country Ranking):
Figure 4a: CLI Health & Education Ranking - Top 3 countries with composite scoring algorithm

Question 2: Health & Education Excellence (5-Country Ranking):
Figure 4b: CLI Health & Education Ranking - Extended top 5 countries analysis

Question 3: Forest Area Loss Analysis with Custom Time Range:
Figure 5: CLI Forest Area Loss Analysis - Custom year range selection with temporal data comparison

Assignment Questions Addressed

1. Life Expectancy Analysis

Question: Which country achieved the highest life expectancy in the dataset, and in which year?

Implementation: Our lifeExpectancyQuery method processes the dataset with functional collection operations to find the record with the highest (or lowest) life expectancy, with optional filtering by country and year.
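A minimal sketch of this kind of query (the record fields mirror the data model, but the signature and body here are illustrative, not the repository's exact code):

```scala
// Simplified stand-in for GdpRecord with only the fields used here
case class Rec(country_name: String, year: Int, life_expectancy: Option[Double])

// Apply optional country/year filters, then pick the record with the
// maximum defined life expectancy; None if nothing matches
def lifeExpectancyQuery(records: Seq[Rec],
                        filterCountry: Option[String] = None,
                        filterYear: Option[Int] = None): Option[(String, Int, Double)] =
  records
    .filter(_.life_expectancy.isDefined)
    .filter(r => filterCountry.forall(_ == r.country_name))
    .filter(r => filterYear.forall(_ == r.year))
    .map(r => (r.country_name, r.year, r.life_expectancy.get))
    .maxByOption(_._3)
```

maxByOption returns None on an empty collection, so an unmatched filter degrades gracefully instead of throwing.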

2. Health & Education Excellence

Question: Which country performed best in Health & Education throughout the entire dataset?

Evaluation Criteria:

  • Life expectancy
  • Child mortality rates
  • School enrollment (secondary)
  • Healthcare capacity
  • Health development ratio

Implementation: The bestHealthEducationQuery method employs advanced collection grouping and aggregation techniques to compute composite scores across multiple health and education indicators.
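A hedged sketch of the composite-scoring idea with just two of the five indicators (the field names and equal weighting are assumptions; the repository's scoring may differ):

```scala
// Simplified record carrying two of the five indicators
case class CountryStats(country_name: String,
                        life_expectancy: Option[Double],
                        school_enrollment: Option[Double])

def bestHealthEducation(records: Seq[CountryStats]): Option[(String, Double)] = {
  // average of the defined values, or None if all are missing
  def avg(xs: Seq[Option[Double]]): Option[Double] = {
    val defined = xs.flatten
    if (defined.isEmpty) None else Some(defined.sum / defined.size)
  }
  records
    .groupBy(_.country_name)
    .map { case (country, recs) =>
      val scores = Seq(avg(recs.map(_.life_expectancy)),
                       avg(recs.map(_.school_enrollment)))
      country -> scores.flatten.sum // missing indicators contribute nothing
    }
    .maxByOption(_._2)
}
```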

3. Forest Area Loss Analysis

Question: Which country had the greatest loss of forest area from 2000 to 2020, and what was the amount lost?

Implementation: Our forestLossQuery method utilizes sophisticated data transformation and temporal analysis to calculate forest area changes across the specified timeframe.

Technical Architecture

Core Object-Oriented Programming (OOP) Principles

1. Encapsulation

  • Data Model (GdpData trait and GdpRecord case class): Encapsulates all global development indicators in a well-defined structure
  • Agent Class: Encapsulates all analytical operations and business logic
  • DataLoader Object: Encapsulates data loading and CSV parsing functionality

// Trait defines the contract for GDP data
trait GdpData {
  def year: Int
  def country_name: String
  def life_expectancy: Option[Double]
  // ... other fields
}

// Case class provides immutable implementation
case class GdpRecord(...) extends GdpData

2. Abstraction

  • GdpData Trait: Provides abstract interface for all development indicator data
  • Agent Class: Abstracts complex analytical operations behind simple method calls
  • Utility Methods: Abstract common operations like averageOfDefined for reusable functionality

3. Inheritance

  • GdpRecord extends GdpData: Demonstrates inheritance hierarchy
  • JavaFX Application Structure: Extends JavaFX classes for GUI implementation

Advanced Polymorphism Implementation

1. Parametric Polymorphism

// Parametric polymorphism comes from the generic collection and Option
// types (Seq[A], Option[A]); these helpers instantiate them at Double
private def averageOfDefined(seq: Seq[Option[Double]]): Option[Double]
private def safeToOptionDouble(s: String): Option[Double]
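Possible bodies for the two declared helpers (sketches consistent with the signatures above, shown standalone; the repository's private implementations may differ):

```scala
// Drop the Nones, average the rest; None if nothing is defined
def averageOfDefined(seq: Seq[Option[Double]]): Option[Double] = {
  val defined = seq.flatten
  if (defined.isEmpty) None else Some(defined.sum / defined.size)
}

// Null, blank, or malformed input all become None
def safeToOptionDouble(s: String): Option[Double] =
  Option(s).map(_.trim).filter(_.nonEmpty).flatMap(_.toDoubleOption)
```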

2. Method Overloading

// Multiple query variations with different parameter combinations
def lifeExpectancyQuery(highest: Boolean = true): Option[(String, Int, Double)]
def lifeExpectancyQuery(highest: Boolean, filterCountry: Option[String]): Option[(String, Int, Double)]

3. Trait-based Polymorphism

The GdpData trait enables polymorphic behavior across different data implementations, allowing for future extensibility without breaking existing code.
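For instance, a hypothetical new record type could join existing queries unchanged (ProjectedRecord and countriesIn here are illustrative, not repository code):

```scala
// Reduced version of the trait with only two members
trait GdpData {
  def year: Int
  def country_name: String
}

case class GdpRecord(year: Int, country_name: String) extends GdpData

// Hypothetical future subtype with an extra field; code written against
// Seq[GdpData] keeps working and simply ignores the new field
case class ProjectedRecord(year: Int, country_name: String,
                           projected: Boolean) extends GdpData

def countriesIn(records: Seq[GdpData], year: Int): Set[String] =
  records.filter(_.year == year).map(_.country_name).toSet
```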

Sophisticated Collection Operations

1. Advanced Filtering and Mapping

val filtered = records
  .filter(_.life_expectancy.isDefined)           // Type-safe filtering
  .filter(r => filterCountry.forall(_ == r.country_name))  // Conditional filtering
  .filter(r => filterYear.forall(_ == r.year))            // Multiple criteria

2. Complex Grouping and Aggregation

val group = records.groupBy(_.country_name).map { case (country, recs) =>
  val scores = indicators.map {
    case "life_expectancy" => averageOfDefined(recs.map(_.life_expectancy))
    case "child_mortality" => averageOfDefined(recs.map(_.child_mortality)).map(100 - _)
    // ... pattern matching for the remaining indicators
    case _                 => None  // unknown indicators contribute nothing
  }
  country -> scores.flatten.sum  // sum only the defined scores
}

3. Functional Data Transformation

// Elegant functional programming with flatMap and for-comprehensions
val group = records.groupBy(_.country_name).flatMap { case (country, recs) =>
  val byYear = recs.filter(_.forest_area_pct.isDefined)
                  .map(r => r.year -> r.forest_area_pct.get).toMap
  
  for {
    start <- byYear.get(fromYear)
    end <- byYear.get(toYear)
  } yield {
    val loss = start - end
    country -> loss
  }
}

Code Quality & Maintainability Features

1. Modular Design

  • Separation of Concerns: Each class has a single, well-defined responsibility
  • DataLoader: Handles only data loading and parsing
  • Agent: Focuses exclusively on analytical operations
  • GUI Components: Separate presentation layer from business logic

2. Extensibility

  • Trait-based Architecture: Easy to add new data types implementing GdpData
  • Configurable Queries: Methods accept parameters for different analysis scenarios
  • Plugin Architecture: New analytical methods can be easily added to the Agent class

3. Error Handling & Robustness

// Safe parsing with default values; toDoubleOption also catches malformed input
private def safeToDouble(s: String): Double =
  if (s == null) 0.0 else s.trim.toDoubleOption.getOrElse(0.0)

private def safeToOptionDouble(s: String): Option[Double] =
  if (s == null) None else s.trim.toDoubleOption

4. Type Safety

  • Option Types: Proper handling of missing data with Option[Double]
  • Case Classes: Immutable data structures prevent accidental modifications
  • Strong Typing: Compile-time error detection for data type mismatches

5. Performance Optimization

  • Lazy Evaluation: Collections use lazy evaluation where appropriate
  • Efficient Grouping: Single-pass grouping operations minimize data traversal
  • Memory Efficiency: Immutable data structures with structural sharing
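The lazy-evaluation point can be illustrated with a collection view, a general Scala idiom (this snippet is illustrative, not taken from the repository):

```scala
// A view defers map/filter work until a result is demanded, so no
// intermediate collection is built for each pipeline stage
val firstMatch: Option[Int] = (2000 to 2020).view
  .map(_ * 2)          // not evaluated yet
  .filter(_ % 3 == 0)  // still not evaluated
  .headOption          // forces only as many elements as needed
```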

Project Structure

PRG2104-Group-Assignment/
├── .git/                           # Git version control directory
├── .gitignore                      # Git ignore file
├── build.sbt                       # SBT build configuration
├── README.md                       # Project documentation
├── images/                         # Project demonstration screenshots
│   ├── gui.png                     # GUI application interface
│   ├── 3in1.png                    # Complete CLI analysis (all 3 questions)
│   ├── cli_q1_with_customization.png          # CLI Q1 custom filtering demo
│   ├── cli_q2_with_customization-3-rank.png   # CLI Q2 top 3 countries demo
│   ├── cli_q2_with_customization-5-rank.png   # CLI Q2 top 5 countries demo
│   └── cli_q3_with_customization.png          # CLI Q3 custom time range demo
├── src/
│   └── main/
│       ├── scala/
│       │   ├── Main.scala              # GUI entry point
│       │   ├── CLIMain.scala           # CLI entry point
│       │   └── com/sunway/welovesunway/
│       │       ├── GdpData.scala       # Data model (trait + case class)
│       │       ├── Agent.scala         # Core analytical engine
│       │       ├── DataLoader.scala    # CSV parsing and data loading
│       │       ├── JavaFXApp.scala     # JavaFX application framework
│       │       └── MainController.scala # GUI controller logic
│       └── resources/
│           ├── Global_Development_Indicators_2000_2020.csv # Dataset
│           └── com/sunway/welovesunway/
│               └── MainApp.fxml        # JavaFX GUI layout definition
└── target/                         # Compiled output directory
    └── scala-3.3.6/                # Scala version-specific builds

Dependencies & Technology Stack

  • Scala 3.3.6: Modern functional programming language
  • JavaFX 21.0.5: Cross-platform GUI framework
  • Scala-CSV 2.0.0: Efficient CSV parsing library
  • SBT: Scala build tool for dependency management

Running the Application

GUI Mode

sbt run

CLI Mode

sbt "runMain com.sunway.welovesunway.CLIMain"

3. Advanced OOP Features in Action

Encapsulation Example:

// Data is safely encapsulated within the GdpRecord case class
val data = DataLoader.loadData("dataset.csv")
val analyzer = new Agent(data)  // Business logic encapsulated in Agent class

Polymorphism Demonstration:

// Trait-based polymorphism allows flexible data handling
trait GdpData { /* abstract interface */ }
case class GdpRecord(...) extends GdpData  // Concrete implementation

// Method overloading for different query scenarios
analyzer.lifeExpectancyQuery()  // Default parameters
analyzer.lifeExpectancyQuery(highest = false, filterCountry = Some("Japan"))

Advanced Collection Operations:

// Sophisticated functional programming with collections
val results = records
  .filter(_.life_expectancy.isDefined)
  .groupBy(_.country_name)
  .map { case (country, recs) => 
    country -> recs.map(_.life_expectancy.get).max 
  }
  .toSeq.sortBy(-_._2)

4. Real-World Problem Solving

Life Expectancy Analysis:

  • Identifies countries with highest/lowest life expectancy
  • Supports filtering by country or year
  • Demonstrates efficient data processing and optimization

Health & Education Ranking:

  • Multi-criteria evaluation using 5 key indicators
  • Composite scoring algorithm with weighted averages
  • Advanced aggregation techniques

Forest Area Loss Calculation:

  • Temporal data analysis comparing 2000 vs 2020
  • Handles missing data gracefully using Option types
  • Sophisticated data transformation pipelines

5. Code Quality Demonstrations

Type Safety:

// Option types prevent null pointer exceptions; toDoubleOption also catches malformed input
def safeToOptionDouble(s: String): Option[Double] =
  if (s == null) None else s.trim.toDoubleOption

Error Handling:

// Robust parsing with safe defaults
private def safeToDouble(s: String): Double =
  if (s == null) 0.0 else s.trim.toDoubleOption.getOrElse(0.0)

Functional Programming:

// Elegant for-comprehensions for data transformation
for {
  start <- byYear.get(fromYear)
  end <- byYear.get(toYear)
} yield {
  val loss = start - end
  country -> loss
}

6. Performance Optimizations

  • Single-pass Data Processing: Efficient algorithms that minimize data traversal
  • Lazy Evaluation: Deferred computation for improved memory usage
  • Immutable Data Structures: Thread-safe operations with structural sharing
  • Stream Processing: Functional pipelines for large dataset handling

7. Professional Software Engineering

  • Modular Architecture: Clear separation of concerns across multiple classes
  • Extensible Design: Easy to add new analytical methods and data types
  • Documentation: Comprehensive code comments and API documentation
  • Testing-Ready: Structure supports unit testing and integration testing

Benefits

Flexible, reusable methods. Queries written once against GdpData records work for any data subset. For example, lifeExpectancyQuery can be applied to a sequence of records from any country or year without change. Because each query is written abstractly against the trait, adding a new data source, such as another CSV with the same schema, requires no changes to the analysis logic.

Extensibility. Because the trait defines a common interface, adding new indicators or record types is straightforward. If we extend GdpData with new fields or create a new subclass with extra data, existing queries on the trait continue to work while ignoring the new fields, and we can write additional query methods for the new fields without rewriting core logic. New cases fit in without breaking older code.

Clean and maintainable code. The CLI and JavaFX GUI layers simply call these abstract query methods and display results, without needing to know the concrete record type. The Agent hides the details of how the data is processed. Because we use higher-order functions and trait types, the query logic stays concise: rather than nested loops and type checks, we use .filter and .groupBy, which keeps the code readable and easy to modify.

Limitations

Type inference and complexity. Chaining many collection operations on abstract types sometimes made Scala's type inference hard to manage, and return types mixing Option and collection types (Option and Seq) can become complicated. In some cases we had to add explicit type annotations or helper methods to guide the compiler. For example, handling Option[Double] for nullable fields such as life_expectancy required explicit .get calls or .flatten, which adds verbosity and potential null-safety issues. Debugging a type mismatch in a long chain was occasionally tricky because error messages referred to the high-level types, blurring which step was wrong.
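One concrete instance of that verbosity, shown with a simplified record (illustrative, not repository code): the two common ways to extract values from Option[Double] fields.

```scala
case class Row(life_expectancy: Option[Double]) // simplified record

val rows = Seq(Row(Some(80.0)), Row(None), Row(Some(70.0)))

// Terse but fragile: .get throws if a None ever slips past the filter
val viaGet: Seq[Double] =
  rows.filter(_.life_expectancy.isDefined).map(_.life_expectancy.get)

// Safe and just as short once known: flatMap drops the Nones
val viaFlatMap: Seq[Double] = rows.flatMap(_.life_expectancy)
```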

Immutable vs. mutable collections. Scala's collections are immutable by default, which is great for safety but sometimes inconvenient. In this project we mostly used immutable Seq and Map, so every transformation creates a new collection. For large datasets this can carry a real performance cost. If we needed in-place updates or incremental building (for example, reading a huge CSV record by record), mutable buffers might have been more efficient, but mixing immutable and mutable code complicates the design. We found that operations like groupBy build large intermediate Maps with no easy way to update them in place; switching to mutable structures such as ArrayBuffer could improve performance but would reduce the elegance of the code.

Debugging abstract logic. When our code fails, it often fails at the abstract interface rather than at concrete data. If a query had a logic bug, stack traces pointed to the collection operations on GdpData rather than to a problem in a specific field. Since our queries often use anonymous functions like _.country_name or _.forest_area_pct.get, an out-of-bounds or missing-value error would mention those generic pieces, which made debugging somewhat indirect: we had to trace back from the failing step through the polymorphic pipeline to find which record was invalid. In short, the abstraction made some error messages less immediately clear than concrete loops would have.

Overall, subtype polymorphism via Scala traits and the collection API gave us flexibility and cleaner code, but it required extra attention to type details and, in some instances, made debugging harder because of the additional layers of abstraction.

Authors

  • Tan Kok Feng (Project Leader):

    • Designed GUI framework
    • Developed data model architecture
    • Led API integration and interfacing
    • Finalized project documentation & GitHub readme
    • GitHub Profile: KevinTan2025
  • Wong Yu Xuan:

    • Developed CLI interface
    • Built core analytical engine
    • Planned and defined API architecture
    • GitHub Profile: Meghan924
  • Yeoh JinWei:

    • Restructured data model
    • Developed GUI interface
    • Wrote Section 2 of documentation
    • GitHub Profile: Yeoh-JinWei

About

Collaborative OOP assignment in Scala for Global Development Indicators analysis (2000–2020). Features modular code, branch workflow, and clear commit history.
