LLM4UnitTests-SC: Unit Test Generation for Smart Contracts Using LLMs

Overview

LLM4UnitTests-SC is a platform that automates the generation of unit tests for Solidity smart contracts using Large Language Models (LLMs).
It was developed as part of an academic end-of-year project at the National Engineering School of Sfax (ENIS) to explore the potential of LLMs in automating blockchain testing workflows.

The system integrates Hardhat, Node.js, Spring Boot, and React into a unified environment that:

Builds and optimizes prompts
Interacts with local LLMs
Validates generated code syntax
Executes smart contract tests
Calculates code coverage
Presents results through an interactive web interface

Objectives

Automate the generation, validation, and execution of smart contract unit tests.
Evaluate the performance of multiple LLMs.
Compare prompt engineering strategies for improving code quality and coverage.
Provide developers with a user-friendly platform for AI-assisted testing.

Evaluated Models

Codestral
DeepSeek-R1 (7B / 14B)
Llama 3 (8B)
Phi-4 (14B)

System Architecture

LLM4UnitTests-SC follows a three-tier architecture:

1. Frontend (React)

User interface for uploading smart contracts, configuring prompts, and visualizing test results.
Displays coverage metrics, syntax errors, and execution outcomes.
Enables downloading generated test reports (PDF).

2. Backend (Spring Boot)

Handles prompt construction, LLM interaction, and process orchestration.
Exposes REST APIs for live feedback and test progress.
Integrates Spring AI to connect with local and API-based LLMs.
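
The LLM interaction step can be illustrated with a raw HTTP call. The backend itself goes through Spring AI, so the JavaScript below is only a sketch of an equivalent request against Ollama's /api/generate endpoint. The default address http://localhost:11434, the model tag in the usage note, and the function names are assumptions, not taken from the project.

```javascript
// Hypothetical helpers; the project's backend uses Spring AI rather than this code.
const OLLAMA_URL = "http://localhost:11434/api/generate"; // Ollama's default local address (assumed unchanged)

// Build the fetch arguments for a single non-streaming completion.
function buildGenerateRequest(model, prompt) {
  return {
    url: OLLAMA_URL,
    options: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, prompt, stream: false }),
    },
  };
}

// Send the prompt and return the generated text (needs Node.js >= 18 for global fetch).
async function generate(model, prompt) {
  const { url, options } = buildGenerateRequest(model, prompt);
  const res = await fetch(url, options);
  if (!res.ok) throw new Error(`Ollama returned HTTP ${res.status}`);
  const data = await res.json();
  return data.response; // with stream: false, the text is in the "response" field
}
```

With a model already pulled into Ollama, a call such as generate("llama3:8b", promptText) would return the raw reply, which still has to be cleaned and validated before execution.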

3. Processing Layer (Node.js + Hardhat)

Uses Babel Parser for syntax validation and correction.
Executes Solidity test suites through Hardhat.
Generates coverage reports and sends structured results to the backend.
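
The syntax validation step can be sketched without any dependency. The project uses Babel Parser; the example below swaps in Node's built-in vm.Script, which compiles source without running it, so that the sketch runs on a bare Node.js install. The function name and the file name passed to the compiler are hypothetical.

```javascript
const vm = require("node:vm");

// Compile (but never run) the generated test source and report any syntax error.
// Note: this stands in for Babel Parser and parses the input as a classic script,
// so ES module "import" statements would be reported as errors.
function checkSyntax(source) {
  try {
    new vm.Script(source, { filename: "generated.test.js" });
    return { valid: true, error: null };
  } catch (err) {
    if (err && err.name === "SyntaxError") {
      return { valid: false, error: err.message };
    }
    throw err; // anything else is unexpected and should surface
  }
}
```

Because the source is only compiled, globals such as describe or it do not need to exist at validation time; they are resolved later, when Hardhat runs the file.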

Core Features

Upload Solidity smart contracts
Generate prompts dynamically (5 types)
Send prompts to selected LLM (API or local)
Filter and clean raw LLM output
Validate JavaScript syntax (Babel Parser)
Execute tests (Hardhat) and calculate coverage
Visualize metrics in real time
Download PDF summary of results
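
The "filter and clean raw LLM output" step is not specified further, so the following is a minimal sketch under assumed rules: drop any <think>...</think> reasoning section (as emitted by DeepSeek-R1), then keep the body of the first Markdown code fence, falling back to the whole reply when no fence is present. The function name is hypothetical.

```javascript
const FENCE = "`".repeat(3); // a Markdown code fence, built here to avoid nesting fences

// Return the test code found in a raw LLM reply (assumed cleaning rules, not the project's).
function extractCode(raw) {
  // Drop reasoning sections such as <think>...</think>.
  const withoutThinking = raw.replace(/<think>[\s\S]*?<\/think>/g, "");
  // Capture the body of the first fenced block, ignoring an optional language tag.
  const fenced = new RegExp(FENCE + "[\\w-]*\\r?\\n([\\s\\S]*?)" + FENCE);
  const match = withoutThinking.match(fenced);
  // Fall back to the whole reply when the model returned bare code.
  return (match ? match[1] : withoutThinking).trim();
}
```

The cleaned string is what would be handed to syntax validation; surrounding chatter such as "Here is the test:" is discarded along with the fence markers.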

Prompt Strategies

Five prompt configurations were implemented and tested:

Type 1: Zero-shot (no contract code)
Type 2: Zero-shot with example test
Type 3: Zero-shot with contract code
Type 4: One-shot with contract code (best performing)
Type 5: Zero-shot with ABI
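
The five configurations differ mainly in which inputs are attached to the instruction. The sketch below encodes one reading of the list above as a lookup table; the wording of the instruction, the field names (name, source, abi, exampleTest), and the assumption that Type 1 supplies only the contract name are all hypothetical.

```javascript
// Which inputs each prompt type attaches (one reading of the list; an assumption).
const PROMPT_TYPES = {
  1: { code: false, example: false, abi: false },
  2: { code: false, example: true, abi: false },
  3: { code: true, example: false, abi: false },
  4: { code: true, example: true, abi: false },
  5: { code: false, example: false, abi: true },
};

// Assemble a prompt from hypothetical inputs: { name, source, abi, exampleTest }.
function buildPrompt(type, contract) {
  const parts = PROMPT_TYPES[type];
  if (!parts) throw new RangeError(`Unknown prompt type: ${type}`);
  const sections = [
    `Write Hardhat unit tests (Mocha + Chai) for the Solidity contract "${contract.name}".`,
  ];
  if (parts.code) sections.push(`Contract source:\n${contract.source}`);
  if (parts.abi) sections.push(`Contract ABI:\n${JSON.stringify(contract.abi)}`);
  if (parts.example) sections.push(`Example test to imitate:\n${contract.exampleTest}`);
  sections.push("Return only JavaScript code.");
  return sections.join("\n\n");
}
```

Keeping the variants in a table makes it straightforward to run the same contract through all five types and compare the resulting error rates and coverage.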

Metrics Used

Syntax Error Rate (via Babel Parser)
Coverage (Statements, Branches, Functions, Lines — via Hardhat Coverage)
Human Intervention (amount of manual correction needed)
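
The first two metrics reduce to simple ratios. As a sketch, assuming validation results shaped like { valid: boolean } and Istanbul-style hit-count maps (statement and function maps hold numbers, branch maps hold arrays of numbers), they could be computed as follows. The function names and input shapes are assumptions, not the project's code.

```javascript
// Share of generated test files that failed syntax validation, as a percentage.
function syntaxErrorRate(results) {
  if (results.length === 0) return 0;
  const failed = results.filter((r) => !r.valid).length;
  return (failed / results.length) * 100;
}

// Percentage of entries hit at least once in an Istanbul-style hit-count map.
function coveragePercent(hits) {
  const counts = Object.values(hits).flat(); // branch maps hold arrays of counts
  if (counts.length === 0) return 100; // nothing to cover
  const covered = counts.filter((n) => n > 0).length;
  return (covered / counts.length) * 100;
}
```

For example, a statement map of { 1: 1, 2: 0, 3: 4, 4: 2 } has three of four statements executed, giving 75% statement coverage.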

Tools and Technologies

Category	Tools / Frameworks
Smart Contracts	Solidity
Test Framework	Hardhat (Mocha + Chai)
Syntax Validation	Node.js (Babel Parser)
Backend	Spring Boot + Spring AI
Frontend	React
LLM Runtime	Ollama (local models)
Visualization	Hardhat Coverage + React Dashboard

Installation & Setup

Prerequisites

Node.js ≥ 18
Java ≥ 17
npm
Hardhat
Maven
Ollama (for local LLMs)
