Skip to content

feat: add --verbose flag to benchmark CLI for per-file metric output …#27

Merged
umayer16 merged 1 commit intomainfrom
umayer16-patch-12
Mar 19, 2026
Merged

feat: add --verbose flag to benchmark CLI for per-file metric output …#27
umayer16 merged 1 commit intomainfrom
umayer16-patch-12

Conversation

@umayer16
Copy link
Copy Markdown
Owner

…(#23)

Summary

Fixes #23 — adds a --verbose flag to the benchmark subcommand so metric details are visible during a run without needing to open the JSON.

Usage

python vibebench.py benchmark --tasks datasets/prompts.json --verbose

Example Output (verbose mode)

[CHATGPT] Analyzing TASK-001_chatgpt.py...
  Complexity      : 1.8
  Docstring Cover : 0.0%
  Bad Practices   : 0
  Execution Time  : 0.060s
  Status          : Success

Changes

vibebench.py

  • VibeBench.__init__ accepts new verbose=False parameter
  • New private _print_verbose(record) method formats and prints per-file metrics with safe N/A fallbacks for null values
  • run_benchmark calls _print_verbose after each file when verbose
  • benchmark subparser gains --verbose / action="store_true" flag

Default (non-verbose) output is completely unchanged.

Notes

This PR is stacked on top of the #24 schema fixes — both are included in this vibebench.py. Merge #24 first if you are cherry-picking.

Closes #23

…23)

## Summary

Fixes #23 — adds a `--verbose` flag to the `benchmark` subcommand so
metric details are visible during a run without needing to open the JSON.

## Usage
```bash
python vibebench.py benchmark --tasks datasets/prompts.json --verbose
```

## Example Output (verbose mode)
```
[CHATGPT] Analyzing TASK-001_chatgpt.py...
  Complexity      : 1.8
  Docstring Cover : 0.0%
  Bad Practices   : 0
  Execution Time  : 0.060s
  Status          : Success
```

## Changes

**`vibebench.py`**
- `VibeBench.__init__` accepts new `verbose=False` parameter
- New private `_print_verbose(record)` method formats and prints
  per-file metrics with safe `N/A` fallbacks for null values
- `run_benchmark` calls `_print_verbose` after each file when verbose
- `benchmark` subparser gains `--verbose` / `action="store_true"` flag

Default (non-verbose) output is completely unchanged.

## Notes

This PR is stacked on top of the #24 schema fixes — both are included
in this `vibebench.py`. Merge #24 first if you are cherry-picking.

Closes #23
@umayer16 umayer16 merged commit 7fc9b3c into main Mar 19, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add --verbose flag to CLI for detailed per-file output

1 participant