ci: Add LLM Security Scan Prompt by 0xNeshi · Pull Request #171 · OpenZeppelin/contracts-sui

0xNeshi · 2026-02-18T18:15:00Z

WARNING

MOST OF THIS WAS CREATED USING LLM AND IS NOT YET TESTED
This is an untested draft proposal created as a result of recent meeting with auditors who suggested we use similar prompts during development, the idea being to gauge whether this is something we even want to have in our code base
Before this is merged, we should double- and triple-check that it is safe to do so (e.g. that it CANNOT POSSIBLY expose any sensitive API keys)

Proposal

How this is imagined to behave:

runs whenever main is updated (alternative is to run every X days/weeks to accumulate enough meaningful changes)
analyzes the code for likely security vulnerabilities
if any potential vulnerabilities are found, the CI job fails and we get a notification on Github
we verify whether this is an actual vulnerability or an LLM hallucination
- if the former - we address it
- if the latter - we ignore it

Potential Issues

1. We already have a tool that does exactly this, it's called X.

Awesome, let's integrate that instead, and close this PR!

2. The CI job implementation actually has problem with X, it should do Y instead.

It's excellent that you noticed this problem and suggested an improvement! The better we make the CI job now, the more bugs we'll catch when it runs later.

3. The CI job turns out to have too many false positives OR reports too few basic vulnerabilities that are later surfaced in the actual audit.

We try to improve the prompt to improve the job's effectiveness. In the extreme case, we determine the CI job is more often than not useless, so we remove it, and call it a day.

4. The risk of leaking sensitive data is too great to allow running such a CI job.

Fair critique and very important! If possible, let's try to make the LLM sandboxed enough that we feel at ease. If not possible, or we feel to uncomfortable, we shouldn't just close the PR without merging and that's that.

coderabbitai · 2026-02-18T18:15:12Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 002fb691-f8cd-4dde-8a65-22a76eb9a861

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch llm-sec-ci-job

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codecov · 2026-02-18T18:17:05Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 89.88%. Comparing base (63a864e) to head (0faaef0).

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #171   +/-   ##
=======================================
  Coverage   89.88%   89.88%           
=======================================
  Files          19       19           
  Lines        1790     1790           
  Branches      484      484           
=======================================
  Hits         1609     1609           
  Misses        168      168           
  Partials       13       13

Flag	Coverage Δ
contracts/access	`44.87% <ø> (ø)`
math/core	`86.12% <ø> (ø)`
math/fixed_point	`58.71% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

This reverts commit aa22868.

This reverts commit d414322.

0xNeshi added 2 commits February 18, 2026 19:04

ci: add LLM security scan job

dc8d48c

ref: remove Published.toml

cffb656

0xNeshi self-assigned this Feb 18, 2026

0xNeshi added 13 commits February 18, 2026 19:30

ci: use codex-action instead of manual setup

cff6a28

Merge remote-tracking branch 'origin/main' into llm-sec-ci-job

21655f7

align actions/checkout to v4

aa22868

Revert "align actions/checkout to v4"

d9089dd

This reverts commit aa22868.

pin openai/codex-action version tag

6c44da2

focus the scan on Move + refactor

11b5e0a

extract schema into a local file

cc198cb

include workflow_dispatch trigger

0041cbf

split scan and processing into separate conditional job

d998c4b

remove artifact upload step from LLM security scan workflow

d414322

Revert "remove artifact upload step from LLM security scan workflow"

fd5c2c8

This reverts commit d414322.

add push trigger to LLM security scan workflow

533cfe7

remove ephemeral flag from codex-args in LLM security scan workflow

0faaef0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: Add LLM Security Scan Prompt#171

ci: Add LLM Security Scan Prompt#171
0xNeshi wants to merge 15 commits intomainfrom
llm-sec-ci-job

0xNeshi commented Feb 18, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Feb 18, 2026 •

edited

Loading

Review skipped

Uh oh!

codecov bot commented Feb 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

0xNeshi commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

WARNING

Proposal

Potential Issues

1. We already have a tool that does exactly this, it's called X.

2. The CI job implementation actually has problem with X, it should do Y instead.

3. The CI job turns out to have too many false positives OR reports too few basic vulnerabilities that are later surfaced in the actual audit.

4. The risk of leaking sensitive data is too great to allow running such a CI job.

Uh oh!

coderabbitai bot commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

codecov bot commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

0xNeshi commented Feb 18, 2026 •

edited

Loading

coderabbitai bot commented Feb 18, 2026 •

edited

Loading

codecov bot commented Feb 18, 2026 •

edited

Loading