Twist returns a categorical string ("BUILDABLE"), whereas IDT provides a numerical score.
I was thinking of a simple approach of normalizing outputs to a common scale (e.g., mapping "BUILDABLE" to a
specific numerical score like 0.0, and 10.0 for a sequence with issues).