Will you release the human preference results for verifying your evaluation method?