Thank you for your implementation. I have a question about applying your NeuralREG framework: is it applicable to visual datasets such as COCO or RefCOCO, where the input is an image plus a target region? If so, could you please provide an example or instructions for applying NeuralREG to this type of data?
Thank you so much in advance!