You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A Study on the Image Classification Performance of Data Augmentation Techniques Using NeRF*(3D Reconstruction Model)
The project aims to utilize 3D object generation models like NeRF, 3D Gaussian Splatting, or Mesh to better understand occluded or unseen parts of objects, such as their backsides, and to address the issue of viewpoint variation in image classification.
We initiated the project with the belief that these models can enhance image classification accuracy by generating diverse viewpoints.
Applicable to various categories, not just for car and chair categories.
Point of View
In this project, we use 14 povs as follows.
Filename
Perspective
filename_0.png
Front
filename_1.png
Back
filename_2.png
Left Side
filename_3.png
Right Side
filename_4.png
Top
filename_5.png
Top Left
filename_6.png
Top Right
filename_7.png
(Back) Top Right
filename_8.png
(Back) Top Left
filename_9.png
Bottom
filename_10.png
Bottom Left
filename_11.png
Bottom Right
filename_12.png
(Back) Bottom Right
filename_13.png
(Back) Bottom Left
When using InstantMesh, we use filename_6.png (Top Right perspective) because it represents at least three planes, providing more detailed information about the object.
Classes
The original ShapeNetCore dataset categories are identified by synset_id. We have mapped these to custom-defined indices as shown below: