![]() ![]() ![]() These models are trained on 14 different sequences from the TUM RGB-D dataset (totaling over 23,000 samples) without the need for any ground truth pose information. This work presents a set of logistic regression and neural network models that can be used to select good frames for the structure from motion process by using a small set of summary data related to the matched correspondences. This is due to the fact that frame suitability is largely determined by the cameras’ baseline length and the coplanarity of the scene points, both of which are unknown prior to initialization. Single-camera (monocular) systems that use these techniques need to select appropriate frames for the structure from motion process from the camera’s pre-initialization motion, but this frame selection problem is non-trivial. Markerless visual SLAM systems that use natural features in images and make use of multiple view geometry methods require an initialization procedure that typically involves using structure from motion techniques to solve for the first pair of keyframes and subsequently triangulating the initial set of 3-space points with the resulting keyframes. Visual SLAM utilizes visual data from a camera to accomplish the same. Simultaneous Localization and Mapping (SLAM) is a method to simultaneously map the environment and estimate the location and pose of an agent. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |