Extraction and Research of Crop Feature Points Based on Computer Vision

Jingwen Cui; Jianping Zhang; Guiling Sun; Bowen Zheng

doi:10.3390/s19112553

Extraction and Research of Crop Feature Points Based on Computer Vision

Sensors ◽

10.3390/s19112553 ◽

2019 ◽

Vol 19 (11) ◽

pp. 2553 ◽

Cited By ~ 3

Author(s):

Jingwen Cui ◽

Jianping Zhang ◽

Guiling Sun ◽

Bowen Zheng

Keyword(s):

Computer Vision ◽

Point Cloud ◽

High Accuracy ◽

Depth Camera ◽

Positioning Error ◽

Feature Points ◽

Matching Method ◽

Depth Images ◽

Computer Vision Technology ◽

Rgb Images

Based on computer vision technology, this paper proposes a method for identifying and locating crops in order to successfully capture crops in the process of automatic crop picking. This method innovatively combines the YOLOv3 algorithm under the DarkNet framework with the point cloud image coordinate matching method, and can achieve the goal of this paper very well. Firstly, RGB (RGB is the color representing the three channels of red, green and blue) images and depth images are obtained by using the Kinect v2 depth camera. Secondly, the YOLOv3 algorithm is used to identify the various types of target crops in the RGB images, and the feature points of the target crops are determined. Finally, the 3D coordinates of the feature points are displayed on the point cloud images. Compared with other methods, this method of crop identification has high accuracy and small positioning error, which lays a good foundation for the subsequent harvesting of crops using mechanical arms. In summary, the method used in this paper can be considered effective.

Download Full-text

Feature Matching Based on Triangle Guidance and Constraints

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001418550145 ◽

2018 ◽

Vol 32 (08) ◽

pp. 1855014 ◽

Cited By ~ 2

Author(s):

Hongmin Liu ◽

Hongya Zhang ◽

Zhiheng Wang ◽

Yiming Zheng

Keyword(s):

Feature Matching ◽

High Accuracy ◽

Geometric Constraints ◽

Feature Point ◽

Feature Points ◽

Matching Method ◽

Shared Feature ◽

Triangular Structure ◽

Circle Center ◽

Image Transformations

For images with distortions or repetitive patterns, the existing matching methods usually work well just on one of the two kinds of images. In this paper, we present novel triangle guidance and constraints (TGC)-based feature matching method, which can achieve good results on both kinds of images. We first extract stable matched feature points and combine these points into triangles as the initial matched triangles, and triangles combined by feature points are as the candidates to be matched. Then, triangle guidance based on the connection relationship via the shared feature point between the matched triangles and the candidates is defined to find the potential matching triangles. Triangle constraints, specially the location of a vertex relative to the inscribed circle center of the triangle, the scale represented by the ratio of corresponding side lengths of two matching triangles and the included angles between the sides of two triangles with connection relationship, are subsequently used to verify the potential matches and obtain the correct ones. Comparative experiments show that the proposed TGC can increase the number of the matched points with high accuracy under various image transformations, especially more effective on images with distortions or repetitive patterns due to the fact that the triangular structure are not only stable to image transformations but also provides more geometric constraints.

Download Full-text

Automatic 3D Landmark Extraction System Based on an Encoder–Decoder Using Fusion of Vision and LiDAR

Remote Sensing ◽

10.3390/rs12071142 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1142

Author(s):

Jeonghoon Kwak ◽

Yunsick Sung

Keyword(s):

Point Cloud ◽

Point Clouds ◽

Depth Image ◽

3D Point Cloud ◽

Digital World ◽

Depth Images ◽

3D Point Clouds ◽

Rgb Images ◽

Rgb Image ◽

3D Landmarks

To provide a realistic environment for remote sensing applications, point clouds are used to realize a three-dimensional (3D) digital world for the user. Motion recognition of objects, e.g., humans, is required to provide realistic experiences in the 3D digital world. To recognize a user’s motions, 3D landmarks are provided by analyzing a 3D point cloud collected through a light detection and ranging (LiDAR) system or a red green blue (RGB) image collected visually. However, manual supervision is required to extract 3D landmarks as to whether they originate from the RGB image or the 3D point cloud. Thus, there is a need for a method for extracting 3D landmarks without manual supervision. Herein, an RGB image and a 3D point cloud are used to extract 3D landmarks. The 3D point cloud is utilized as the relative distance between a LiDAR and a user. Because it cannot contain all information the user’s entire body due to disparities, it cannot generate a dense depth image that provides the boundary of user’s body. Therefore, up-sampling is performed to increase the density of the depth image generated based on the 3D point cloud; the density depends on the 3D point cloud. This paper proposes a system for extracting 3D landmarks using 3D point clouds and RGB images without manual supervision. A depth image provides the boundary of a user’s motion and is generated by using 3D point cloud and RGB image collected by a LiDAR and an RGB camera, respectively. To extract 3D landmarks automatically, an encoder–decoder model is trained with the generated depth images, and the RGB images and 3D landmarks are extracted from these images with the trained encoder model. The method of extracting 3D landmarks using RGB depth (RGBD) images was verified experimentally, and 3D landmarks were extracted to evaluate the user’s motions with RGBD images. In this manner, landmarks could be extracted according to the user’s motions, rather than by extracting them using the RGB images. The depth images generated by the proposed method were 1.832 times denser than the up-sampling-based depth images generated with bilateral filtering.

Download Full-text

Flexible point cloud matching method based on three-dimensional image feature points

Advances in Mechanical Engineering ◽

10.1177/1687814018795032 ◽

2018 ◽

Vol 10 (9) ◽

pp. 168781401879503

Author(s):

Haihua Cui ◽

Wenhe Liao ◽

Xiaosheng Cheng ◽

Ning Dai ◽

Changye Guo

Keyword(s):

Point Cloud ◽

Three Dimensional ◽

Point Clouds ◽

Feature Point ◽

Image Feature ◽

Surface Measurement ◽

Surface Model ◽

Feature Points ◽

Matching Method ◽

Dimensional Image

Flexible and robust point cloud matching is important for three-dimensional surface measurement. This article proposes a new matching method based on three-dimensional image feature points. First, an intrinsic shape signature algorithm is used to detect the key shape feature points, using a weighted three-dimensional occupational histogram of the data points within the angular space, which is a view-independent representation of the three-dimensional shape. Then, the point feature histogram is used to represent the underlying surface model properties at a point whose computation is based on the combination of certain geometrical relations between the point’s nearest k-neighbors. The two-view point clouds are robustly matched using the proposed double neighborhood constraint of minimizing the sum of the Euclidean distances between the local neighbors of the point and feature point. The proposed optimization method is immune to noise, reduces the search range for matching points, and improves the correct feature point matching rate for a weak surface texture. The matching accuracy and stability of the proposed method are verified using experiments. This method can be used for a flat surface with weak features and in other applications. The method has a larger application range than the traditional methods.

Download Full-text

Point-Plane SLAM Using Supposed Planes for Indoor Environments

Sensors ◽

10.3390/s19173795 ◽

2019 ◽

Vol 19 (17) ◽

pp. 3795 ◽

Cited By ~ 5

Author(s):

Xiaoyu Zhang ◽

Wei Wang ◽

Xianyu Qi ◽

Ziwei Liao ◽

Ran Wei

Keyword(s):

Fundamental Problem ◽

Factor Graph ◽

Indoor Environments ◽

Feature Points ◽

Estimation Errors ◽

Depth Images ◽

Localization And Mapping ◽

Rgb Images ◽

The Cost ◽

Structural Parallel

Simultaneous localization and mapping (SLAM) is a fundamental problem for various applications. For indoor environments, planes are predominant features that are less affected by measurement noise. In this paper, we propose a novel point-plane SLAM system using RGB-D cameras. First, we extract feature points from RGB images and planes from depth images. Then plane correspondences in the global map can be found using their contours. Considering the limited size of real planes, we exploit constraints of plane edges. In general, a plane edge is an intersecting line of two perpendicular planes. Therefore, instead of line-based constraints, we calculate and generate supposed perpendicular planes from edge lines, resulting in more plane observations and constraints to reduce estimation errors. To exploit the orthogonal structure in indoor environments, we also add structural (parallel or perpendicular) constraints of planes. Finally, we construct a factor graph using all of these features. The cost functions are minimized to estimate camera poses and global map. We test our proposed system on public RGB-D benchmarks, demonstrating its robust and accurate pose estimation results, compared with other state-of-the-art SLAM systems.

Download Full-text

Automated generation of FE mesh of concrete structures from 3D point cloud using computer vision technology

Bridge Maintenance, Safety, Management, Life-Cycle Sustainability and Innovations ◽

10.1201/9780429279119-448 ◽

2021 ◽

pp. 704-704

Author(s):

J. Shu ◽

W. Zhao ◽

K. Zandi

Keyword(s):

Computer Vision ◽

Point Cloud ◽

Concrete Structures ◽

3D Point Cloud ◽

Automated Generation ◽

Computer Vision Technology

Download Full-text

Computer vision technology in the food and beverage industries

10.1533/9780857095770 ◽

2012 ◽

Cited By ~ 8

Author(s):

Da-Wen Sun

Keyword(s):

Computer Vision ◽

Food And Beverage ◽

Computer Vision Technology

Download Full-text

Registration and Fusion of UAV LiDAR System Sequence Images and Laser Point Clouds

Journal of Imaging Science and Technology ◽

10.2352/j.imagingsci.technol.2021.65.1.01050110.2352/j.imagingsci.technol.2021.65.1.010501 ◽

2020 ◽

Author(s):

Jiayong Yu ◽

Longchen Ma ◽

Maoyi Tian, ◽

Xiushan Lu

Keyword(s):

Point Cloud ◽

Image Data ◽

Point Clouds ◽

Optical Image ◽

Point Cloud Data ◽

Feature Points ◽

Lidar System ◽

Registration Accuracy ◽

Cloud Data ◽

Intensity Image

The unmanned aerial vehicle (UAV)-mounted mobile LiDAR system (ULS) is widely used for geomatics owing to its efficient data acquisition and convenient operation. However, due to limited carrying capacity of a UAV, sensors integrated in the ULS should be small and lightweight, which results in decrease in the density of the collected scanning points. This affects registration between image data and point cloud data. To address this issue, the authors propose a method for registering and fusing ULS sequence images and laser point clouds, wherein they convert the problem of registering point cloud data and image data into a problem of matching feature points between the two images. First, a point cloud is selected to produce an intensity image. Subsequently, the corresponding feature points of the intensity image and the optical image are matched, and exterior orientation parameters are solved using a collinear equation based on image position and orientation. Finally, the sequence images are fused with the laser point cloud, based on the Global Navigation Satellite System (GNSS) time index of the optical image, to generate a true color point cloud. The experimental results show the higher registration accuracy and fusion speed of the proposed method, thereby demonstrating its accuracy and effectiveness.

Download Full-text

A computer vision approach to identifying the manufacturer and model of anterior cervical spinal hardware

Journal of Neurosurgery Spine ◽

10.3171/2019.6.spine19463 ◽

2019 ◽

Vol 31 (6) ◽

pp. 844-850 ◽

Cited By ~ 1

Author(s):

Kevin T. Huang ◽

Michael A. Silva ◽

Alfred P. See ◽

Kyle C. Wu ◽

Troy Gallerani ◽

...

Keyword(s):

Computer Vision ◽

Feature Detection ◽

High Accuracy ◽

Detection Accuracy ◽

Data Sets ◽

Visual Words ◽

Fusion Systems ◽

Kaze Feature ◽

Applications Of Machine Learning ◽

Cervical Plating

OBJECTIVERecent advances in computer vision have revolutionized many aspects of society but have yet to find significant penetrance in neurosurgery. One proposed use for this technology is to aid in the identification of implanted spinal hardware. In revision operations, knowing the manufacturer and model of previously implanted fusion systems upfront can facilitate a faster and safer procedure, but this information is frequently unavailable or incomplete. The authors present one approach for the automated, high-accuracy classification of anterior cervical hardware fusion systems using computer vision.METHODSPatient records were searched for those who underwent anterior-posterior (AP) cervical radiography following anterior cervical discectomy and fusion (ACDF) at the authors’ institution over a 10-year period (2008–2018). These images were then cropped and windowed to include just the cervical plating system. Images were then labeled with the appropriate manufacturer and system according to the operative record. A computer vision classifier was then constructed using the bag-of-visual-words technique and KAZE feature detection. Accuracy and validity were tested using an 80%/20% training/testing pseudorandom split over 100 iterations.RESULTSA total of 321 total images were isolated containing 9 different ACDF systems from 5 different companies. The correct system was identified as the top choice in 91.5% ± 3.8% of the cases and one of the top 2 or 3 choices in 97.1% ± 2.0% and 98.4 ± 13% of the cases, respectively. Performance persisted despite the inclusion of variable sizes of hardware (i.e., 1-level, 2-level, and 3-level plates). Stratification by the size of hardware did not improve performance.CONCLUSIONSA computer vision algorithm was trained to classify at least 9 different types of anterior cervical fusion systems using relatively sparse data sets and was demonstrated to perform with high accuracy. This represents one of many potential clinical applications of machine learning and computer vision in neurosurgical practice.

Download Full-text

A Review on Computer Vision Technology for Monitoring Poultry Farm – Application, Hardware, and Software

IEEE Access ◽

10.1109/access.2020.3047818 ◽

2020 ◽

pp. 1-1

Author(s):

Nur Syazarin Natasha Abd Aziz ◽

Salwani Mohd Daud ◽

Rudzidatul Akmam Dziyauddin ◽

Mohamad Zulkefli Adam ◽

Azizul Azizan

Keyword(s):

Computer Vision ◽

Poultry Farm ◽

Computer Vision Technology

Download Full-text

HRDepthNet: Depth Image-Based Marker-Less Tracking of Body Joints

Sensors ◽

10.3390/s21041356 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1356

Author(s):

Linda Christin Büker ◽

Finnja Zuber ◽

Andreas Hein ◽

Sebastian Fudickar

Keyword(s):

Color Images ◽

Depth Image ◽

Accuracy Evaluation ◽

Timed Up And Go ◽

Position Errors ◽

Depth Images ◽

Upper And Lower Extremities ◽

Rgb Images ◽

Human Joints ◽

Body Joints

With approaches for the detection of joint positions in color images such as HRNet and OpenPose being available, consideration of corresponding approaches for depth images is limited even though depth images have several advantages over color images like robustness to light variation or color- and texture invariance. Correspondingly, we introduce High- Resolution Depth Net (HRDepthNet)—a machine learning driven approach to detect human joints (body, head, and upper and lower extremities) in purely depth images. HRDepthNet retrains the original HRNet for depth images. Therefore, a dataset is created holding depth (and RGB) images recorded with subjects conducting the timed up and go test—an established geriatric assessment. The images were manually annotated RGB images. The training and evaluation were conducted with this dataset. For accuracy evaluation, detection of body joints was evaluated via COCO’s evaluation metrics and indicated that the resulting depth image-based model achieved better results than the HRNet trained and applied on corresponding RGB images. An additional evaluation of the position errors showed a median deviation of 1.619 cm (x-axis), 2.342 cm (y-axis) and 2.4 cm (z-axis).

Download Full-text