An OCaNet Model Based on Octave Convolution and Attention Mechanism for Iris Recognition

2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Dong Zou ◽  
Jianbing Feng ◽  
Zhixin He ◽  
Liping Liu ◽  
Meijun Zhao ◽  
...  

Iris recognition refers to identifying individuals based on iris patterns, and it has been widely used in security systems, such as subway security and access control attendance, because everyone has a unique iris shape. In this study, we propose an OCaNet model for the iris recognition task. First, the iris image is preprocessed: binarized threshold segmentation locates the pupil and yields the pupil boundary; the Hough transform then locates the outer edge of the iris; the iris area image is obtained through image segmentation according to the located pupil and iris; and the iris image is normalized so that each original image is adjusted to the same size and corresponding position, which eliminates the influence of translation, scaling, and rotation on iris recognition. Second, the normalized iris images are fed into both the octave convolution module and the attention module. The octave convolution module extracts the shape and contour features of the iris by decomposing the feature map into high and low frequencies. The attention module extracts the color and texture characteristics of the iris. Finally, the two feature maps are concatenated to produce a distribution over output classes. Experimental results show that the proposed OCaNet model is significantly more accurate.
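The normalization step can be illustrated with a polar unwrapping of the iris ring onto a fixed-size grid. This is only a minimal sketch, not the authors' implementation: the function name `unwrap_iris`, the assumption that pupil and iris circles share one center, the nearest-neighbour sampling, and the grid resolution are all choices made here for illustration.

```python
import math

def unwrap_iris(image, center, pupil_radius, iris_radius,
                radial_res=8, angular_res=32):
    """Map the ring between pupil and iris boundaries to a fixed-size grid.

    image: 2D list of grey values indexed as image[row][col].
    center: (row, col) shared by both circles (a simplifying assumption).
    Returns a radial_res x angular_res list of sampled grey values.
    """
    cy, cx = center
    height, width = len(image), len(image[0])
    out = []
    for i in range(radial_res):
        # Fraction of the way from the pupil boundary to the iris boundary.
        frac = (i + 0.5) / radial_res
        r = pupil_radius + frac * (iris_radius - pupil_radius)
        row = []
        for j in range(angular_res):
            theta = 2.0 * math.pi * j / angular_res
            y = int(round(cy + r * math.sin(theta)))
            x = int(round(cx + r * math.cos(theta)))
            # Clamp so that samples near the image border stay valid.
            y = min(max(y, 0), height - 1)
            x = min(max(x, 0), width - 1)
            row.append(image[y][x])
        out.append(row)
    return out
```

Because every image is sampled relative to its own center and radii, the output has the same size regardless of where the eye sits in the frame or how large it appears, and a rotation of the eye becomes a circular shift along the angular axis.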

Electronics ◽  
2021 ◽  
Vol 10 (3) ◽  
pp. 348
Author(s):  
Choongsang Cho ◽  
Young Han Lee ◽  
Jongyoul Park ◽  
Sangkeun Lee

Semantic image segmentation has a wide range of applications. In medical image segmentation, accuracy is even more important than in other areas because the results give information directly applicable to disease diagnosis, surgical planning, and history monitoring. The state-of-the-art models in medical image segmentation are variants of the encoder-decoder architecture known as U-Net. To effectively reflect the spatial features in the feature maps of an encoder-decoder architecture, we propose a spatially adaptive weighting scheme for medical image segmentation. Specifically, the spatial feature is estimated from the feature maps, and the learned weighting parameters are obtained from the computed map, since segmentation results are predicted from the feature map through a convolutional layer. In the proposed networks, the convolutional block for extracting the feature map is replaced with widely used convolutional frameworks: VGG, ResNet, and Bottleneck ResNet structures. In addition, a bilinear up-sampling method replaces the up-convolutional layer to increase the resolution of the feature map. For the performance evaluation of the proposed architecture, we used three data sets covering different medical imaging modalities. Experimental results show that the network with the proposed self-spatial adaptive weighting block based on the ResNet framework gave the highest IoU and DICE scores in the three tasks compared to other methods. In particular, the segmentation network combining the proposed self-spatially adaptive block and the ResNet framework recorded the largest improvements in IoU and DICE scores, 3.01% and 2.89%, respectively, on the Nerve data set. Therefore, we believe that the proposed scheme can be a useful tool for image segmentation tasks based on the encoder-decoder architecture.
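For readers unfamiliar with the two reported metrics, the sketch below computes IoU and Dice for a pair of binary masks. The function name `iou_and_dice` and the convention for two empty masks are assumptions made here, not details taken from the paper.

```python
def iou_and_dice(pred, target):
    """Return (IoU, Dice) for two binary masks given as equally sized 2D lists."""
    intersection = 0
    pred_sum = 0
    target_sum = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p, t = bool(p), bool(t)
            intersection += p and t
            pred_sum += p
            target_sum += t
    union = pred_sum + target_sum - intersection
    if union == 0:
        # Both masks are empty: treated as perfect agreement (a convention).
        return 1.0, 1.0
    iou = intersection / union
    dice = 2 * intersection / (pred_sum + target_sum)
    return iou, dice
```

The two scores are monotonically related (Dice = 2·IoU / (1 + IoU)), so they always rank methods on a single image the same way, although averages over a data set can differ.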


Author(s):  
Zhenzhen Yang ◽  
Pengfei Xu ◽  
Yongpeng Yang ◽  
Bing-Kun Bao

The U-Net has become the most popular structure in medical image segmentation in recent years. Although its performance for medical image segmentation is outstanding, a large number of experiments demonstrate that the classical U-Net architecture seems to be insufficient when the size of segmentation targets changes and when target and background are imbalanced in different forms of segmentation. To improve the U-Net architecture, we develop a new architecture named the densely connected U-Net (DenseUNet) in this article. The proposed DenseUNet adopts a dense block to improve the feature extraction capability and employs a multi-feature fuse block that fuses feature maps of different levels to increase the accuracy of feature extraction. In addition, in view of the advantages of the cross-entropy and Dice loss functions, a new loss function for the DenseUNet is proposed to deal with the imbalance between target and background. Finally, we test the proposed DenseUNet and compare it with the multi-resolutional U-Net (MultiResUNet) and the classic U-Net on three different datasets. The experimental results show that the DenseUNet performs significantly better than the MultiResUNet and the classic U-Net.
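The abstract does not say how the cross-entropy and Dice terms are combined. One common way, shown here purely as an assumed illustration, is a weighted sum; the function name `combined_loss`, the mixing weight `alpha`, and the smoothing constant `eps` are hypothetical.

```python
import math

def combined_loss(probs, labels, alpha=0.5, eps=1e-7):
    """Weighted sum of binary cross-entropy and soft Dice loss.

    probs: predicted foreground probabilities, flat list with values in [0, 1].
    labels: ground-truth labels, flat list of 0/1.
    alpha: hypothetical mixing weight (not specified in the source).
    """
    n = len(probs)
    bce = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        bce -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    bce /= n
    # Soft Dice uses the raw probabilities instead of thresholded masks.
    intersection = sum(p * y for p, y in zip(probs, labels))
    dice = (2.0 * intersection + eps) / (sum(probs) + sum(labels) + eps)
    return alpha * bce + (1.0 - alpha) * (1.0 - dice)
```

The cross-entropy term scores every pixel equally, while the Dice term depends only on overlap with the foreground, which is why a mixture is often used when the target occupies a small part of the image.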


2020 ◽  
Vol 0 (0) ◽  
Author(s):  
Mohammadreza Azimi ◽  
Seyed Ahmad Rasoulinejad ◽  
Andrzej Pacut

Abstract: In this paper, we attempt to answer two questions: whether iris recognition is more difficult under the influence of diabetes, and whether the effects of diabetes and individuals' age are uncorrelated. We hypothesized that the health condition of volunteers plays an important role in the performance of the iris recognition system. To confirm the obtained results, we reported the distribution of usable area in each subgroup for a more comprehensive analysis of diabetes effects. No study has yet investigated for which age group (young or old) the effect of diabetes on biometric results is more acute. For this purpose, we created a new database containing 1,906 samples from 509 eyes. We applied the weighted adaptive Hough ellipsopolar transform technique and the contrast-adjusted Hough transform for segmentation of iris texture, along with three different encoding algorithms. To test the hypothesis related to the physiological aging effect, Welch's t-test and the Kolmogorov–Smirnov test were used to study the age-dependency of the influence of diabetes mellitus on the reliability of our chosen iris recognition system. Our results give some general hints about the effect of age on the performance of biometric systems for people with diabetes.
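Welch's t-test compares the means of two groups without assuming equal variances. The sketch below computes only the test statistic and the Welch-Satterthwaite degrees of freedom; the function name `welch_t` is an assumption, and which quantities the authors actually compared between subgroups is not stated here.

```python
import math
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    """Return Welch's t statistic and the Welch-Satterthwaite degrees of freedom.

    Each sample needs at least two values and a non-zero variance in at
    least one of the groups.
    """
    na, nb = len(sample_a), len(sample_b)
    # Squared standard errors of the two sample means.
    va = variance(sample_a) / na
    vb = variance(sample_b) / nb
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df
```

Turning `t` and `df` into a p-value requires the Student t distribution, which a statistics library would normally supply.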


Sensors ◽  
2021 ◽  
Vol 21 (1) ◽  
pp. 268
Author(s):  
Yeganeh Jalali ◽  
Mansoor Fateh ◽  
Mohsen Rezvani ◽  
Vahid Abolghasemi ◽  
Mohammad Hossein Anisi

Lung CT image segmentation is a key process in many applications such as lung cancer detection. It is considered a challenging problem due to similar image densities in the pulmonary structures, different types of scanners, and different scanning protocols. Most of the current semi-automatic segmentation methods rely on human factors and may therefore suffer from a lack of accuracy. Another shortcoming of these methods is their high false-positive rate. In recent years, several approaches based on a deep learning framework have been effectively applied in medical image segmentation. Among existing deep neural networks, the U-Net has achieved great success in this field. In this paper, we propose a deep neural network architecture to perform an automatic lung CT image segmentation process. In the proposed method, several extensive preprocessing techniques are applied to raw CT images. Then, ground truths corresponding to these images are extracted via some morphological operations and manual reforms. Finally, all the prepared images with the corresponding ground truth are fed into a modified U-Net in which the encoder is replaced with a pre-trained ResNet-34 network (referred to as Res BCDU-Net). In the architecture, we employ BConvLSTM (Bidirectional Convolutional Long Short-term Memory) as an advanced integrator module instead of simple traditional concatenators. This is to merge the extracted feature maps of the corresponding contracting path into the previous expansion of the up-convolutional layer. Finally, a densely connected convolutional layer is utilized for the contracting path. The results of our extensive experiments on lung CT images (LIDC-IDRI database) confirm the effectiveness of the proposed method, with a Dice coefficient of 97.31% achieved.


2021 ◽  
Vol 17 (1) ◽  
pp. 287-292
Author(s):  
Adriana-Meda UDROIU ◽  
Ștefan-Antonio DAN-ȘUTEU

Abstract: We introduce the term usable security to refer to security systems, models, mechanisms and applications whose main goal is usability. Secure systems cannot exist without secure authentication methods. Thus we outline biometric authentication methods and focus on iris recognition because it is the most reliable and accurate method for human identification. The most important advantage of iris biometrics over other biometrics is that irises have enormous pattern variability, meaning that the variation between individuals is almost maximal and the variation for any person across time or conditions is minimal. Taking these observations into consideration, this survey covers research in this field, methods of technical implementation, and the usability of this method as an authentication system in the iOS environment.


2013 ◽  
Vol 734-737 ◽  
pp. 2912-2916
Author(s):  
Hui Li ◽  
Ping He

Automated strain measurement of deforming sheet metal has become one of the important application fields of computer vision. An image segmentation algorithm based on an adaptive threshold is presented for segmenting images of metal steel. To validate the proposed method, it is tested and compared with the Otsu method and the one-dimensional maximum entropy method. Experimental results indicate that the method is simple and effective and has the advantage of preserving the main features of the original image.
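The Otsu method used as a comparison baseline picks the grey level that best separates the histogram into two classes. The sketch below shows that baseline only, not the paper's adaptive threshold algorithm; the function name `otsu_threshold` and the convention that pixels at or below the threshold form the lower class are choices made here.

```python
def otsu_threshold(pixels, levels=256):
    """Return the grey level that maximizes between-class variance.

    pixels: flat list of integer grey values in [0, levels).
    Pixels <= threshold form the lower class, the rest the upper class.
    """
    hist = [0] * levels
    for v in pixels:
        hist[v] += 1
    total = len(pixels)
    total_sum = sum(i * h for i, h in enumerate(hist))
    best_t, best_score = 0, -1.0
    w0 = 0    # number of pixels in the lower class
    sum0 = 0  # sum of grey values in the lower class
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, so no valid split at this level
        m0 = sum0 / w0
        m1 = (total_sum - sum0) / w1
        # Proportional to the between-class variance (scaled by total ** 2).
        score = w0 * w1 * (m0 - m1) ** 2
        if score > best_score:
            best_score, best_t = score, t
    return best_t
```

A binary image then follows from comparing each pixel with the returned level, for example `[v > t for v in pixels]`.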


2020 ◽  
Vol 2020 ◽  
pp. 1-13
Author(s):  
Xiaodong Huang ◽  
Hui Zhang ◽  
Li Zhuo ◽  
Xiaoguang Li ◽  
Jing Zhang

Extracting the tongue body accurately from a digital tongue image is a challenge for automated tongue diagnoses because of the blurred edge of the tongue body, the interference of pathological details, and the large variation in tongue size and shape. In this study, an automated tongue image segmentation method using an enhanced fully convolutional network with an encoder-decoder structure is presented. Within the proposed network, a deep residual network is adopted as the encoder to obtain dense feature maps, and a Receptive Field Block is attached after the encoder. The Receptive Field Block can capture an adequate global contextual prior because of its multibranch convolution layers with varying kernels. Moreover, the Feature Pyramid Network is used as the decoder to fuse multiscale feature maps and gather sufficient positional information to recover the clear contour of the tongue body. The quantitative evaluation of the segmentation results of 300 tongue images from the SIPL-tongue dataset showed that the average Hausdorff Distance, average Symmetric Mean Absolute Surface Distance, average Dice Similarity Coefficient, average precision, average sensitivity, and average specificity were 11.2963, 3.4737, 97.26%, 95.66%, 98.97%, and 98.68%, respectively. The proposed method achieved the best performance compared with four other deep-learning-based segmentation methods (SegNet, FCN, PSPNet, and DeepLab v3+). Similar results were obtained on the HIT-tongue dataset. The experimental results demonstrated that the proposed method can achieve accurate tongue image segmentation and meet the practical requirements of automated tongue diagnoses.
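Of the reported metrics, the Hausdorff Distance is the least self-explanatory: it is the largest distance from any point on one contour to the nearest point on the other, taken in both directions. A brute-force sketch follows, assuming contours are given as lists of (row, col) points; the function name `hausdorff_distance` is hypothetical and the paper's exact evaluation protocol is not described here.

```python
import math

def hausdorff_distance(points_a, points_b):
    """Symmetric Hausdorff distance between two non-empty lists of 2D points."""
    def directed(src, dst):
        # Largest distance from a point in src to its nearest point in dst.
        return max(min(math.dist(p, q) for q in dst) for p in src)
    return max(directed(points_a, points_b), directed(points_b, points_a))
```

The cost grows with the product of the two point counts, so this form suits contours rather than full masks. Because it takes a maximum, a single stray boundary pixel can dominate the score, which is one reason it is usually reported alongside an average surface distance.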


2014 ◽  
Vol 3 (2) ◽  
pp. 10
Author(s):  
Anna Sedrak Hovakimyan ◽  
Siranush Gegham Sargsyan ◽  
Arshak Nazaryan

The human iris is a good subject for biometric identification, since iris patterns are unique, like fingerprints. The iris is well protected against damage, unlike fingerprints, which can be harder to recognize after years of certain types of manual labor. The paper considers the problem of iris recognition. In machine learning, pattern recognition is the assignment of a label to a given input value. Pattern classification is an example of pattern recognition: it attempts to assign each input value to one of a given set of classes. Nowadays various techniques are used for this purpose, in particular artificial neural networks. Kohonen Self-Organizing Maps are proposed for solving the iris recognition problem. The iris recognition software developed is customizable and allows the appropriate parameters of the neural network to be selected to obtain the most satisfactory results. The developed Self-Organizing Map class library can be used for various kinds of object classification problems, as well as for any problem suited to Self-Organizing Maps.
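A Kohonen Self-Organizing Map can be sketched in a few lines. The code below is a generic illustration, not the authors' class library: the function names, the rectangular grid, the Gaussian neighbourhood, and the linear decay of learning rate and neighbourhood width are all assumptions.

```python
import math
import random

def best_matching_unit(weights, x):
    """Grid index (row, col) of the node whose weight vector is closest to x."""
    best, best_d = None, float("inf")
    for r, row in enumerate(weights):
        for c, w in enumerate(row):
            d = sum((wi - xi) ** 2 for wi, xi in zip(w, x))
            if d < best_d:
                best, best_d = (r, c), d
    return best

def train_som(data, rows, cols, epochs=20, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small rectangular SOM; returns weights[row][col] -> vector."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = [[[rng.random() for _ in range(dim)] for _ in range(cols)]
               for _ in range(rows)]
    for epoch in range(epochs):
        # Learning rate and neighbourhood width shrink linearly (assumed schedule).
        decay = 1.0 - epoch / epochs
        lr = lr0 * decay
        sigma = max(sigma0 * decay, 1e-3)
        for x in data:
            br, bc = best_matching_unit(weights, x)
            for r in range(rows):
                for c in range(cols):
                    grid_d2 = (r - br) ** 2 + (c - bc) ** 2
                    h = math.exp(-grid_d2 / (2.0 * sigma ** 2))
                    w = weights[r][c]
                    # Pull the node toward x, scaled by its grid distance to the winner.
                    for k in range(dim):
                        w[k] += lr * h * (x[k] - w[k])
    return weights
```

After training, an input is classified by the grid position of its best matching unit. For iris recognition, `data` would hold feature vectors extracted from iris images, and a label would be attached to each node from the training samples that map to it.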


2020 ◽  
Vol 10 (3) ◽  
pp. 724-730
Author(s):  
Chunjiang Fan ◽  
Zijian Wang ◽  
Gang Li ◽  
Jian Luo ◽  
Yang Cao ◽  
...  

Image segmentation technologies play a crucial role in medical diagnosis. This paper proposes a novel parallel structure based on the conventional 3D U-net deep network to improve the performance of CT image segmentation. In our model architecture, a new connection channel from the analysis path to the synthesis path is constructed to exploit feature maps from deep spatial dimensions. Sixty CT scan images of stroke patients were collected for lesion location, of which 36 valid samples were selected for further analysis. The improved method achieved better results on this task, which segments stroke CT scan images into healthy and injured parts. The performance of our method on the test set was compared with other state-of-the-art U-net models to demonstrate the effectiveness of our architecture. Furthermore, the results verified that the parallel structure aided the convergence of the loss curve.

