RGB Inter-Channel Measures for Morphological Color Texture Characterization

Nelson Luis Durañona Durañona Sosa; José Luis Vázquez Vázquez Noguera; Juan José Cáceres Cáceres Silva; Miguel García García Torres; and Horacio Legal-Ayala

doi:10.3390/sym11101190

RGB Inter-Channel Measures for Morphological Color Texture Characterization

Symmetry ◽

10.3390/sym11101190 ◽

2019 ◽

Vol 11 (10) ◽

pp. 1190

Author(s):

Nelson Luis Durañona Durañona Sosa ◽

José Luis Vázquez Vázquez Noguera ◽

Juan José Cáceres Cáceres Silva ◽

Miguel García García Torres ◽

and Horacio Legal-Ayala

Keyword(s):

Image Processing ◽

State Of The Art ◽

Texture Classification ◽

Texture Descriptor ◽

Classification Techniques ◽

Rgb Images ◽

Different Color ◽

High Level ◽

Texture Characterization ◽

Morphological Series

The perception of textures is based on high-level features such as symmetry, brightness, color or direction. Texture characterization is a widely studied topic in the image processing community. The normalized volume of morphological series is used as a texture descriptor in RGB images. However, the correlation between different color channels is not exploited with this descriptor. We propose the usage of inter-channel measures in addition to the volume, to enhance the descriptors potential to discriminate textures. The experiments show that standard texture classification techniques increase between 3%–10% in performance when using our descriptor instead of other state of the art descriptors that do not use inter-channel measures.

Download Full-text

SeDAR: Reading Floorplans Like a Human—Using Deep Learning to Enable Human-Inspired Localisation

International Journal of Computer Vision ◽

10.1007/s11263-019-01239-4 ◽

2019 ◽

Vol 128 (5) ◽

pp. 1286-1310 ◽

Cited By ~ 3

Author(s):

Oscar Mendez ◽

Simon Hadfield ◽

Nicolas Pugeault ◽

Richard Bowden

Keyword(s):

Deep Learning ◽

Semantic Information ◽

State Of The Art ◽

Depth Information ◽

Semantic Maps ◽

Novel Method ◽

Rgb Images ◽

High Level ◽

Robotic Tasks ◽

And Robotics

Abstract The use of human-level semantic information to aid robotic tasks has recently become an important area for both Computer Vision and Robotics. This has been enabled by advances in Deep Learning that allow consistent and robust semantic understanding. Leveraging this semantic vision of the world has allowed human-level understanding to naturally emerge from many different approaches. Particularly, the use of semantic information to aid in localisation and reconstruction has been at the forefront of both fields. Like robots, humans also require the ability to localise within a structure. To aid this, humans have designed high-level semantic maps of our structures called floorplans. We are extremely good at localising in them, even with limited access to the depth information used by robots. This is because we focus on the distribution of semantic elements, rather than geometric ones. Evidence of this is that humans are normally able to localise in a floorplan that has not been scaled properly. In order to grant this ability to robots, it is necessary to use localisation approaches that leverage the same semantic information humans use. In this paper, we present a novel method for semantically enabled global localisation. Our approach relies on the semantic labels present in the floorplan. Deep Learning is leveraged to extract semantic labels from RGB images, which are compared to the floorplan for localisation. While our approach is able to use range measurements if available, we demonstrate that they are unnecessary as we can achieve results comparable to state-of-the-art without them.

Download Full-text

CNN performance dependence on linear image processing

Electronic Imaging ◽

10.2352/issn.2470-1173.2020.10.ipas-182 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 310-1-310-7

Author(s):

Khalid Omer ◽

Luca Caucci ◽

Meredith Kupinski

Keyword(s):

Image Processing ◽

Texture Classification ◽

Full Rank ◽

Detection Performance ◽

Ideal Observer ◽

Training Data ◽

Image Texture ◽

Training Images ◽

Analytic Expressions ◽

Linear Compression

This work reports on convolutional neural network (CNN) performance on an image texture classification task as a function of linear image processing and number of training images. Detection performance of single and multi-layer CNNs (sCNN/mCNN) are compared to optimal observers. Performance is quantified by the area under the receiver operating characteristic (ROC) curve, also known as the AUC. For perfect detection AUC = 1.0 and AUC = 0.5 for guessing. The Ideal Observer (IO) maximizes AUC but is prohibitive in practice because it depends on high-dimensional image likelihoods. The IO performance is invariant to any fullrank, invertible linear image processing. This work demonstrates the existence of full-rank, invertible linear transforms that can degrade both sCNN and mCNN even in the limit of large quantities of training data. A subsequent invertible linear transform changes the images’ correlation structure again and can improve this AUC. Stationary textures sampled from zero mean and unequal covariance Gaussian distributions allow closed-form analytic expressions for the IO and optimal linear compression. Linear compression is a mitigation technique for high-dimension low sample size (HDLSS) applications. By definition, compression strictly decreases or maintains IO detection performance. For small quantities of training data, linear image compression prior to the sCNN architecture can increase AUC from 0.56 to 0.93. Results indicate an optimal compression ratio for CNN based on task difficulty, compression method, and number of training images.

Download Full-text

Towards an Efficient High-Level Modeling of Heterogeneous Image Processing Systems

TMS/DEVS Symposium on Theory of Modeling & Simulation (TMS/DEVS 2016) ◽

10.22360/springsim.2016.tmsdevs.036 ◽

2016 ◽

Keyword(s):

Image Processing ◽

High Level Modeling ◽

High Level

Download Full-text

Efficient End-to-End Sentence-Level Lipreading with Temporal Convolutional Networks

Applied Sciences ◽

10.3390/app11156975 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6975

Author(s):

Tao Zhang ◽

Lun He ◽

Xudong Li ◽

Guoqing Feng

Keyword(s):

Performance Improvement ◽

State Of The Art ◽

Error Rates ◽

Convolutional Network ◽

Convolutional Networks ◽

Sentence Level ◽

End To End ◽

High Level ◽

Improved Accuracy ◽

Talking Face

Lipreading aims to recognize sentences being spoken by a talking face. In recent years, the lipreading method has achieved a high level of accuracy on large datasets and made breakthrough progress. However, lipreading is still far from being solved, and existing methods tend to have high error rates on the wild data and have the defects of disappearing training gradient and slow convergence. To overcome these problems, we proposed an efficient end-to-end sentence-level lipreading model, using an encoder based on a 3D convolutional network, ResNet50, Temporal Convolutional Network (TCN), and a CTC objective function as the decoder. More importantly, the proposed architecture incorporates TCN as a feature learner to decode feature. It can partly eliminate the defects of RNN (LSTM, GRU) gradient disappearance and insufficient performance, and this yields notable performance improvement as well as faster convergence. Experiments show that the training and convergence speed are 50% faster than the state-of-the-art method, and improved accuracy by 2.4% on the GRID dataset.

Download Full-text

Computer Vision and robotics in postal automation

Human Systems Management ◽

10.3233/hsm-1999-183-411 ◽

1999 ◽

Vol 18 (3-4) ◽

pp. 265-273

Author(s):

Giovanni B. Garibotto

Keyword(s):

Image Processing ◽

Computer Vision ◽

Pattern Recognition ◽

Material Handling ◽

State Of The Art ◽

Short Description ◽

The Other ◽

Functional Requirements ◽

Postal Automation ◽

And Robotics

The paper is intended to provide an overview of advanced robotic technologies within the context of Postal Automation services. The main functional requirements of the application are briefly referred, as well as the state of the art and new emerging solutions. Image Processing and Pattern Recognition have always played a fundamental role in Address Interpretation and Mail sorting and the new challenging objective is now off-line handwritten cursive recognition, in order to be able to handle all kind of addresses in a uniform way. On the other hand, advanced electromechanical and robotic solutions are extremely important to solve the problems of mail storage, transportation and distribution, as well as for material handling and logistics. Finally a short description of new services of Postal Automation is referred, by considering new emerging services of hybrid mail and paper to electronic conversion.

Download Full-text

IMG2nDSM: Height Estimation from Single Airborne RGB Images with Deep Learning

Remote Sensing ◽

10.3390/rs13122417 ◽

2021 ◽

Vol 13 (12) ◽

pp. 2417

Author(s):

Savvas Karatsiolis ◽

Andreas Kamilaris ◽

Ian Cole

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Aerial Imagery ◽

Aerial Images ◽

Surface Model ◽

Large Area ◽

Digital Terrain ◽

Terrain Models ◽

Architectural Features ◽

Rgb Images

Estimating the height of buildings and vegetation in single aerial images is a challenging problem. A task-focused Deep Learning (DL) model that combines architectural features from successful DL models (U-NET and Residual Networks) and learns the mapping from a single aerial imagery to a normalized Digital Surface Model (nDSM) was proposed. The model was trained on aerial images whose corresponding DSM and Digital Terrain Models (DTM) were available and was then used to infer the nDSM of images with no elevation information. The model was evaluated with a dataset covering a large area of Manchester, UK, as well as the 2018 IEEE GRSS Data Fusion Contest LiDAR dataset. The results suggest that the proposed DL architecture is suitable for the task and surpasses other state-of-the-art DL approaches by a large margin.

Download Full-text

Caged-electron States and Split-electron States in Endohedral Alkali C60

Physical Chemistry Chemical Physics ◽

10.1039/d1cp01341f ◽

2021 ◽

Author(s):

yifan yang ◽

Lorenz S Cederbaum

Keyword(s):

State Of The Art ◽

Electronic States ◽

Electron States ◽

The Common ◽

High Level

The low-lying electronic states of neutral X@C60(X=Li, Na, K, Rb) have been computed and analyzed by employing state-of-the-art high level many-electron methods. Apart from the common charge-separated states, well known...

Download Full-text

InstantDL: an easy-to-use deep learning pipeline for image segmentation and classification

BMC Bioinformatics ◽

10.1186/s12859-021-04037-3 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Dominik Jens Elias Waibel ◽

Sayedali Shetab Boushehri ◽

Carsten Marr

Keyword(s):

Image Processing ◽

Deep Learning ◽

Specific Problem ◽

State Of The Art ◽

Image Data ◽

Semantic Segmentation ◽

Parameter Tuning ◽

Cellular Processes ◽

Minimal Effort ◽

Instance Segmentation

Abstract Background Deep learning contributes to uncovering molecular and cellular processes with highly performant algorithms. Convolutional neural networks have become the state-of-the-art tool to provide accurate and fast image data processing. However, published algorithms mostly solve only one specific problem and they typically require a considerable coding effort and machine learning background for their application. Results We have thus developed InstantDL, a deep learning pipeline for four common image processing tasks: semantic segmentation, instance segmentation, pixel-wise regression and classification. InstantDL enables researchers with a basic computational background to apply debugged and benchmarked state-of-the-art deep learning algorithms to their own data with minimal effort. To make the pipeline robust, we have automated and standardized workflows and extensively tested it in different scenarios. Moreover, it allows assessing the uncertainty of predictions. We have benchmarked InstantDL on seven publicly available datasets achieving competitive performance without any parameter tuning. For customization of the pipeline to specific tasks, all code is easily accessible and well documented. Conclusions With InstantDL, we hope to empower biomedical researchers to conduct reproducible image processing with a convenient and easy-to-use pipeline.

Download Full-text

France’s State of the Art Distributed Optical Fibre Sensors Qualified for the Monitoring of the French Underground Repository for High Level and Intermediate Level Long Lived Radioactive Wastes

Sensors ◽

10.3390/s17061377 ◽

2017 ◽

Vol 17 (6) ◽

pp. 1377 ◽

Cited By ~ 17

Author(s):

Sylvie Delepine-Lesoille ◽

Sylvain Girard ◽

Marcel Landolt ◽

Johan Bertrand ◽

Isabelle Planes ◽

...

Keyword(s):

Optical Fibre ◽

State Of The Art ◽

Radioactive Wastes ◽

Intermediate Level ◽

Optical Fibre Sensors ◽

High Level

Download Full-text

A Comparison of State-of-the-Art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection

Journal of Risk & Insurance ◽

10.1111/1539-6975.00023 ◽

2002 ◽

Vol 69 (3) ◽

pp. 373-421 ◽

Cited By ~ 99

Author(s):

Stijn Viaene ◽

Richard A. Derrig ◽

Bart Baesens ◽

Guido Dedene

Keyword(s):

State Of The Art ◽

Fraud Detection ◽

Automobile Insurance ◽

Insurance Claim ◽

Classification Techniques

Download Full-text