Prediction of Merchandise Sales on E-Commerce Platforms Based on Data Mining and Deep Learning

2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Xiaoting Yin ◽  
Xiaosha Tao

Online business has grown exponentially over the last decade, and industries are focusing on it more than ever before. However, simply setting up an online store and starting to sell may not work: machine learning and data mining techniques are needed to learn users' preferences and determine what is best for the business. Guided by the decision-making needs of online product sales, the factors that influence online sales across industries, and the strengths of deep learning algorithms, this paper constructs a sales prediction model for online products and evaluates the model's adaptability across different types of online products. In the experiments, a fully connected model is compared with a trained CNN, demonstrating the accuracy and generalization ability of the CNN model. Using a non-deep-learning model as the comparison baseline, the performance advantage of the CNN across product categories is established. In addition, the experiments show that an unsupervised, pretrained CNN model is more effective and adaptable for sales forecasting.
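The abstract does not give implementation detail, but the core operation of the CNN it describes is a convolution sliding over a sales sequence. As a minimal sketch (the sales numbers and the averaging kernel here are hypothetical, not the paper's data):

```python
import numpy as np

def conv1d(series, kernel):
    """Valid-mode 1-D convolution: the core operation a CNN layer
    applies when extracting local patterns from a sales sequence."""
    n, k = len(series), len(kernel)
    return np.array([np.dot(series[i:i + k], kernel) for i in range(n - k + 1)])

# Hypothetical weekly sales counts for one product.
sales = np.array([12, 15, 14, 30, 28, 27, 13, 16], dtype=float)
# A 3-tap averaging kernel smooths local demand fluctuations;
# a trained CNN would learn such kernels from data instead.
features = conv1d(sales, np.array([1 / 3, 1 / 3, 1 / 3]))
```

In a real model, many learned kernels run in parallel and their outputs feed further layers; this sketch only shows the windowed dot product itself.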

2020 ◽  
Vol 2020 ◽  
pp. 1-14
Author(s):  
Yifan Jian ◽  
Xianguo Qing ◽  
Yang Zhao ◽  
Liang He ◽  
Xiao Qi

The coal mill is one of the important auxiliary engines in a coal-fired power station, and its operating state is directly related to the safe and steady operation of the units. In this paper, a model-based deep learning algorithm for fault diagnosis is proposed to effectively detect the operating state of coal mills. Based on a mechanistic system model of the coal mill, massive fault data are obtained by analyzing and simulating different types of faults. Stacked autoencoders (SAEs) are then built by combining these data with the deep learning algorithm; training the SAE model on the fault data gives it the capability to learn and identify fault characteristics. According to the simulation results, the fault-diagnosis accuracy of the SAE-based approach reaches 98.97%. The proposed SAEs can thus reliably detect faults in coal mills and generate warnings in advance.
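A stacked autoencoder is built from layers that each learn to reconstruct their input through a lower-dimensional code. As a minimal sketch of one such layer (a tied-weight linear autoencoder trained by a single gradient step on simulated data, not the paper's coal-mill model):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))            # simulated 8-D fault signatures
W = rng.normal(scale=0.1, size=(8, 3))   # encoder weights; decoder is W.T (tied)

def loss(W):
    Z = X @ W          # encode to a 3-D latent code
    R = Z @ W.T        # decode back to 8 dimensions
    return np.mean((X - R) ** 2)

# One gradient-descent step on the reconstruction loss.
Z = X @ W
E = Z @ W.T - X                                         # reconstruction error
grad = 2.0 / X.size * (X.T @ E @ W + E.T @ X @ W)       # d loss / d W
W_new = W - 0.05 * grad
```

In an SAE, each trained layer's code becomes the next layer's input, and the stack is typically fine-tuned end-to-end with a supervised head for fault classification.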


2021 ◽  
Author(s):  
Jae-Seung Yun ◽  
Jaesik Kim ◽  
Sang-Hyuk Jung ◽  
Seon-Ah Cha ◽  
Seung-Hyun Ko ◽  
...  

Objective: We aimed to develop and evaluate a non-invasive deep learning algorithm for screening type 2 diabetes in UK Biobank participants using retinal images. Research Design and Methods: The deep learning model for prediction of type 2 diabetes was trained on retinal images from 50,077 UK Biobank participants and tested on 12,185 participants. We evaluated its performance in terms of predicting traditional risk factors (TRFs) and genetic risk for diabetes. Next, we compared the performance of three models in predicting type 2 diabetes using 1) an image-only deep learning algorithm, 2) TRFs, and 3) the combination of the algorithm and TRFs. Assessing net reclassification improvement (NRI) allowed quantification of the improvement afforded by adding the algorithm to the TRF model. Results: When predicting TRFs with the deep learning algorithm, the areas under the curve (AUCs) obtained with the validation set for age, sex, and HbA1c status were 0.931 (0.928-0.934), 0.933 (0.929-0.936), and 0.734 (0.715-0.752), respectively. When predicting type 2 diabetes, the AUC of the composite logistic model using non-invasive TRFs was 0.810 (0.790-0.830), and that for the deep learning model using only fundus images was 0.731 (0.707-0.756). Upon addition of TRFs to the deep learning algorithm, discriminative performance improved to 0.844 (0.826-0.861). The addition of the algorithm to the TRF model improved risk stratification with an overall NRI of 50.8%. Conclusions: Our results demonstrate that this deep learning algorithm can be a useful tool for stratifying individuals at high risk of type 2 diabetes in the general population.
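The NRI reported above measures how often the augmented model moves predicted risk in the right direction. As a minimal sketch of the continuous (category-free) form on made-up risk scores (not the study's data):

```python
def nri(old_risk, new_risk, event):
    """Continuous net reclassification improvement: upward risk movement
    counts in favour for events, downward movement for non-events."""
    up_e = down_e = up_n = down_n = n_e = n_n = 0
    for old, new, e in zip(old_risk, new_risk, event):
        if e:
            n_e += 1
            up_e += new > old
            down_e += new < old
        else:
            n_n += 1
            up_n += new > old
            down_n += new < old
    return (up_e - down_e) / n_e + (down_n - up_n) / n_n

# Hypothetical example: the new model raises risk for both events
# and lowers it for both non-events, giving the maximum NRI of 2.
score = nri([0.2, 0.3, 0.6, 0.4], [0.5, 0.2, 0.7, 0.3], [1, 0, 1, 0])
```

The study's 50.8% figure corresponds to an NRI of about 0.508 on this scale; the categorical variant used in clinical work bins risks into predefined strata before counting movements.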


Plants ◽  
2021 ◽  
Vol 10 (12) ◽  
pp. 2714
Author(s):  
Syada Nizer Sultana ◽  
Halim Park ◽  
Sung Hoon Choi ◽  
Hyun Jo ◽  
Jong Tae Song ◽  
...  

Stomatal observation and automatic stomatal detection are useful analyses of stomata for taxonomic, biological, physiological, and eco-physiological studies. We present a new clearing method for improved microscopic imaging of stomata in soybean, followed by automated stomatal detection by deep learning. We tested eight clearing agent formulations based upon different ethanol and sodium hypochlorite (NaOCl) concentrations in order to improve the transparency of leaves. An optimal formulation—a 1:1 (v/v) mixture of 95% ethanol and NaOCl (6–14%)—produced better quality images of soybean stomata. Additionally, we evaluated fixatives and dehydrating agents and selected absolute ethanol for both fixation and dehydration; this is a good substitute for formaldehyde, which is more toxic to handle. Using imaging data from this clearing method, we developed an automatic stomatal detector based on the YOLO object detection model, which successfully recognized stomata with high mAP (~0.99). A web-based interface is provided to apply the stomatal detection model to any soybean data prepared with the new clearing protocol.
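Detection metrics such as the mAP quoted above are built on intersection-over-union (IoU) between predicted and ground-truth bounding boxes. A minimal sketch of the IoU computation for axis-aligned boxes (the boxes below are illustrative, not from the paper):

```python
def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)
```

A YOLO-style evaluation counts a predicted stoma as a true positive when its IoU with a ground-truth box exceeds a threshold (commonly 0.5), then averages precision over recall levels to get mAP.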


2021 ◽  
Author(s):  
Ganesh N. Jorvekar ◽  
Mohit Gangwar

In recent years, the volume of user comments and other text material has increased dramatically, and analysis of the emotions they express has drawn interest from researchers. Earlier research in artificial intelligence concentrated on recognizing emotions while leaving open why emotions go unrecognized or are misrecognized; the associations between emotions are key to understanding this loss of emotional information. In this work we aim to bridge the gap between emotion recognition and emotion-correlation mining in natural-language text from social media reviews. The association between emotions, manifested as emotional ambiguity and emotional evolution, is largely driven by cognitive bias in human emotion. Several feature types, together with a recurrent neural network (RNN) as the deep learning model, are used to mine emotion correlations from text-based emotion detection, and rules on conflicting emotions are derived on a symmetric basis. TF-IDF, NLP, and correlation features are used for feature extraction and selection, while an RNN and a hybrid deep learning algorithm are used for classification across the experiments. Finally, we evaluate system performance against various existing systems and show the effectiveness of the proposed approach.
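The TF-IDF features mentioned above weight a term by its frequency in a document, discounted by how many documents contain it. A minimal sketch from first principles (tokenized toy documents, not the paper's corpus):

```python
import math

def tf_idf(docs):
    """Per-document term weights: (term frequency) * log(N / document frequency).
    `docs` is a list of token lists."""
    N = len(docs)
    df = {}
    for doc in docs:
        for term in set(doc):
            df[term] = df.get(term, 0) + 1
    weights = []
    for doc in docs:
        tf = {}
        for term in doc:
            tf[term] = tf.get(term, 0) + 1
        weights.append({t: c / len(doc) * math.log(N / df[t])
                        for t, c in tf.items()})
    return weights
```

A term appearing in every document gets weight zero, so only discriminative words survive; these sparse vectors would then be fed (alongside the other feature types) into the RNN classifier.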


2020 ◽  
Vol 14 ◽  
pp. 174830262097352
Author(s):  
Anis Theljani ◽  
Ke Chen

Unlike image segmentation, developing a deep learning network for image registration is less straightforward, because training data cannot be prepared or supervised by humans unless they are trivial (e.g. pre-designed affine transforms). One approach to an unsupervised deep learning model is to self-train the deformation fields through a network driven by a loss function combining an image-similarity metric and a regularisation term, just as in traditional variational methods. Such a function consists of a smoothing constraint on the derivatives and a constraint on the determinant of the transformation, so as to obtain a spatially smooth and plausible solution. Although any variational model may be paired with a deep learning algorithm, the challenge lies in achieving robustness. The proposed algorithm is first trained on a new and robust variational model and tested on synthetic and real mono-modal images. The results show that it handles large-deformation registration problems and yields a real-time solution with no folding. It is then generalised to multi-modal images. Experiments and comparisons with learning and non-learning models demonstrate that this approach delivers good performance while generating an accurate diffeomorphic transformation.
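The loss structure described above (similarity metric plus derivative smoothness) can be sketched in a few lines. This is a generic sum-of-squared-differences variant with a first-difference penalty, not the paper's specific variational model, and `lam` is an illustrative weight:

```python
import numpy as np

def registration_loss(fixed, warped, disp, lam=0.1):
    """Unsupervised registration loss: SSD image similarity plus a
    smoothness penalty on first differences of the displacement field."""
    sim = np.mean((fixed - warped) ** 2)        # image-similarity term
    dx = np.diff(disp, axis=0)                  # derivative along rows
    dy = np.diff(disp, axis=1)                  # derivative along columns
    smooth = np.mean(dx ** 2) + np.mean(dy ** 2)
    return sim + lam * smooth
```

A network outputting `disp` can be trained by minimising this loss directly, with no ground-truth deformations; the paper's determinant constraint would add a term penalising non-positive Jacobian determinants to prevent folding.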


10.2196/15931 ◽  
2020 ◽  
Vol 8 (3) ◽  
pp. e15931 ◽  
Author(s):  
Chin-Sheng Lin ◽  
Chin Lin ◽  
Wen-Hui Fang ◽  
Chia-Jung Hsu ◽  
Sy-Jou Chen ◽  
...  

Background The detection of dyskalemias—hypokalemia and hyperkalemia—currently depends on laboratory tests. Since cardiac tissue is very sensitive to dyskalemia, electrocardiography (ECG) may be able to uncover clinically important dyskalemias before laboratory results. Objective Our study aimed to develop a deep-learning model, ECG12Net, to detect dyskalemias based on ECG presentations and to evaluate the logic and performance of this model. Methods Spanning from May 2011 to December 2016, 66,321 ECG records with corresponding serum potassium (K+) concentrations were obtained from 40,180 patients admitted to the emergency department. ECG12Net is an 82-layer convolutional neural network that estimates serum K+ concentration. Six clinicians—three emergency physicians and three cardiologists—participated in a human-machine competition. Sensitivity, specificity, and balanced accuracy were used to compare the performance of ECG12Net with that of these physicians. Results In a human-machine competition including 300 ECGs of different serum K+ concentrations, the areas under the curve for detecting hypokalemia and hyperkalemia with ECG12Net were 0.926 and 0.958, respectively, significantly better than those of our best clinicians. Moreover, in detecting hypokalemia and hyperkalemia, the sensitivities were 96.7% and 83.3%, respectively, and the specificities were 93.3% and 97.8%, respectively. In a test set including 13,222 ECGs, ECG12Net had a similar performance in terms of sensitivity for severe hypokalemia (95.6%) and severe hyperkalemia (84.5%), with a mean absolute error of 0.531. The specificities for detecting hypokalemia and hyperkalemia were 81.6% and 96.0%, respectively. Conclusions A deep-learning model based on a 12-lead ECG may help physicians promptly recognize severe dyskalemias and thereby potentially reduce cardiac events.
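The sensitivity, specificity, and balanced accuracy used to score ECG12Net against the clinicians are defined directly from the confusion matrix. A minimal sketch on toy labels (not the study's data):

```python
def sens_spec(y_true, y_pred):
    """Sensitivity, specificity, and balanced accuracy from binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    tn = sum(1 for t, p in zip(y_true, y_pred) if not t and not p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    sensitivity = tp / (tp + fn)        # fraction of dyskalemias caught
    specificity = tn / (tn + fp)        # fraction of normals cleared
    return sensitivity, specificity, (sensitivity + specificity) / 2
```

Balanced accuracy (the mean of sensitivity and specificity) is the fairer summary here because dyskalemic ECGs are a small minority of the test set, so plain accuracy would reward always predicting "normal".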


2021 ◽  
Vol 39 (15_suppl) ◽  
pp. 1556-1556
Author(s):  
Alexander S. Rich ◽  
Barry Leybovich ◽  
Melissa Estevez ◽  
Jamie Irvine ◽  
Nisha Singh ◽  
...  

1556 Background: Identifying patients with a particular cancer and determining the date of that diagnosis from EHR data is important for selecting real world research cohorts and conducting downstream analyses. However, cancer diagnoses and their dates are often not accurately recorded in the EHR in a structured form. We developed a unified deep learning model for identifying patients with NSCLC and their initial and advanced diagnosis date(s). Methods: The study used a cohort of 52,834 patients with lung cancer ICD codes from the nationwide deidentified Flatiron Health EHR-derived database. For all patients in the cohort, abstractors used an in-house technology-enabled platform to identify an NSCLC diagnosis, advanced disease, and relevant diagnosis date(s) via chart review. Advanced NSCLC was defined as stage IIIB or IV disease at diagnosis or early stage disease that recurred or progressed. The deep learning model was trained on 38,517 patients, with a separate 14,317 patient test cohort. The model input was a set of sentences containing keywords related to (a)NSCLC, extracted from a patient’s EHR documents. Each sentence was associated with a date, using the document timestamp or, if present, a date mentioned explicitly in the sentence. The sentences were processed by a GRU network, followed by an attentional network that integrated across sentences, outputting a prediction of whether the patient had been diagnosed with (a)NSCLC and the diagnosis date(s) if so. We measured sensitivity and positive predictive value (PPV) of extracting the presence of initial and advanced diagnoses in the test cohort. Among patients with both model-extracted and abstracted diagnosis dates, we also measured 30-day accuracy, defined as the proportion of patients where the dates match to within 30 days. Real world overall survival (rwOS) for patients abstracted vs. model-extracted as advanced was calculated using Kaplan-Meier methods (index date: abstracted vs. model-extracted advanced diagnosis date). Results: Results in the Table show the sensitivity, PPV, and accuracy of the model-extracted diagnoses and dates. rwOS was similar using model-extracted aNSCLC diagnosis dates (median = 13.7) versus abstracted diagnosis dates (median = 13.3), with a difference of 0.4 months (95% CI = [0.0, 0.8]). Conclusions: Initial and advanced diagnosis of NSCLC and dates of diagnosis can be accurately extracted from unstructured clinical text using a deep learning algorithm. This can further enable the use of EHR data for research on real-world treatment patterns and outcomes analysis, and other applications such as clinical trial matching. Future work should aim to understand the impact of model errors on downstream analyses. [Table: see text]
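The 30-day accuracy defined above is a simple proportion over paired dates. A minimal sketch with illustrative dates (not study data):

```python
from datetime import date

def thirty_day_accuracy(pairs):
    """Fraction of (model-extracted, abstracted) date pairs that
    agree to within 30 days in either direction."""
    hits = sum(abs((extracted - abstracted).days) <= 30
               for extracted, abstracted in pairs)
    return hits / len(pairs)

# One pair matches within 30 days, one misses by ~2.5 months.
acc = thirty_day_accuracy([
    (date(2020, 1, 1), date(2020, 1, 20)),
    (date(2020, 1, 1), date(2020, 3, 15)),
])
```

Note the metric is computed only among patients with both a model-extracted and an abstracted date, so it is conditional on the model having made a date prediction at all.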


2021 ◽  
Vol 251 ◽  
pp. 04012
Author(s):  
Simon Akar ◽  
Gowtham Atluri ◽  
Thomas Boettcher ◽  
Michael Peters ◽  
Henry Schreiner ◽  
...  

The locations of proton-proton collision points in LHC experiments are called primary vertices (PVs). Preliminary results of a hybrid deep learning algorithm for identifying and locating these, targeting the Run 3 incarnation of LHCb, have been described at conferences in 2019 and 2020. In the past year we have made significant progress in a variety of related areas. Using two newer Kernel Density Estimators (KDEs) as input feature sets improves the fidelity of the models, as does using full LHCb simulation rather than the “toy Monte Carlo” originally (and still) used to develop models. We have also built a deep learning model to calculate the KDEs from track information. Connecting a tracks-to-KDE model to a KDE-to-hists model used to find PVs provides a proof-of-concept that a single deep learning model can use track information to find PVs with high efficiency and high fidelity. We have studied a variety of models systematically to understand how variations in their architectures affect performance. While the studies reported here are specific to the LHCb geometry and operating conditions, the results suggest that the same approach could be used by the ATLAS and CMS experiments.
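The KDE input features described above summarise where tracks cluster along the beamline. As a minimal sketch (an unnormalised Gaussian KDE over hypothetical track z-positions, with an illustrative bandwidth; the real LHCb KDEs are more elaborate):

```python
import numpy as np

def kde_hist(track_z, grid, bandwidth=0.5):
    """Unnormalised Gaussian KDE of track z-positions evaluated on a
    fixed grid: a 1-D density profile of the kind fed to a
    KDE-to-hists model for primary-vertex finding."""
    z = np.asarray(track_z, dtype=float)[:, None]
    g = np.asarray(grid, dtype=float)[None, :]
    return np.exp(-0.5 * ((g - z) / bandwidth) ** 2).sum(axis=0)

grid = np.linspace(-10.0, 10.0, 201)            # z positions in mm (illustrative)
density = kde_hist([0.0, 0.1, -0.1, 5.0], grid)  # three tracks near z=0, one outlier
peak_z = grid[np.argmax(density)]                # candidate PV location
```

A tracks-to-KDE network learns to produce such a profile from raw track parameters, and the downstream model converts peaks in the profile into vertex candidates.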


Data mining is one of the prevalent tools for automated inquiry and identification, applying data mining algorithms and strategies to extract useful information. Liver disorders have increased disproportionately in recent years, and liver disease is now among the leading causes of human suffering in many countries. Early diagnosis of liver disorders is essential for the welfare of society, and the problem deserves serious attention through careful systems for early detection and prediction of liver disease. Automated classification systems suffer from a lack of accuracy when compared with careful biopsy. We propose a new model for liver disorder classification that analyses a patient's medical data using an ANN algorithm, classifying medical records by whether disease is likely to be present. The proposed methodology extracts features using M-PSO and classifies them with an ANN, which improves accuracy compared with existing classification algorithms. This paper focuses on classification over the selected features.
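The classification stage described above is a standard feed-forward ANN applied to the features selected by M-PSO. As a minimal sketch of the forward pass only (weight shapes are illustrative; the abstract does not specify the architecture):

```python
import numpy as np

def ann_predict(x, W1, b1, W2, b2):
    """Forward pass of a one-hidden-layer ANN: tanh hidden units,
    sigmoid output giving the probability of liver disorder."""
    h = np.tanh(x @ W1 + b1)                 # hidden representation
    return 1 / (1 + np.exp(-(h @ W2 + b2)))  # probability in (0, 1)

# With all-zero weights the network is uninformative and outputs 0.5.
x = np.zeros(4)                              # 4 selected features (illustrative)
out = ann_predict(x, np.zeros((4, 3)), np.zeros(3), np.zeros((3, 1)), np.zeros(1))
```

In the proposed pipeline, M-PSO would search feature subsets (and possibly weights) to maximise classification accuracy before or alongside ANN training.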

