scholarly journals Identification of genetic markers for cortical areas using a Random Forest classification routine and the Allen Mouse Brain Atlas

2019 ◽  
Author(s):  
Natalie Weed ◽  
Trygve Bakken ◽  
Nile Graddis ◽  
Nathan Gouwens ◽  
Daniel Millman ◽  
...  

AbstractThe mammalian neocortex is subdivided into a series of ‘cortical areas’ that are functionally and anatomically distinct, and are often distinguished in brain sections using histochemical stains and other markers of protein expression. We searched the Allen Mouse Brain Atlas, a database of gene expression, for novel markers of cortical areas. We employed a random forest algorithm to screen for genes that change expression at area borders. We found novel genetic markers for 19 of 39 areas and provide code that quickly and efficiently searches the Allen Mouse Brain Atlas.

PLoS ONE ◽  
2019 ◽  
Vol 14 (9) ◽  
pp. e0212898 ◽  
Author(s):  
Natalie Weed ◽  
Trygve Bakken ◽  
Nile Graddis ◽  
Nathan Gouwens ◽  
Daniel Millman ◽  
...  

2021 ◽  
Vol 8 (2) ◽  
pp. 77-84
Author(s):  
Nohuddin et al. ◽  

In this paper, a study is established for exploiting a document classification technique for categorizing a set of random online documents. The technique is aimed to assign one or more classes or categories to a document, making it easier to manage and sort. This paper describes an experiment on the proposed method for classifying documents effectively using the decision tree technique. The proposed research framework is a Document Analysis based on the Random Forest Algorithm (DARFA). The proposed framework consists of 5 components, which are (i) Document dataset, (ii) Data Preprocessing, (iii) Document Term Matrix, (iv) Random Forest classification, and (v) Visualization. The proposed classification method can analyze the content of document datasets and classifies documents according to the text content. The proposed framework use algorithms that include TF-IDF and Random Forest algorithm. The outcome of this study benefits as an enhancement to document management procedures like managing documents in daily business operations, consolidating inventory systems, organizing files in databases, and categorizing document folders.


2021 ◽  
Vol 11 (15) ◽  
pp. 7140
Author(s):  
Radko Mesiar ◽  
Ayyub Sheikhi

In this work, we use a copula-based approach to select the most important features for a random forest classification. Based on associated copulas between these features, we carry out this feature selection. We then embed the selected features to a random forest algorithm to classify a label-valued outcome. Our algorithm enables us to select the most relevant features when the features are not necessarily connected by a linear function; also, we can stop the classification when we reach the desired level of accuracy. We apply this method on a simulation study as well as a real dataset of COVID-19 and for a diabetes dataset.


2020 ◽  
Vol 2020 ◽  
pp. 1-10
Author(s):  
Xiaoyi Duan ◽  
Dong Chen ◽  
Xiaohong Fan ◽  
Xiuying Li ◽  
Ding Ding ◽  
...  

In the power analysis attack, when the Hamming weight model is used to describe the power consumption of the chip operation data, the result of the random forest (RF) algorithm is not ideal, so a random forest classification method based on synthetic minority oversampling technique (SMOTE) is proposed. It compensates for the problem that the random forest algorithm is affected by the data imbalance and the classification accuracy of the minority classification is low, which improves the overall classification accuracy rate. The experimental results show that when the training set data is 800, the random forest algorithm predicts the correct rate of 84%, but the classification accuracy of the minority data is 0%, and the SMOTE-based random forest algorithm improves the prediction accuracy of the same set of test data by 91%. The classification accuracy rate of a few categories has increased from 0% to 100%.


2020 ◽  
Vol 117 (52) ◽  
pp. 33474-33485
Author(s):  
Vittorio Fortino ◽  
Lukas Wisgrill ◽  
Paulina Werner ◽  
Sari Suomela ◽  
Nina Linder ◽  
...  

Contact dermatitis tremendously impacts the quality of life of suffering patients. Currently, diagnostic regimes rely on allergy testing, exposure specification, and follow-up visits; however, distinguishing the clinical phenotype of irritant and allergic contact dermatitis remains challenging. Employing integrative transcriptomic analysis and machine-learning approaches, we aimed to decipher disease-related signature genes to find suitable sets of biomarkers. A total of 89 positive patch-test reaction biopsies against four contact allergens and two irritants were analyzed via microarray. Coexpression network analysis and Random Forest classification were used to discover potential biomarkers and selected biomarker models were validated in an independent patient group. Differential gene-expression analysis identified major gene-expression changes depending on the stimulus. Random Forest classification identified CD47, BATF, FASLG, RGS16, SYNPO, SELE, PTPN7, WARS, PRC1, EXO1, RRM2, PBK, RAD54L, KIFC1, SPC25, PKMYT, HISTH1A, TPX2, DLGAP5, TPX2, CH25H, and IL37 as potential biomarkers to distinguish allergic and irritant contact dermatitis in human skin. Validation experiments and prediction performances on external testing datasets demonstrated potential applicability of the identified biomarker models in the clinic. Capitalizing on this knowledge, novel diagnostic tools can be developed to guide clinical diagnosis of contact allergies.


2016 ◽  
Vol 146 ◽  
pp. 370-385 ◽  
Author(s):  
Adam Hedberg-Buenz ◽  
Mark A. Christopher ◽  
Carly J. Lewis ◽  
Kimberly A. Fernandes ◽  
Laura M. Dutca ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document