scholarly journals SUPERSMART: ecology and evolution in the era of big data

Author(s):  
Alexandre Antonelli ◽  
Fabien L. Condamine ◽  
Hannes Hettling ◽  
Karin Nilsson ◽  
R Henrik Nilsson ◽  
...  

Rapidly growing biological data volumes – including molecular sequences, species traits, geographic occurrences, specimen collections, and fossil records – hold an unprecedented, yet largely unexplored potential to reveal how ecological and evolutionary processes generate and maintain biodiversity. Most biodiversity studies integrating ecological data and evolutionary history use an idiosyncratic step-by-step approach for the reconstruction of time-calibrated phylogenies in light of ecological and evolutionary scenarios. Here we introduce a conceptual framework, termed SUPERSMART (Self-Updating Platform for Estimating Rates of Speciation and Migration, Ages, and Relationships of Taxa), and provide a proof of concept for dealing with the moving targets of biodiversity research. This framework reconstructs dated phylogenies based on the assembly of molecular datasets and collects pertinent data on ecology, distribution, and fossils of the focal clade. The data handled for each step are continuously updated as databases accumulate new records. We exemplify the practice of our method by presenting comprehensive phylogenetic and dating analyses for the orders Primates and the Gentianales. We believe that this emerging framework will provide an invaluable tool for a wide range of hypothesis-driven research questions in ecology and evolution.

2014 ◽  
Author(s):  
Alexandre Antonelli ◽  
Fabien L. Condamine ◽  
Hannes Hettling ◽  
Karin Nilsson ◽  
R Henrik Nilsson ◽  
...  

Rapidly growing biological data volumes – including molecular sequences, species traits, geographic occurrences, specimen collections, and fossil records – hold an unprecedented, yet largely unexplored potential to reveal how ecological and evolutionary processes generate and maintain biodiversity. Most biodiversity studies integrating ecological data and evolutionary history use an idiosyncratic step-by-step approach for the reconstruction of time-calibrated phylogenies in light of ecological and evolutionary scenarios. Here we introduce a conceptual framework, termed SUPERSMART (Self-Updating Platform for Estimating Rates of Speciation and Migration, Ages, and Relationships of Taxa), and provide a proof of concept for dealing with the moving targets of biodiversity research. This framework reconstructs dated phylogenies based on the assembly of molecular datasets and collects pertinent data on ecology, distribution, and fossils of the focal clade. The data handled for each step are continuously updated as databases accumulate new records. We exemplify the practice of our method by presenting comprehensive phylogenetic and dating analyses for the orders Primates and the Gentianales. We believe that this emerging framework will provide an invaluable tool for a wide range of hypothesis-driven research questions in ecology and evolution.


Author(s):  
Gaurav Vaidya ◽  
Hilmar Lapp ◽  
Nico Cellinese

Most biological data and knowledge are directly or indirectly linked to biological taxa via taxon names. Using taxon names is one of the most fundamental and ubiquitous ways in which a wide range of biological data are integrated, aggregated, and indexed, from genomic and microbial diversity to macro-ecological data. To this day, the names used, as well as most methods and resources developed for this purpose, are drawn from Linnaean nomenclature. This leads to numerous problems when applied to data-intensive science that depends on computation to take full advantage of the vast – and rapidly increasing – amount of available digital biodiversity data. The theoretical and practical complexities of reconciling taxon names and concepts has plagued the systematics community for decades and now more than ever before, Linnaean names based in Linnaean taxonomy, by far the most prevalent means of linking data to taxa, are unfit for the age of computation-driven data science, due to fundamental theoretical and practical shortfalls that cannot be cured. We propose an alternate approach based on the use of phylogenetic clade definitions, which is a well-developed method for unambiguously defining the semantics of a clade concept in terms of shared evolutionary ancestry (de Queiroz and Gauthier 1990, de Queiroz and Gauthier 1994). These semantics allow locating the defined clade on any phylogeny, or showing that a clade is inconsistent with the topology of a given phylogeny and hence cannot be present on it at all. We have built a workflow for defining phylogenetic clade definitions in terms of shared ancestor and excluded lineage properties, and locating these definitions on any input phylogeny. Once these definitions have been located, we can use the list of species found within that clade on that phylogeny in order to aggregate occurrence data from the Global Biodiversity Information Facility (GBIF). Thus, our approach uses clade definitions with machine-understandable semantics to programmatically and reproducibly aggregate biodiversity data by higher-level taxonomic concepts. This approach has several advantages over the use of taxonomic hierarchies: Unlike taxa, the semantics of clade definitions can be expressed in unambiguous, machine-understandable and reproducible terms and language. The resolution of a given clade definition will depend on the phylogeny being used. Thus, if the phylogeny of groups of interest is updated in light of new evolutionary knowledge, the clade definition can be applied to the new phylogeny to obtain an updated list of clade members consistent with the updated evolutionary knowledge. Machine reproducibility of analyses is possible simply by archiving the machine-readable representations of the clade definition and the phylogeny being used. Unlike taxa, the semantics of clade definitions can be expressed in unambiguous, machine-understandable and reproducible terms and language. The resolution of a given clade definition will depend on the phylogeny being used. Thus, if the phylogeny of groups of interest is updated in light of new evolutionary knowledge, the clade definition can be applied to the new phylogeny to obtain an updated list of clade members consistent with the updated evolutionary knowledge. Machine reproducibility of analyses is possible simply by archiving the machine-readable representations of the clade definition and the phylogeny being used. Clade definitions can be created by biologists as needed or can be reused from those published in peer-reviewed journals. In addition, nearly 300 peer-reviewed clade definitions were recently published as part of the Phylonym volume of the PhyloCode (de Queiroz et al. 2020) and are now available on the Regnum website. As part of the Phyloreferencing Project, we digitize this collection as a machine-readable ontology, where each clade is represented as a class defined by logical conjunctions for class membership, corresponding to a set of necessary and sufficient conditions of shared or divergent evolutionary ancestry. We call these classes phyloreferences, and have created a fully automated workflow for digitizing the Regnum database content into an OWL ontology (W3C OWL Working Group 2012) that we call the Clade Ontology. This ontology includes reference phylogenies and additional metadata about the verbatim clade definitions. Once complete, the Clade Ontology will include all clade definitions from RegNum, both those included in Phylonym after passing peer-review, and those contributed by the community, whether or not under the PhyloCode nomenclature. As an openly available community resource, this will allow researchers to use them to aggregate biodiversity data for comparative biology with grouping semantics that are transparent, machine-processable, and reproducible. In our presentation, we will demonstrate the use of phyloreferences to locate clades on the Open Tree of Life synthetic tree (Hinchliff et al. 2015), to retrieve lists of species in each clade, and to use them to find and aggregate occurrence records in GBIF. We will also describe the workflow we are currently using to build and test the Clade Ontology, and describe our plans for publishing this resource. Finally, we will discuss the advantages and disadvantages of this approach as compared to taxonomic checklists.


Fault Tolerant Reliable Protocol (FTRP) is proposed as a novel routing protocol designed for Wireless Sensor Networks (WSNs). FTRP offers fault tolerance reliability for packet exchange and support for dynamic network changes. The key concept used is the use of node logical clustering. The protocol delegates the routing ownership to the cluster heads where fault tolerance functionality is implemented. FTRP utilizes cluster head nodes along with cluster head groups to store packets in transient. In addition, FTRP utilizes broadcast, which reduces the message overhead as compared to classical flooding mechanisms. FTRP manipulates Time to Live values for the various routing messages to control message broadcast. FTRP utilizes jitter in messages transmission to reduce the effect of synchronized node states, which in turn reduces collisions. FTRP performance has been extensively through simulations against Ad-hoc On-demand Distance Vector (AODV) and Optimized Link State (OLSR) routing protocols. Packet Delivery Ratio (PDR), Aggregate Throughput and End-to-End delay (E-2-E) had been used as performance metrics. In terms of PDR and aggregate throughput, it is found that FTRP is an excellent performer in all mobility scenarios whether the network is sparse or dense. In stationary scenarios, FTRP performed well in sparse network; however, in dense network FTRP’s performance had degraded yet in an acceptable range. This degradation is attributed to synchronized nodes states. Reliably delivering a message comes to a cost, as in terms of E-2-E. results show that FTRP is considered a good performer in all mobility scenarios where the network is sparse. In sparse stationary scenario, FTRP is considered good performer, however in dense stationary scenarios FTRP’s E-2-E is not acceptable. There are times when receiving a network message is more important than other costs such as energy or delay. That makes FTRP suitable for wide range of WSNs applications, such as military applications by monitoring soldiers’ biological data and supplies while in battlefield and battle damage assessment. FTRP can also be used in health applications in addition to wide range of geo-fencing, environmental monitoring, resource monitoring, production lines monitoring, agriculture and animals tracking. FTRP should be avoided in dense stationary deployments such as, but not limited to, scenarios where high application response is critical and life endangering such as biohazards detection or within intensive care units.


Author(s):  
Thomas K. Ogorzalek

Recent electoral cycles have drawn attention to an urban–rural divide at the heart of American politics. This book traces the origins of red and blue America. The urbanicity divide began with the creation of an urban political order that united leaders from major cities and changed the Democratic Party during the New Deal era. These cities, despite being the site of serious, complex conflicts at home, are remarkably cohesive in national politics because members of city delegations represent their city as well as their district. Even though their constituents often don’t see eye-to-eye on important issues, members of these city delegations represent a united city position known as progressive liberalism. Using a wide range of congressional evidence and a unique dataset measuring the urbanicity of U.S. House districts over time, this book argues that city cohesion, an invaluable tool used by cities to address their urgent governance needs through higher levels of government, is fostered by local institutions developed to provide local political order. Crucially, these integrative institutions also helped foster the development of civil rights liberalism by linking constituencies that were not natural allies in support of group pluralism and racial equality. This in turn led to the departure from the coalition of the Southern Democrats, and to our contemporary political environment. The urban combination of diversity and liberalism—supported by institutions that make allies out of rivals—teaches us lessons for governing in a world increasingly characterized by deep social difference and political fragmentation.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Joffrey L. Leevy ◽  
John Hancock ◽  
Richard Zuech ◽  
Taghi M. Khoshgoftaar

AbstractMachine learning algorithms efficiently trained on intrusion detection datasets can detect network traffic capable of jeopardizing an information system. In this study, we use the CSE-CIC-IDS2018 dataset to investigate ensemble feature selection on the performance of seven classifiers. CSE-CIC-IDS2018 is big data (about 16,000,000 instances), publicly available, modern, and covers a wide range of realistic attack types. Our contribution is centered around answers to three research questions. The first question is, “Does feature selection impact performance of classifiers in terms of Area Under the Receiver Operating Characteristic Curve (AUC) and F1-score?” The second question is, “Does including the Destination_Port categorical feature significantly impact performance of LightGBM and Catboost in terms of AUC and F1-score?” The third question is, “Does the choice of classifier: Decision Tree (DT), Random Forest (RF), Naive Bayes (NB), Logistic Regression (LR), Catboost, LightGBM, or XGBoost, significantly impact performance in terms of AUC and F1-score?” These research questions are all answered in the affirmative and provide valuable, practical information for the development of an efficient intrusion detection model. To the best of our knowledge, we are the first to use an ensemble feature selection technique with the CSE-CIC-IDS2018 dataset.


Author(s):  
Jose A. Gallud ◽  
Monica Carreño ◽  
Ricardo Tesoriero ◽  
Andrés Sandoval ◽  
María D. Lozano ◽  
...  

AbstractTechnology-based education of children with special needs has become the focus of many research works in recent years. The wide range of different disabilities that are encompassed by the term “special needs”, together with the educational requirements of the children affected, represent an enormous multidisciplinary challenge for the research community. In this article, we present a systematic literature review of technology-enhanced and game-based learning systems and methods applied on children with special needs. The article analyzes the state-of-the-art of the research in this field by selecting a group of primary studies and answering a set of research questions. Although there are some previous systematic reviews, it is still not clear what the best tools, games or academic subjects (with technology-enhanced, game-based learning) are, out of those that have obtained good results with children with special needs. The 18 articles selected (carefully filtered out of 614 contributions) have been used to reveal the most frequent disabilities, the different technologies used in the prototypes, the number of learning subjects, and the kind of learning games used. The article also summarizes research opportunities identified in the primary studies.


Cells ◽  
2021 ◽  
Vol 10 (5) ◽  
pp. 1219
Author(s):  
Yan Teng ◽  
Yibin Fan ◽  
Jingwen Ma ◽  
Wei Lu ◽  
Na Liu ◽  
...  

The phosphatidylinositol 3-kinase (PI3K)/protein kinase B (Akt) signaling pathway regulates cell proliferation, differentiation, and migration, along with angiogenesis and metabolism. Additionally, it could mediate skin development and homeostasis. There is much evidence to suggest that dysregulation of PI3K/Akt pathway is frequently associated with several human cutaneous malignancies like malignant melanoma (MM), basal cell carcinoma (BCC), and cutaneous squamous cell carcinoma (SCC), as well as their poor outcomes. Nevertheless, emerging roles of PI3K/Akt pathway cascade in a group of common non-malignant skin disorders including acne and psoriasis, among others, have been recognized. The enhanced understanding of dysfunction of PI3K/Akt pathway in patients with these non-malignant disorders has offered a solid foundation for the progress of updated therapeutic targets. This article reviews the latest advances in the roles of PI3K/Akt pathway and their targets in the skin homeostasis and progression of a wide range of non-malignant skin disorders and describes the current progress in preclinical and clinical researches on the involvement of PI3K/Akt pathway targeted therapies.


2019 ◽  
Vol 400 (11) ◽  
pp. 1481-1496 ◽  
Author(s):  
Lingye Chen ◽  
Benjamin König ◽  
Tianbao Liu ◽  
Sumaira Pervaiz ◽  
Yasmin S. Razzaque ◽  
...  

Abstract The volume-regulated anion channel (VRAC) is a key player in the volume regulation of vertebrate cells. This ubiquitously expressed channel opens upon osmotic cell swelling and potentially other cues and releases chloride and organic osmolytes, which contributes to regulatory volume decrease (RVD). A plethora of studies have proposed a wide range of physiological roles for VRAC beyond volume regulation including cell proliferation, differentiation and migration, apoptosis, intercellular communication by direct release of signaling molecules and by supporting the exocytosis of insulin. VRAC was additionally implicated in pathological states such as cancer therapy resistance and excitotoxicity under ischemic conditions. Following extensive investigations, 5 years ago leucine-rich repeat-containing family 8 (LRRC8) heteromers containing LRRC8A were identified as the pore-forming components of VRAC. Since then, molecular biological approaches have allowed further insight into the biophysical properties and structure of VRAC. Heterologous expression, siRNA-mediated downregulation and genome editing in cells, as well as the use of animal models have enabled the assessment of the proposed physiological roles, together with the identification of new functions including spermatogenesis and the uptake of antibiotics and platinum-based cancer drugs. This review discusses the recent molecular biological insights into the physiology of VRAC in relation to its previously proposed roles.


2014 ◽  
Vol 60 (2) ◽  
pp. 221-232 ◽  
Author(s):  
Leonard Sandin ◽  
Astrid Schmidt-Kloiber ◽  
Jens-Christian Svenning ◽  
Erik Jeppesen ◽  
Nikolai Friberg

Abstract Freshwater habitats and organisms are among the most threatened on Earth, and freshwater ecosystems have been subject to large biodiversity losses. We developed a Climate Change Sensitivity (CCS) indicator based on trait information for a selection of stream- and lake-dwelling Ephemeroptera, Plecoptera and Trichoptera taxa. We calculated the CCS scores based on ten species traits identified as sensitive to global climate change. We then assessed climate change sensitivity between the six main ecoregions of Sweden as well as the three Swedish regions based on Illies. This was done using biological data from 1, 382 stream and lake sites where we compared large-scale (ecoregional) patterns in climate change sensitivity with potential future exposure of these ecosystems to increased temperatures using ensemble-modelled future changes in air temperature. Current (1961~1990) measured temperature and ensemble-modelled future (2100) temperature showed an increase from the northernmost towards the southern ecoregions, whereas the predicted temperature change increased from south to north. The CCS indicator scores were highest in the two northernmost boreal ecoregions where we also can expect the largest global climate change-induced increase in temperature, indicating an unfortunate congruence of exposure and sensitivity to climate change. These results are of vital importance when planning and implementing management and conservation strategies in freshwater ecosystems, e.g., to mitigate increased temperatures using riparian buffer strips. We conclude that traits information on taxa specialization, e.g., in terms of feeding specialism or taxa having a preference for high altitudes as well as sensitivity to changes in temperature are important when assessing the risk from future global climate change to freshwater ecosystems.


Author(s):  
Zahra Zakeri Khatir ◽  
Hamid Irannejad

: 1, 2, 4-Triazine derivatives have received much attention due to their multifunctional nature, especially in diverse pharmacological properties as well as a key fragment in many drug candidates. Introduction of a vicinal 5, 6-diaryl/heteroaryl moiety on the 1, 2, 4-triazine ring has attracted plentiful attention in the field of medicinal chemistry. 5, 6-Diaryl/heteroaryl-3-substituted-1, 2, 4-triazine is as a prominent scaffold in many drug candidates which has shown a wide range of pharmacological activities such as anti-diabetic, antifungal, anti-inflammatory, anticancer, anti-HIV, neuroprotective, anticonvulsant, anti- Alzheimer, anti-Parkinson and antioxidant. In this review, we have discussed synthesis, various pharmacological activities of 5, 6-diaryl/heteroaryl-3-substituted-1, 2, 4-triazines, their structure-activity relationship (SAR), pharmacophoric elements and their mechanism of action reported in the published articles during 2000-2019. Evaluation of compounds by PAINS filtering tool was accomplished and showed that this versatile structure could be considered as a privileged structure. Compilation of the biological data confirmed that the position 3 of the 1,2,4-triazine is a key location to determine the affinity and selectivity of the 5,6-diaryl/heteroaryl-3-substituted-1, 2, 4-triazines towards different biologic targets. Specific geometrical and thermodynamic characters of this motif have prompted it as a frequent hitter.


Sign in / Sign up

Export Citation Format

Share Document