Algorithmic techniques for detection and classification of digital objects

Mapping Intimacies ◽

10.12681/eadd/46311 ◽

2019 ◽

Author(s):

Ιωσήφ Πολενάκης

Keyword(s):

Monte Carlo ◽

Digital Objects ◽

Algorithmic Techniques

Σε αυτή τη διδακτορική διατριβή διερευνήθηκαν οι αλγοριθμικές τεχνικές για την ανίχνευση και τη κατάταξη ψηφιακών αντικειμένων. Στον τομέα των ψηφιακών αντικειμένων, η παρούσα εργασία επικεντρώνεται κυρίως στη μελέτη του σχεδιασμού και της περαιτέρω ανάπτυξης αλγοριθμικών τεχνικών ανίχνευσης και κατάταξης μιας συγκεκριμένης κατηγορίας ψηφιακών αντικειμένων, αυτής του λογισμικού και πιο συγκεκριμένα του κακόβουλου λογισμικού, δημιουργώντας εν τέλει ένα ολοκληρωμένο αλγοριθμικό πλαίσιο για την προστασία ενάντια σε αυτό. Οι δημιουργοί των κακόβουλων λογισμικών, προκειμένου να αποφύγουν τις καθιερωμένες μεθόδους ανίχνευσης, έχουν αναπτύξει ευφυείς τεχνικές που εστιάζουν στη μετάλλαξη των παραγόμενων κακόβουλων λογισμικών, ενσωματώνοντας μηχανισμούς μετάλλαξης που στόχο έχουν να τροποποιήσουν ριζικά τη δομή των παραγόμενων δειγμάτων. Ως εκ τούτου, στον πρώτο άξονα, η έρευνα επικεντρώνεται στο σχεδιασμό και την πρόταση μιας αναπαράστασης μέσω γραφήματος της συμπεριφοράς του δείγματος κακόβουλου λογισμικού (συμπεριφοριστικό γράφημα) ανθεκτικής σε μεταλλάξεις, η οποία προκύπτει από τα Γραφήματα Εξάρτησης Κλήσεων Συναρτήσεων Συστήματος (κατευθυνόμενα άκυκλα γραφήματα), τα οποία κατασκευάζονται αντλώντας πληροφορία από την εκτέλεση δυναμικής ανάλυσης του εκτελεσθέντος κακόβουλου λογισμικού. Έτσι, σε πρώτο επίπεδο, προτείνουμε το Γράφημα Συσχετίσεων Ομάδων (κατευθυνόμενο έμβαρο γράφημα) το οποίο προκύπτει έπειτα από ομαδοποίηση των κόμβων του γραφήματος Εξάρτησης Κλήσεων Συναρτήσεων Συστήματος, αξιοποιώντας την ιδιότητα ότι οι κλήσεις συναρτήσεων συστήματος μπορούν να συγχωνευθούν σε ομάδες ανάλογα με την ομοειδή λειτουργικότητά τους. Επιπρόσθετα, επεκτείνουμε αυτή την προσέγγιση προτείνοντας το Γράφημα Κάλυψης, όπου διερευνούμε τις ``σχέσεις κυριαρχίας" μεταξύ των κόμβων του γραφήματος Συσχετίσεων Ομάδων, αναφορικά με το βάρος και το βαθμό αυτών. Επιπλέον, επεκτείνοντας τις δυνατότητες των παραπάνω γραφημάτων προτείνουμε επίσης τα Χρονικά Μεταβαλλόμενα Γραφήματα, τα οποία απεικονίζουν τη δομική εξέλιξη των προτεινόμενων γραφημάτων (δηλ. Γραφήματα Συσχετίσεων Ομάδων και Γραφήματα Κάλυψης) απεικονίζοντας την εξέλιξη στη δομή τους μέσω στιγμιότυπων αυτών, τα οποία καταγράφονται ανα συγκεκριμένες περιόδους. Μεταξύ άλλων, προτείνουμε ένα σύνολο μετρικών ομοιότητας, όπου αξιοποιούνται τα ποσοτικά, σχεσιακά και ποιοτικά χαρακτηριστικά των παραπάνω γραφημάτων αναφορικά με τη συμπεριφορά των κακόβουλων λογισμικών, αξιοποιώντας αυτές τις μετρικές για τη μετέπειτα αποτίμηση των δυνατοτήτων ανίχνευσης και κατάταξης των προταθέντων μοντέλων. Επιπλέον, δεδομένου ότι η χρήση των φορητών συσκευών παρουσιάζει ευρεία εξάπλωση, κατά την εκπόνηση της παρούσας διατριβής μελετήθηκε η ανάπτυξη αλγοριθμικών τεχνικών βασισμένων σε γραφήματα, οι οποίες θα ολοκληρώνουν το ευρύτερο αλγοριθμικό πλαίσιο προστασίας ενάντια στο κακόβουλο λογισμικό, με τη διερεύνηση στρατηγικών βασισμένων σε γραφήματα για την καταστολή και περαιτέρω αποφυγή εν δυνάμει πανδημικών φαινομένων που θα προκύψουν από την εξάπλωση του κακόβουλου λογισμικού. Πιο συγκεκριμένα, προτείνουμε μια σειρά τεχνικών για τη μοντελοποίηση της τοπολογίας του πολεοδομικού σχεδιασμού, των μοτίβων κίνησης των φορητών συσκευών καθώς επίσης και της συμπεριφοράς μετάδοσης (αναφορικά με το ακολουθούμενο επιδημιολογικό μοντέλο), συντονίζοντας τα μοντέλα αυτά στην πλαισίωση μιας αλγοριθμικής τεχνικής που θα καθορίζει τον μέγιστο επιτρεπόμενο χρονικό όριο που απαιτείται από ένα αντίμετρο προστασίας προκειμένου να απομακρυνθεί το κακόβουλο λογισμικό από μια μολυσμένη συσκευή (χρόνος απόκρισης), ώστε τελικά να αποφευχθεί η πανδημία. Τέλος, η ευρύτερη απόδοση της προτεινόμενης προσέγγισης παρουσιάζεται μέσα από μια σειρά επαναλαμβανόμενων πειραμάτων (Monte Carlo) ακολουθώντας διαφορετικά επιδημιολογικά μοντέλα, λαμβάνοντας παράλληλα υπόψη και ένα σύνολο παραγόντων που επηρεάζουν την εξάπλωση του κακόβουλου λογισμικού.

Download Full-text

Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories

International Journal of Metadata Semantics and Ontologies ◽

10.1504/ijmso.2020.112804 ◽

2020 ◽

Vol 14 (3) ◽

pp. 234

Author(s):

Lighton Phiri

Keyword(s):

Automatic Classification ◽

Institutional Repositories ◽

Metadata Quality ◽

Electronic Theses And Dissertations ◽

Digital Objects ◽

Theses And Dissertations

Download Full-text

Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories

International Journal of Metadata Semantics and Ontologies ◽

10.1504/ijmso.2020.10035206 ◽

2020 ◽

Vol 14 (3) ◽

pp. 234

Author(s):

Lighton Phiri

Keyword(s):

Automatic Classification ◽

Institutional Repositories ◽

Metadata Quality ◽

Electronic Theses And Dissertations ◽

Digital Objects ◽

Theses And Dissertations

Download Full-text

Monte Carlo Simulation of Long-Chain Branched Polyolefins Made with Dual Catalysts: A Classification of Chain Structures in Topological Branching Families

Industrial & Engineering Chemistry Research ◽

10.1021/ie049615l ◽

2005 ◽

Vol 44 (8) ◽

pp. 2461-2468 ◽

Cited By ~ 10

Author(s):

Leonardo C. Simon ◽

João B. P. Soares

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Long Chain

Download Full-text

Classification of process dynamics with Monte Carlo singular spectrum analysis

Computers & Chemical Engineering ◽

10.1016/j.compchemeng.2005.12.005 ◽

2006 ◽

Vol 30 (5) ◽

pp. 816-831 ◽

Cited By ~ 21

Author(s):

Gorden T. Jemwa ◽

Chris Aldrich

Keyword(s):

Monte Carlo ◽

Spectrum Analysis ◽

Singular Spectrum Analysis ◽

Process Dynamics ◽

Singular Spectrum

Download Full-text

Action Thresholds for Summer Fungicide Sprays and Sequential Classification of Apple Scab Incidence

Plant Disease ◽

10.1094/pdis-93-5-0490 ◽

2009 ◽

Vol 93 (5) ◽

pp. 490-498 ◽

Cited By ~ 13

Author(s):

Odile Carisse ◽

Catherine Meloche ◽

Guy Boivin ◽

Tristan Jobin

Keyword(s):

Decision Making ◽

Monte Carlo ◽

Monte Carlo Simulations ◽

Sequential Sampling ◽

Apple Scab ◽

Parameter Estimates ◽

Ratio Test ◽

Sample Number ◽

Action Thresholds

The objectives of this study were to establish scab incidence thresholds at which summer fungicide sprays should be initiated and to develop sequential sampling for classification plans to facilitate decision making. The relationship between proportion of scabbed leaves per shoot and percent scabbed fruits at harvest was established based on data collected in both experimental and commercial orchards. Action threshold at which fungicide spray program should be initiated in order to maintain fruit scab at harvest below 2% was established at 0.006 scabbed leaves per shoot; hence incidence thresholds of 0.005 and 0.01 scabbed leaves per shoot were evaluated. Sequential sampling for classification procedures (SSCP), using the Wald's sequential probability ratio test (SPRT), were developed based on a beta-binomial distribution and using parameter estimates of the binary power law. Monte Carlo simulations were used to establish the probability of classifying mean scab incidence as less than the action thresholds (pth = 0.005 and pth = 0.01). Operating characteristic (OC) and average sample number (ASN) curves were established for eight combinations of stop lines and error levels (α and β). Based on the results of the Monte Carlo simulations, four sets of stop lines (two for each pth threshold) were selected for further evaluation. Bootstrap simulations of 50 data sets indicated that both the OC and ASN curves for each of the four pairs of stop lines were similar to OC and ASN values determined by Monte Carlo simulation. When validated with data not used to construct the models, the SSCPs provided the appropriate classification in 98 and 95% of the simulations for the pth = 0.005 and pth = 0.01 (α = 0.05 and β = 0.10), respectively. The sequential sampling plans developed in this study should allow for rapid and accurate classification of the incidence of apple scab on leaves, and aid in sampling for summer scab management decision making.

Download Full-text

Bayesian Classification of Proteomics Biomarkers from Selected Reaction Monitoring Data using an Approximate Bayesian Computation-Markov Chain Monte Carlo Approach

Cancer Informatics ◽

10.1177/1176935118786927 ◽

2018 ◽

Vol 17 ◽

pp. 117693511878692

Author(s):

Kashyap Nagaraja ◽

Ulisses Braga-Neto

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Markov Chain Monte Carlo ◽

Sample Size ◽

Approximate Bayesian Computation ◽

Selected Reaction Monitoring ◽

Bayesian Computation ◽

Reaction Monitoring ◽

Approximate Bayesian

Selected reaction monitoring (SRM) has become one of the main methods for low-mass-range–targeted proteomics by mass spectrometry (MS). However, in most SRM-MS biomarker validation studies, the sample size is very small, and in particular smaller than the number of proteins measured in the experiment. Moreover, the data can be noisy due to a low number of ions detected per peptide by the instrument. In this article, those issues are addressed by a model-based Bayesian method for classification of SRM-MS data. The methodology is likelihood-free, using approximate Bayesian computation implemented via a Markov chain Monte Carlo procedure and a kernel-based Optimal Bayesian Classifier. Extensive experimental results demonstrate that the proposed method outperforms classical methods such as linear discriminant analysis and 3NN, when sample size is small, dimensionality is large, the data are noisy, or a combination of these.

Download Full-text

Monte Carlo particle trajectory simulation for classification of C3+ cluster ion structure utilizing Coulomb explosion imaging

Radiation Physics and Chemistry ◽

10.1016/j.radphyschem.2008.05.049 ◽

2008 ◽

Vol 77 (10-12) ◽

pp. 1328-1332

Author(s):

M. Adachi ◽

Y. Saitoh ◽

A. Chiba ◽

K. Narumi ◽

K. Yamada ◽

...

Keyword(s):

Monte Carlo ◽

Particle Trajectory ◽

Coulomb Explosion ◽

Ion Structure ◽

Trajectory Simulation ◽

Cluster Ion

Download Full-text

The Influence of the Accuracy of Statistical Data on the Results of a Classification of Eu Countries in Terms of Innovation

Folia Oeconomica Stetinensia ◽

10.2478/foli-2020-0024 ◽

2020 ◽

Vol 20 (1) ◽

pp. 408-420

Author(s):

Małgorzata Stec

Keyword(s):

Monte Carlo ◽

Monte Carlo Method ◽

European Commission ◽

Research Methodology ◽

Statistical Data ◽

Eu Countries ◽

A Value ◽

The Monte Carlo Method ◽

Diagnostic Variables

AbstractResearch background: The article attempts to include the accuracy of statistical data in a synthetic evaluation and classification of EU countries in terms of innovation.Purpose: The aim of the article is to evaluate an influence of the accuracy of statistical data on a classification of EU countries in terms of innovation.Research methodology: The research employed diagnostic variables determining the innovation of EU countries and a methodology proposed by the European Commission in the European Innovation Scoreboard 2019. The influence of the uncertainty of the measurement of the diagnostic variables on the Summary Innovation Index of EU countries was evaluated. In order to do this, a procedure employing the Monte Carlo method was proposed.Results: Taking into account the uncertainty of the measurement of variables in the evaluation of the innovation of EU countries resulted in qualifying one of the countries to another innovation group.Novelty: The article draws attention to an important but often neglected problem related to the accuracy of statistical data used in research, and the evaluation of their influence on the calculation of a value of synthetic measure (based on the innovation of EU countries).

Download Full-text

Monte Carlo Convex Hull Model for classification of traditional Chinese paintings

Neurocomputing ◽

10.1016/j.neucom.2015.08.013 ◽

2016 ◽

Vol 171 ◽

pp. 788-797 ◽

Cited By ~ 11

Author(s):

Meijun Sun ◽

Dong Zhang ◽

Zheng Wang ◽

Jinchang Ren ◽

Jesse S. Jin

Keyword(s):

Monte Carlo ◽

Convex Hull

Download Full-text

A Classification of Digital Emergence: A Critical Approach to the Production of Digital Objects in Special Collections

Canadian Journal of Academic Librarianship ◽

10.33137/cjal-rcbu.v1.24305 ◽

2016 ◽

Vol 1 ◽

pp. 42-59

Author(s):

Robert D. Montoya

Keyword(s):

Digital Libraries ◽

Academic Library ◽

Research Process ◽

Lessons Learned ◽

Information Control ◽

Critical Approach ◽

Special Collections ◽

Case Study Site ◽

Digital Objects

This paper examines the infrastructure of digital libraries and teases out the subtle ways their formation and construction is a digital extension and representation of the social, political, and institutional circumstances by which they are created. Building off lessons learned from UCLA Library Special Collections as a case study site, this paper proposes a classification of digital emergence that provides more transparency about how digital surrogates come to exist in digital libraries and how we can use this information to better contextualize the importance of these surrogates within academic library services. The discussion then situates digital libraries as medial interfacing infrastructures that are fundamentally non-neutral social apparatuses that disappear in the course of daily use. Marcuse’s notion of technological rationality is incorporated to illustrate the extent to which technological infrastructures influence and reformulate the way we understand the research process using special collections and archives, and how these infrastructures can function as a mechanism for information control. Finally, Bowker and Star’s text, Sorting Things Out: Classification and Its Consequences, briefly illustrates how librarians can contextualize the emergence of digital objects, and how this context, and the concomitant technological biases, can be methodologically brought to light using infrastructural inversion.

Download Full-text