multiple query Latest Research Papers

Topic Difficulty: Collection and Query Formulation Effects

ACM Transactions on Information Systems ◽

10.1145/3470563 ◽

2022 ◽

Vol 40 (1) ◽

pp. 1-36

Author(s):

J. Shane Culpepper ◽

Guglielmo Faggioli ◽

Nicola Ferro ◽

Oren Kurland

Keyword(s):

Effect Size ◽

Fundamental Problem ◽

Interaction Effects ◽

Word Choice ◽

Information Need ◽

Retrieval Models ◽

Multiple User ◽

Ranking Algorithms ◽

Ranking Models ◽

Multiple Query

Several recent studies have explored the interaction effects between topics, systems, corpora, and components when measuring retrieval effectiveness. However, all of these previous studies assume that a topic or information need is represented by a single query. In reality, users routinely reformulate queries to satisfy an information need. In recent years, there has been renewed interest in the notion of “query variations” which are essentially multiple user formulations for an information need. Like many retrieval models, some queries are highly effective while others are not. This is often an artifact of the collection being searched which might be more or less sensitive to word choice. Users rarely have perfect knowledge about the underlying collection, and so finding queries that work is often a trial-and-error process. In this work, we explore the fundamental problem of system interaction effects between collections, ranking models, and queries. To answer this important question, we formalize the analysis using ANalysis Of VAriance (ANOVA) models to measure multiple components effects across collections and topics by nesting multiple query variations within each topic. Our findings show that query formulations have a comparable effect size of the topic factor itself, which is known to be the factor with the greatest effect size in prior ANOVA studies. Both topic and formulation have a substantially larger effect size than any other factor, including the ranking algorithms and, surprisingly, even query expansion. This finding reinforces the importance of further research in understanding the role of query rewriting in IR related tasks.

Download Full-text

Efficiently computing Pareto optimal G-skyline query in wireless sensor network

International Journal of Distributed Sensor Networks ◽

10.1177/15501477211060673 ◽

2021 ◽

Vol 17 (12) ◽

pp. 155014772110606

Author(s):

Leigang Dong ◽

Guohua Liu ◽

Xiaowei Cui ◽

Quan Yu

Keyword(s):

Decision Making ◽

Wireless Sensor Network ◽

Sensor Network ◽

Synthetic Data ◽

Sensor Data ◽

Wireless Sensor ◽

Pareto Optimal ◽

Skyline Query ◽

Data Set ◽

Multiple Query

There are much data transmitted from sensors in wireless sensor network. How to mine vital information from these large amount of data is very important for decision-making. Aiming at mining more interesting information for users, the skyline technology has attracted more attention due to its widespread use for multi-criteria decision-making. The point which is not dominated by any other points can be called skyline point. The skyline consists of all these points which are candidates for users. However, traditional skyline which consists of individual points is not suitable for combinations. To address this gap, we focus on the group skyline query and propose efficient algorithm to computing the Pareto optimal group-based skyline (G-skyline). We propose multiple query windows to compute key skyline layers, then optimize the method to compute directed skyline graph, finally introduce primary points definition and propose a fast algorithm based on it to compute G-skyline groups directly and efficiently. The experiments on the real-world sensor data set and the synthetic data set show that our algorithm performs more efficiently than the existing algorithms.

Download Full-text

Correction: Reducing the Cost of Ensemble-Based Data Assimilation in Multiple-Query Scenarios through Covariance Augmentation

10.2514/6.2021-3632.c1 ◽

2021 ◽

Author(s):

Andrew F. Ilersich ◽

Kyle Schau ◽

Joseph C. Oefelein ◽

Adam M. Steinberg ◽

Masayuki Yano

Keyword(s):

Data Assimilation ◽

The Cost ◽

Multiple Query

Download Full-text

A Data Assimilation Framework for Reacting Flow in Multiple-Query Scenarios

AIAA Propulsion and Energy 2021 Forum ◽

10.2514/6.2021-3632 ◽

2021 ◽

Author(s):

Andrew F. Ilersich ◽

Kyle Schau ◽

Joseph C. Oefelein ◽

Adam M. Steinberg ◽

Masayuki Yano

Keyword(s):

Data Assimilation ◽

Reacting Flow ◽

Multiple Query

Download Full-text

Is Query Performance Prediction With Multiple Query Variations Harder Than Topic Performance Prediction?

Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ◽

10.1145/3404835.3463039 ◽

2021 ◽

Author(s):

Oleg Zendel ◽

J. Shane Culpepper ◽

Falk Scholer

Keyword(s):

Performance Prediction ◽

Query Performance ◽

Multiple Query

Download Full-text

SWARM: Adaptive Load Balancing in Distributed Streaming Systems for Big Spatial Data

ACM Transactions on Spatial Algorithms and Systems ◽

10.1145/3460013 ◽

2021 ◽

Vol 7 (3) ◽

pp. 1-43

Author(s):

Anas Daghistani ◽

Walid G. Aref ◽

Arif Ghafoor ◽

Ahmed R. Mahmood

Keyword(s):

Real Time ◽

Spatial Data ◽

Location Based Services ◽

Data Streaming ◽

Distributed Processes ◽

Data Persistence ◽

History Of ◽

Synthetic Datasets ◽

Multiple Query ◽

Streaming Systems

The proliferation of GPS-enabled devices has led to the development of numerous location-based services. These services need to process massive amounts of streamed spatial data in real-time. The current scale of spatial data cannot be handled using centralized systems. This has led to the development of distributed spatial streaming systems. Existing systems are using static spatial partitioning to distribute the workload. In contrast, the real-time streamed spatial data follows non-uniform spatial distributions that are continuously changing over time. Distributed spatial streaming systems need to react to the changes in the distribution of spatial data and queries. This article introduces SWARM, a lightweight adaptivity protocol that continuously monitors the data and query workloads across the distributed processes of the spatial data streaming system and redistributes and rebalances the workloads as soon as performance bottlenecks get detected. SWARM is able to handle multiple query-execution and data-persistence models. A distributed streaming system can directly use SWARM to adaptively rebalance the system’s workload among its machines with minimal changes to the original code of the underlying spatial application. Extensive experimental evaluation using real and synthetic datasets illustrate that, on average, SWARM achieves 2 improvement in throughput over a static grid partitioning that is determined based on observing a limited history of the data and query workloads. Moreover, SWARM reduces execution latency on average 4 compared with the other technique.

Download Full-text

GraphZero

ACM SIGOPS Operating Systems Review ◽

10.1145/3469379.3469383 ◽

2021 ◽

Vol 55 (1) ◽

pp. 21-37

Author(s):

Daniel Mawhirter ◽

Sam Reinehr ◽

Connor Holmes ◽

Tongping Liu ◽

Bo Wu

Keyword(s):

Group Theory ◽

Symmetry Breaking ◽

High Performance ◽

State Of The Art ◽

Input Graph ◽

Query Pattern ◽

Subgraph Matching ◽

Compilation Process ◽

Multiple Query ◽

Query Patterns

Subgraph matching is a fundamental task in many applications which identifies all the embeddings of a query pattern in an input graph. Compilation-based subgraph matching systems generate specialized implementations for the provided patterns and often substantially outperform other systems. However, the generated code causes significant computation redundancy and the compilation process incurs too much overhead to be used online, both due to the inherent symmetry in the structure of the query pattern. In this paper, we propose an optimizing query compiler, named GraphZero, to completely address these limitations through symmetry breaking based on group theory. GraphZero implements three novel techniques. First, its schedule explorer efficiently prunes the schedule space without missing any high-performance schedule. Second, it automatically generates and enforces a set of restrictions to eliminate computation redundancy. Third, it generalizes orientation, a surprisingly effective optimization that was only used for clique patterns, to apply to arbitrary patterns. Evaluation on multiple query patterns shows that GraphZero outperforms two state-of-the-art compilation and non-compilation based systems by up to 40X and 2654X, respectively.

Download Full-text

Exploring a Novel Multiple-Query Resistive Grid-Based Planning Method Applied to High-DOF Robotic Manipulators

Sensors ◽

10.3390/s21093274 ◽

2021 ◽

Vol 21 (9) ◽

pp. 3274

Author(s):

Jesus Huerta-Chua ◽

Gerardo Diaz-Arango ◽

Hector Vazquez-Leal ◽

Javier Flores-Mendez ◽

Mario Moreno-Moreno ◽

...

Keyword(s):

Configuration Space ◽

Sparse Matrix ◽

Sparse Matrices ◽

Robotic Manipulators ◽

Industrial Sector ◽

Industrial Applications ◽

Study Results ◽

Six Dof ◽

Multiple Query ◽

Grid Based

The applicability of the path planning strategy to robotic manipulators has been an exciting topic for researchers in the last few decades due to the large demand in the industrial sector and its enormous potential development for space, surgical, and pharmaceutical applications. The automation of high-degree-of-freedom (DOF) manipulator robots is a challenging task due to the high redundancy in the end-effector position. Additionally, in the presence of obstacles in the workspace, the task becomes even more complicated. Therefore, for decades, the most common method of integrating a manipulator in an industrial automated process has been the demonstration technique through human operator intervention. Although it is a simple strategy, some drawbacks must be considered: first, the path’s success, length, and execution time depend on operator experience; second, for a structured environment with few objects, the planning task is easy. However, for most typical industrial applications, the environments contain many obstacles, which poses challenges for planning a collision-free trajectory. In this paper, a multiple-query method capable of obtaining collision-free paths for high DOF manipulators with multiple surrounding obstacles is presented. The proposed method is inspired by the resistive grid-based planner method (RGBPM). Furthermore, several improvements are implemented to solve complex planning problems that cannot be handled by the original formulation. The most important features of the proposed planner are as follows: (1) the easy implementation of robotic manipulators with multiple degrees of freedom, (2) the ability to handle dozens of obstacles in the environment, (3) compatibility with various obstacle representations using mathematical models, (4) a new recycling of a previous simulation strategy to convert the RGBPM into a multiple-query planner, and (5) the capacity to handle large sparse matrices representing the configuration space. A numerical simulation was carried out to validate the proposed planning method’s effectiveness for manipulators with three, five, and six DOFs on environments with dozens of surrounding obstacles. The case study results show the applicability of the proposed novel strategy in quickly computing new collision-free paths using the first execution data. Each new query requires less than 0.2 s for a 3 DOF manipulator in a configuration space free-modeled by a 7291 × 7291 sparse matrix and less than 30 s for five and six DOF manipulators in a configuration space free-modeled by 313,958 × 313,958 and 204,087 × 204,087 sparse matrices, respectively. Finally, a simulation was conducted to validate the proposed multiple-query RGBPM planner’s efficacy in finding feasible paths without collision using a six-DOF manipulator (KUKA LBR iiwa 14R820) in a complex environment with dozens of surrounding obstacles.

Download Full-text

A search-engine for single-cell epigenome profiles for multi-purpose applications

10.1101/2021.03.24.436738 ◽

2021 ◽

Author(s):

Shreya Mishra ◽

Smriti Chawla ◽

Neetesh Pandey ◽

Debarka SenGupta ◽

Vibhor Kumar

Keyword(s):

Single Cell ◽

Large Data ◽

Open Chromatin ◽

Data Sets ◽

Large Pool ◽

Mouse Cells ◽

Low Dimensional ◽

Cell Expression ◽

Short Time ◽

Multiple Query

AbstractThe true benefits of large data-sets of single-cell epigenome and transcriptome profiles can be availed only when they are searchable to annotate individual unannotated cells. Matching a single-cell epigenome profile to a large pool of reference cells remains as a challenge and largely unexplored. Here, we introduce scEpiSearch, which enables a user to query single-cell open-chromatin read-count matrices for comparison against a large pool of single-cell expression and open-chromatin profiles from human and mouse cells (∼ 3.5 million cells). Besides providing accurate search in a short time and scalable visualization of results for multiple query cells, scEpisearch also provides a low-dimensional representation of single-cell open-chromatin profiles. It outperformed many other methods in terms of correct low-dimensional embedding of single-cell open-chromatin profiles originating from different platforms and species. Here we show how scEpiSearch is unique in providing several facilities to assist researchers in the analysis of single-cell open-chromatin profiles to infer cellular state, lineage, potency and representative genes.

Download Full-text

An Approach of Automatic SPARQL Generation for BIM Data Extraction

Applied Sciences ◽

10.3390/app10248794 ◽

2020 ◽

Vol 10 (24) ◽

pp. 8794

Author(s):

Dongming Guo ◽

Erling Onstein ◽

Angela Daniela La Rosa

Keyword(s):

Large Scale ◽

Query Language ◽

Data Extraction ◽

Building Design ◽

Query Languages ◽

Design Data ◽

Simple Method ◽

Building Information ◽

Essential Knowledge ◽

Multiple Query

Generally, building information modelling (BIM) models contain multiple dimensions of building information, including building design data, construction information, and maintenance-related contents, which are related with different engineering stakeholders. Efficient extraction of BIM data is a necessary and vital step for various data analyses and applications, especially in large-scale BIM projects. In order to extract BIM data, multiple query languages have been developed. However, the use of these query languages for data extraction usually requires that engineers have good programming skills, flexibly master query language(s), and fully understand the Industry Foundation Classes (IFC) express schema or the ontology expression of the IFC schema (ifcOWL). These limitations have virtually increased the difficulties of using query language(s) and raised the requirements on engineers’ essential knowledge reserves in data extraction. In this paper, we develop a simple method for automatic SPARQL (SPARQL Protocol and RDF Query Language) query generation to implement effective data extraction. Based on the users’ data requirements, we match users’ requirements with ifcOWL ontology concepts or instances, search the connected relationships among query keywords based on semantic BIM data, and generate the user-desired SPARQL query. We demonstrate through several case studies that our approach is effective and the generated SPARQL queries are accurate.

Download Full-text

multiple query
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Topic Difficulty: Collection and Query Formulation Effects

Efficiently computing Pareto optimal G-skyline query in wireless sensor network

Correction: Reducing the Cost of Ensemble-Based Data Assimilation in Multiple-Query Scenarios through Covariance Augmentation

A Data Assimilation Framework for Reacting Flow in Multiple-Query Scenarios

Is Query Performance Prediction With Multiple Query Variations Harder Than Topic Performance Prediction?

SWARM: Adaptive Load Balancing in Distributed Streaming Systems for Big Spatial Data

GraphZero

Exploring a Novel Multiple-Query Resistive Grid-Based Planning Method Applied to High-DOF Robotic Manipulators

A search-engine for single-cell epigenome profiles for multi-purpose applications

An Approach of Automatic SPARQL Generation for BIM Data Extraction

Export Citation Format

multiple queryRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Topic Difficulty: Collection and Query Formulation Effects

Efficiently computing Pareto optimal G-skyline query in wireless sensor network

Correction: Reducing the Cost of Ensemble-Based Data Assimilation in Multiple-Query Scenarios through Covariance Augmentation

A Data Assimilation Framework for Reacting Flow in Multiple-Query Scenarios

Is Query Performance Prediction With Multiple Query Variations Harder Than Topic Performance Prediction?

SWARM: Adaptive Load Balancing in Distributed Streaming Systems for Big Spatial Data

GraphZero

Exploring a Novel Multiple-Query Resistive Grid-Based Planning Method Applied to High-DOF Robotic Manipulators

A search-engine for single-cell epigenome profiles for multi-purpose applications

An Approach of Automatic SPARQL Generation for BIM Data Extraction

multiple query
Recently Published Documents