Rough Sets and Probability Masses for Dempster-Shafer Data Fusion at a Traffic Management Center

Ping Yi; Huapu Lu; Yucheng Zhang

doi:10.3141/1836-19

Rough Sets and Probability Masses for Dempster-Shafer Data Fusion at a Traffic Management Center

Transportation Research Record Journal of the Transportation Research Board ◽

10.3141/1836-19 ◽

2003 ◽

Vol 1836 (1) ◽

pp. 151-156 ◽

Cited By ~ 4

Author(s):

Ping Yi ◽

Huapu Lu ◽

Yucheng Zhang

Keyword(s):

Data Fusion ◽

Rough Sets ◽

Traffic Management ◽

Large Scale ◽

Field Testing ◽

Traffic Information ◽

Traffic Operations ◽

Data Mining Technique ◽

Data Source ◽

Single Data

The Dempster–Shafer data-fusion technique as affected by probability masses as a result of sensor selection and probability masses distribution is investigated. Dempster–Shafer inference is a statistically based data-classification technique for detecting traffic events that affect normal traffic operations. It is used when data sources contribute discontinuous and incomplete information such that no single data source can produce a predominantly high probability of certainty for identifying the most probable event. To help in the selection of appropriate sensors and probability masses, a rough-sets data-mining technique in support of Dempster–Shafer inference was proposed and tested. The basic rough-sets technique is introduced, and a numerical example is given to explain its applications. Field testing of the rough-sets technique showed that it can reasonably and systematically process a large amount of traffic information, as an alternative to relying on the intuition of traffic operators and system managers. Because it allows easy maintenance and update of estimated probability masses, this technique is suitable for large-scale applications at the traffic management center.

Download Full-text

Decision Support System for Advanced Traffic Management Through Data Fusion

Transportation Research Record Journal of the Transportation Research Board ◽

10.3141/1804-23 ◽

2002 ◽

Vol 1804 (1) ◽

pp. 173-178 ◽

Cited By ~ 14

Author(s):

Lawrence A. Klein ◽

Ping Yi ◽

Hualiang Teng

Keyword(s):

Data Fusion ◽

Traffic Management ◽

Information Sources ◽

Partial Information ◽

Weather Conditions ◽

Field Testing ◽

Traffic Operations ◽

Dempster Shafer Theory ◽

Classification Technique ◽

Shafer Theory

The Dempster–Shafer theory for data fusion and mining in support of advanced traffic management is introduced and tested. Dempste–Shafer inference is a statistically based classification technique that can be applied to detect traffic events that affect normal traffic operations. It is useful when data or information sources contribute partial information about a scenario, and no single source provides a high probability of identifying the event responsible for the received information. The technique captures and combines whatever information is available from the data sources. Dempster’s rule is applied to determine the most probable event—as that with the largest probability based on the information obtained from all contributing sources. The Dempster–Shafer theory is explained and its implementation described through numerical examples. Field testing of the data fusion technique demonstrated its effectiveness when the probability masses, which quantify the likelihood of the postulated events for the scenario, reflect current traffic and weather conditions.

Download Full-text

CITYWIDE ESTIMATION OF PARKING SPACE USING AERIAL IMAGERY AND OSM DATA FUSION WITH DEEP LEARNING AND FINE-GRAINED ANNOTATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2021-479-2021 ◽

2021 ◽

Vol XLIII-B2-2021 ◽

pp. 479-485

Author(s):

C. Henry ◽

J. Hellekes ◽

N. Merkle ◽

S. M. Azimi ◽

F. Kurz

Keyword(s):

Data Fusion ◽

Urban Areas ◽

Traffic Management ◽

Large Scale ◽

Private Property ◽

Contextual Information ◽

Aerial Imagery ◽

Parking Space ◽

Fine Grained ◽

Actual Use

Abstract. Emerging traffic management technologies, smart parking applications, together with transport researchers and urban planners are interested in fine-grained data on parking space in cities. However, there are no standardized, complete and up-to-date databases for many urban areas. Moreover, manual data collection is expensive and time-consuming. Aerial imagery of entire cities can be used to inventory not only publicly accessible and dedicated parking lots, but also roadside parking areas and those on private property. For a realistic estimation of the total parking space, the observed use of multi-functional traffic areas is taken into account by segmenting not only parking areas but also roads according to their purpose. In this paper, different U-Net based architectures are tested for detecting all these types of visible traffic areas. A new large-scale, high-quality dataset of manual annotations is used in combination with selected contextual information from OpenStreetMap (OSM) to depict the actual use as parking space. Our models achieve a good performance on parking area segmentation, and we show the significant impact of OSM data fusion in deep neural networks on the simultaneous extraction of multiple traffic areas compared to using aerial imagery alone.

Download Full-text

Measuring Traffic in Cities Through a Large-Scale Online Platform

Journal of Big Data Analytics in Transportation ◽

10.1007/s42421-019-00007-7 ◽

2019 ◽

Vol 1 (2-3) ◽

pp. 161-173 ◽

Cited By ~ 3

Author(s):

Vilhelm Verendel ◽

Sonia Yeh

Keyword(s):

Real Time ◽

Large Scale ◽

Data Availability ◽

Traffic Information ◽

Traffic Data ◽

Online Platform ◽

Real Time Traffic ◽

Data Source ◽

Road Segments

Abstract Online real-time traffic data services could effectively deliver traffic information to people all over the world and provide large benefits to the society and research about cities. Yet, city-wide road network traffic data are often hard to come by on a large scale over a longer period of time. We collect, describe, and analyze traffic data for 45 cities from HERE, a major online real-time traffic information provider. We sampled the online platform for city traffic data every 5 min during 1 year, in total more than 5 million samples covering more than 300 thousand road segments. Our aim is to describe some of the practical issues surrounding the data that we experienced in working with this type of data source, as well as to explore the data patterns and see how this data source provides information to study traffic in cities. We focus on data availability to characterize how traffic information is available for different cities; it measures the share of road segments with real-time traffic information at a given time for a given city. We describe the patterns of real-time data availability, and evaluate methods to handle filling in missing speed data for road segments when real-time information was not available. We conduct a validation case study based on Swedish traffic sensor data and point out challenges for future validation. Our findings include (i) a case study of validating the HERE data against ground truth available for roads and lanes in a Swedish city, showing that real-time traffic data tends to follow dips in travel speed but miss instantaneous higher speed measured in some sensors, typically at times when there are fewer vehicles on the road; (ii) using time series clustering, we identify four clusters of cities with different types of measurement patterns; and (iii) a k-nearest neighbor-based method consistently outperforms other methods to fill in missing real-time traffic speeds. We illustrate how to work with this kind of traffic data source that is increasingly available to researchers, travellers, and city planners. Future work is needed to broaden the scope of validation, and to apply these methods to use online data for improving our knowledge of traffic in cities.

Download Full-text

Application of Dempster–Shafer Data Fusion Technique in Support of Decision Making with Big Data

Transportation Research Record Journal of the Transportation Research Board ◽

10.3141/2645-04 ◽

2017 ◽

Vol 2645 (1) ◽

pp. 32-37 ◽

Cited By ~ 2

Author(s):

Ping Yi ◽

Songling Zhang

Keyword(s):

Decision Making ◽

Big Data ◽

Data Fusion ◽

Incomplete Information ◽

Data Sources ◽

Fusion Technique ◽

Crowd Management ◽

Multiple Data ◽

Data Source ◽

Single Data

This paper introduces applications of the Dempster–Shafer (D-S) data fusion technique in transportation system decision making. D-S inference is a statistics-based data classification technique, and it can be used when data sources contribute discontinuous and incomplete information and no single data source can produce an overwhelmingly high probability of certainty for identifying the most probable event. The technique captures and combines the information contributed by the data sources by using Dempster’s rule to find the conjunction of the events and to determine the highest associated probability. The D-S theory is explained and its implementation described through numerical examples of a ride-hauling service and of crowd management at a subway station. Results from the applications have shown that the technique is very effective in dealing with incomplete information and multiple data sources in the era of big data.

Download Full-text

Based on Data Fusion Intelligent Traffic Information Analysis and Optimization

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.568-570.831 ◽

2014 ◽

Vol 568-570 ◽

pp. 831-834

Author(s):

Yuan Zhang Lu ◽

Bing Zhang

Keyword(s):

Data Fusion ◽

Intelligent Transportation Systems ◽

Traffic Management ◽

Scientific Basis ◽

Transportation Systems ◽

Traffic Information ◽

Information Analysis ◽

Existing Problems ◽

Attribute Data ◽

Intelligent Analysis

In this paper, we propose an analysis refine scheme based on data fusion towards some existing problems in data analysis of intelligent transportation systems .This method constructed the data into a plurality of time-series according to the characteristics of each attribute data. Providing an objective scientific basis for dynamic traffic management through intelligent analysis of traffic information based on the gray advantage analysis among data and system model of Intelligent Traffic Information decision support and auxiliary decision analysis.

Download Full-text

MULTI-SENSOR TRAFFIC DATA FUSION FOR CONGESTION DETECTION AND TRACKING

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b1-2020-173-2020 ◽

2020 ◽

Vol XLIII-B1-2020 ◽

pp. 173-180

Author(s):

J. Gitahi ◽

M. Hahn ◽

M. Storz ◽

C. Bernhard ◽

M. Feldges ◽

...

Keyword(s):

Data Fusion ◽

Traffic Management ◽

Ground Truth ◽

Data Availability ◽

Sensor Data ◽

Traffic Information ◽

Multiple Sources ◽

Detection And Tracking ◽

Congestion Detection ◽

Spatial Coverage

Abstract. Traffic management applications including congestion detection and tracking rely on traffic from multiple sources to model the traffic conditions. The sources are either stationary sensors which include inductive loop detectors (ILD), radar stations and Bluetooth/WiFi/BLE sensors or Floating Car Data (FCD) from moving vehicles which transmit their locations and speeds. The different sources have their inherent strengths and weaknesses but when used together, they have the potential to provide traffic information with increased robustness. Multi-sensor data fusion has the potential to enhance the estimation of traffic state in real-time by reducing the uncertainty of individual sources, extending the temporal and spatial coverage and increasing the confidence of data inputs. In this study, we fuse data from different FCD providers to improve travel time and average segment speeds estimation. We use data from INRIX, HERE and TomTom FCD commercial services and fuse the speeds based on their confidence values and granularity on virtual sub-segments of 250 m. Speeds differences between each pair of datasets are evaluated by calculating the absolute mean and standard deviation of differences. The evaluation of systematic differences is also performed for peak periods depending on the day of the week. INRIX FCD speeds are compared with ground truth spot speeds where both datasets are measured at a 1-minute interval which show good agreement with an error rate of between 8–20%. Some issues that affect FCD accuracy which include data availability and reliability problems are identified and discussed.

Download Full-text

Quantification and Discrimination of in Vitro Regeneration Swertia nervosa at Different Growth Periods using the UPLC/UV Coupled with Chemometric Method

Journal of AOAC International ◽

10.5740/jaoacint.17-0488 ◽

2018 ◽

Vol 101 (5) ◽

pp. 1473-1481

Author(s):

Jie Li ◽

Ji Zhang ◽

Zhitian Zuo ◽

Hengyu Huang ◽

Yuanzhong Wang

Keyword(s):

Data Fusion ◽

High Performance ◽

Large Scale ◽

In Vitro Regeneration ◽

Hierarchical Cluster ◽

Chemical Information ◽

Growth Stages ◽

Chemometric Method ◽

Single Data

Abstract Background: Swertia nervosa (Wall. ex G. Don) C. B. Clarke, a promising traditional herbal medicine for the treatment of liver disorders, is endangered due to its extensive collection and unsustainable harvesting practices. Objective: The aim of this study is to discuss the diversity of metabolites (loganic acid, sweroside, swertiamarin, and gentiopicroside) at different growth stages and organs of Swertia nervosa using the ultra-high-performance LC (UPLC)/UV coupled with chemometric method. Methods: UPLC data, UV data, and data fusion were treated separately to find more useful information by partial least-squares discriminant analysis (PLS-DA). Hierarchical cluster analysis (HCA), an unsupervised method, was then employed for validating the results from PLS-DA. Results: Three strategies displayed different chemical information associated with the sample discrimination. UV information mainly contributed to the classification of different organs; UPLC information was prominently responsible for both organs and growth periods; the data fusion did not perform with apparent superiority compared with single data analysis, although it provided useful information to differentiate leaves that could not be recognized by UPLC. The quantification result showed that the content of swertiamarin was the highest compared with the other three metabolites, especially in leaves at the rooted stage (19.57 ± 5.34 mg/g). Therefore, we speculated that interactive transformations occurred among these four metabolites, facilitated by root formation. Conclusions: This work will contribute to exploitation of bioactive compounds of S. nervosa, as well as its large-scale propagation. Highlights: The roots formation may influence the distribution and accumulation of metabolites.

Download Full-text

A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration

Epidemiologia ◽

10.3390/epidemiologia2030024 ◽

2021 ◽

Vol 2 (3) ◽

pp. 315-324

Author(s):

Juan M. Banda ◽

Ramya Tekumalla ◽

Guanyu Wang ◽

Jingyuan Yu ◽

Tuo Liu ◽

...

Keyword(s):

Large Scale ◽

Social Dynamics ◽

Additional Data ◽

Open Data ◽

Data Sources ◽

Research Projects ◽

Research Groups ◽

The World ◽

Data Source

As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is allowing other scientists to learn from local experiences and data generated on the front lines of the COVID-19 pandemic. However, there is a need to integrate additional data sources that map and measure the role of social dynamics of such a unique worldwide event in biomedical, biological, and epidemiological analyses. For this purpose, we present a large-scale curated dataset of over 1.12 billion tweets, growing daily, related to COVID-19 chatter generated from 1 January 2020 to 27 June 2021 at the time of writing. This data source provides a freely available additional data source for researchers worldwide to conduct a wide and diverse number of research projects, such as epidemiological analyses, emotional and mental responses to social distancing measures, the identification of sources of misinformation, stratified measurement of sentiment towards the pandemic in near real time, among many others.

Download Full-text

Event detection of different English data sources based on transfer learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189798 ◽

2021 ◽

pp. 1-11

Author(s):

Yanan Huang ◽

Yuji Miao ◽

Zhenjing Da

Keyword(s):

Transfer Learning ◽

Event Detection ◽

Visual Analysis ◽

Learning Algorithm ◽

Data Sources ◽

Data Set ◽

Data Source ◽

Single Data Source ◽

The Difference ◽

Single Data

The methods of multi-modal English event detection under a single data source and isomorphic event detection of different English data sources based on transfer learning still need to be improved. In order to improve the efficiency of English and data source time detection, based on the transfer learning algorithm, this paper proposes multi-modal event detection under a single data source and isomorphic event detection based on transfer learning for different data sources. Moreover, by stacking multiple classification models, this paper makes each feature merge with each other, and conducts confrontation training through the difference between the two classifiers to further make the distribution of different source data similar. In addition, in order to verify the algorithm proposed in this paper, a multi-source English event detection data set is collected through a data collection method. Finally, this paper uses the data set to verify the method proposed in this paper and compare it with the current most mainstream transfer learning methods. Through experimental analysis, convergence analysis, visual analysis and parameter evaluation, the effectiveness of the algorithm proposed in this paper is demonstrated.

Download Full-text

The Research of Hierarchical Data Fusion Based on Three-tiers Wireless Sensor Networks for Urban Real Time Traffic Information Monitoring

2010 Fourth International Conference on Genetic and Evolutionary Computing ◽

10.1109/icgec.2010.203 ◽

2010 ◽

Author(s):

Huaping Yu

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Data Fusion ◽

Real Time ◽

Wireless Sensor ◽

Traffic Information ◽

Hierarchical Data ◽

Real Time Traffic ◽

Information Monitoring

Download Full-text