Benchmarking Deep Trackers on Aerial Videos

Abu Md Niamul Taufique; Breton Minnehan; Andreas Savakis

doi:10.3390/s20020547

Benchmarking Deep Trackers on Aerial Videos

Sensors ◽

10.3390/s20020547 ◽

2020 ◽

Vol 20 (2) ◽

pp. 547

Author(s):

Abu Md Niamul Taufique ◽

Breton Minnehan ◽

Andreas Savakis

Keyword(s):

Deep Learning ◽

Ground Level ◽

Visual Object ◽

Camera Motion ◽

Correlation Filters ◽

Advantages And Disadvantages ◽

Learning Techniques ◽

Benchmark Datasets ◽

Siamese Networks ◽

Aerial Tracking

In recent years, deep learning-based visual object trackers have achieved state-of-the-art performance on several visual object tracking benchmarks. However, most tracking benchmarks focus on ground-level videos, whereas aerial tracking presents a new set of challenges. In this paper, we compare ten trackers based on deep learning techniques on four aerial datasets. We choose top-performing trackers that use different approaches, specifically tracking by detection, discriminative correlation filters, Siamese networks, and reinforcement learning. In our experiments, we use four benchmark datasets: a subset of the OTB2015 dataset with aerial-style videos; the UAV123 dataset without synthetic sequences; the UAV20L dataset, which contains 20 long sequences; and the DTB70 dataset. We compare the advantages and disadvantages of different trackers in different tracking situations encountered in aerial data. Our findings indicate that the trackers perform significantly worse on aerial datasets than on standard ground-level videos. We attribute this effect to smaller target size, camera motion, significant camera rotation with respect to the target, out-of-view movement, and clutter in the form of occlusions or similar-looking distractors near the tracked object.

Epileptic Seizure Prediction Using Deep Transformer Model

International Journal of Neural Systems ◽

10.1142/s0129065721500581 ◽

2021 ◽

Author(s):

Abhijeet Bhattacharya ◽

Tanmay Baweja ◽

S. P. K. Karri

Keyword(s):

Signal Processing ◽

Deep Learning ◽

False Positive Rate ◽

Superior Performance ◽

Seizure Prediction ◽

Advantages And Disadvantages ◽

Positive Rate ◽

Benchmark Datasets ◽

Automated Screening ◽

Transformer Model

The electroencephalogram (EEG) is the most promising and efficient technique for studying epilepsy and recording the electrical activity of the brain. Automated screening of epilepsy through data-driven algorithms reduces the manual workload of doctors diagnosing epilepsy. New algorithms are biased towards either signal processing or deep learning, each of which has its own advantages and disadvantages. The proposed pipeline is an end-to-end automated seizure prediction framework that blends signal processing and deep learning: Fourier transform feature extraction feeds a deep learning-based transformer model, which uses these features to automatically identify the attentive regions in EEG signals for effective screening. The proposed pipeline has demonstrated superior performance on the benchmark dataset, with average sensitivity and false-positive rate per hour (FPR/h) of 98.46%, 94.83% and 0.12439, 0, respectively. The proposed work shows strong results on the benchmark datasets and considerable potential for clinical use as a support system for medical experts monitoring patients.
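The two metrics reported above can be illustrated with a minimal sketch. It assumes the usual definitions (sensitivity as the fraction of seizures correctly predicted, FPR/h as false alarms per hour of seizure-free recording); the abstract does not spell out its exact counting protocol, and the function names and sample counts below are hypothetical.

```python
def sensitivity(predicted_seizures: int, total_seizures: int) -> float:
    """Fraction of seizures that the model correctly predicted."""
    if total_seizures <= 0:
        raise ValueError("total_seizures must be positive")
    return predicted_seizures / total_seizures


def false_positive_rate_per_hour(false_alarms: int, interictal_hours: float) -> float:
    """False alarms raised per hour of seizure-free (interictal) recording."""
    if interictal_hours <= 0:
        raise ValueError("interictal_hours must be positive")
    return false_alarms / interictal_hours


if __name__ == "__main__":
    # Made-up counts for illustration only; they are not taken from the paper.
    print(sensitivity(9, 10))
    print(false_positive_rate_per_hour(3, 24.0))
```

Reporting both numbers together matters because a model can reach high sensitivity simply by raising alarms often; FPR/h exposes that trade-off.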

Visual Object Multimodality Tracking Based on Correlation Filters for Edge Computing

Security and Communication Networks ◽

10.1155/2020/8891035 ◽

2020 ◽

Vol 2020 ◽

pp. 1-13

Author(s):

Guosheng Yang ◽

Qisheng Wei

Keyword(s):

Neural Network ◽

Deep Learning ◽

Target Position ◽

Correlation Filter ◽

Estimation Accuracy ◽

Visual Object ◽

Correlation Filters ◽

Data Set ◽

Hierarchical Processing ◽

Target Rotation

In recent years, visual object tracking has become a very active research field, mainly divided into correlation filter-based tracking and deep learning-based tracking (e.g., deep convolutional neural networks and Siamese neural networks). Tracking algorithms based on deep learning require a large amount of computation and are usually deployed on expensive graphics cards. However, with the many monitoring devices in the Internet of Things, it is difficult to capture all the moving targets on each device in real time. It is therefore necessary to process hierarchically: in insensitive areas, tracking based on correlation filtering alleviates the local computing pressure, while in sensitive areas the video stream is uploaded to a faster cloud computing platform that runs an algorithm based on deep features. In this paper, we focus mainly on correlation filter-based tracking. Among correlation filter-based trackers, the discriminative scale space tracker (DSST) is one of the most popular and representative, and it has been applied successfully in many fields. However, DSST still has aspects that need improvement. One is that the algorithm does not explicitly consider target rotation. The other is that extracting histogram of oriented gradient (HOG) features from many patches centered at the target position, which is needed to ensure scale estimation accuracy, imposes a heavy computational load. To address these two problems, we introduce an alterable patch number for target scale tracking and a space search for target rotation tracking into the standard DSST method. The resulting visual object multimodality tracker based on correlation filters (MTCF) copes simultaneously with translation, scale, and in-plane rotation of the tracked target and obtains the target's position, scale, and attitude angle at the same time. Finally, experiments on the Visual Tracker Benchmark data set show the effectiveness of the proposed algorithms in multimodality tracking.
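To make the cost argument concrete, here is a minimal sketch of how a DSST-style scale search enumerates candidate patch sizes around the current target size. It is not the MTCF implementation: the function names are hypothetical, and the default step of 1.02 is an assumed value of the kind commonly used with DSST. Each candidate patch needs its own HOG extraction, so the number of patches directly sets the scale-estimation cost.

```python
from typing import List, Tuple


def scale_factors(num_patches: int, step: float = 1.02) -> List[float]:
    """Return step**n for n symmetric around 0, one factor per candidate patch."""
    if num_patches < 1 or num_patches % 2 == 0:
        raise ValueError("num_patches must be a positive odd number")
    half = num_patches // 2
    return [step ** n for n in range(-half, half + 1)]


def candidate_patch_sizes(
    width: float, height: float, num_patches: int, step: float = 1.02
) -> List[Tuple[float, float]]:
    """Sizes of the patches, all centered at the target, to be scored by the scale filter."""
    return [(width * s, height * s) for s in scale_factors(num_patches, step)]


if __name__ == "__main__":
    # Fewer patches means fewer feature extractions but a coarser scale search.
    for n in (33, 17, 9):
        sizes = candidate_patch_sizes(64.0, 48.0, n)
        print(n, sizes[0], sizes[-1])
```

Making the patch count adjustable, as the abstract proposes, trades scale resolution or range for speed; how MTCF chooses the count is not stated in the abstract.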

Predictive Analysis of Cryptocurrency Price Using Deep Learning

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.27.17889 ◽

2018 ◽

Vol 7 (3.27) ◽

pp. 258 ◽

Cited By ~ 4

Author(s):

Yecheng Yao ◽

Jungho Yi ◽

Shengjun Zhai ◽

Yuwen Lin ◽

Taekseung Kim ◽

...

Keyword(s):

Deep Learning ◽

International Relations ◽

Short Term Memory ◽

Training Data ◽

Short Term ◽

Effective Learning ◽

Learning Techniques ◽

Benchmark Datasets ◽

Novel Method ◽

Long Short Term Memory

The decentralization of cryptocurrencies has greatly reduced the level of central control over them, impacting international relations and trade. Further, wide fluctuations in cryptocurrency price indicate an urgent need for an accurate way to forecast this price. This paper proposes a novel method to predict cryptocurrency price by considering various factors such as market cap, volume, circulating supply, and maximum supply, based on deep learning techniques such as the recurrent neural network (RNN) and the long short-term memory (LSTM), which are effective learning models for training data, with the LSTM being better at recognizing longer-term associations. The proposed approach is implemented in Python and validated on benchmark datasets. The results verify the applicability of the proposed approach for the accurate prediction of cryptocurrency price.

Network Attacks Detection Methods Based on Deep Learning Techniques: A Survey

Security and Communication Networks ◽

10.1155/2020/8872923 ◽

2020 ◽

Vol 2020 ◽

pp. 1-17

Author(s):

Yirui Wu ◽

Dabao Wei ◽

Jun Feng

Keyword(s):

Neural Network ◽

Deep Learning ◽

Attack Detection ◽

Detection Methods ◽

Generative Adversarial Network ◽

Network Attacks ◽

Adversarial Network ◽

Fifth Generation ◽

Learning Techniques ◽

Benchmark Datasets

With the development of fifth-generation networks and artificial intelligence technologies, new threats and challenges have emerged for wireless communication systems, especially in cybersecurity. In this paper, we review attack detection methods that draw on the strengths of deep learning techniques. Specifically, we first summarize the fundamental problems of network security and attack detection and introduce several successful related applications that use deep learning structures. Based on a categorization of deep learning methods, we pay special attention to attack detection methods built on different kinds of architectures, such as autoencoders, generative adversarial networks, recurrent neural networks, and convolutional neural networks. Afterwards, we present some benchmark datasets with descriptions and compare the performance of representative approaches to show the current state of attack detection methods based on deep learning structures. Finally, we summarize the paper and discuss some ways to improve the performance of attack detection using deep learning structures.

Heliport Detection Using Artificial Neural Networks

Photogrammetric Engineering & Remote Sensing ◽

10.14358/pers.86.9.541 ◽

2020 ◽

Vol 86 (9) ◽

pp. 541-546

Author(s):

Emre Başeski

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Deep Learning ◽

Learning Approaches ◽

Remote Sensing Images ◽

Advantages And Disadvantages ◽

Learning Techniques ◽

Critical Technology ◽

Artificial Neural ◽

Military Facilities

Automatic image exploitation is a critical technology for quick content analysis of high-resolution remote sensing images. The presence of a heliport in an image usually implies an important facility, such as a military facility. Therefore, detection of heliports can reveal critical information about the content of an image. In this article, two learning-based algorithms are presented that make use of artificial neural networks to detect H-shaped, light-colored heliports. The first algorithm is based on shape analysis of the heliport candidate segments using classical artificial neural networks. The second algorithm uses deep-learning techniques. While deep learning can solve difficult problems successfully, classical-learning approaches can be tuned easily to obtain fast and reasonable results. Therefore, although the main objective of this article is heliport detection, it also compares a deep-learning based approach with a classical learning-based approach and discusses the advantages and disadvantages of both techniques.

Predicting epitopes Based on TCR sequence using an embedding deep neural network artificial intelligence approach

10.1101/2021.08.11.455918 ◽

2021 ◽

Author(s):

Michel Edwar Mickael ◽

Norwin Kubick

Keyword(s):

Neural Networks ◽

T Cells ◽

Deep Learning ◽

Cell Receptor ◽

Large Momentum ◽

Main Function ◽

Learning Approaches ◽

Advantages And Disadvantages ◽

Learning Techniques ◽

Recursive Neural Networks

AI has gained considerable momentum in the field of T cell receptor (TCR) immunology. The TCR is a complex that is expressed on CD4+ T cells and CD8+ T cells. Its main function is to recognize antigens presented to T cells through either MHCI or MHCII. However, there are various knowledge gaps in classifying antigen affinity to MHC, epitope interactions with TCRs, and antigen immunogenicity. Deep learning is a type of machine learning that uses multiple layers of neural networks to increase prediction accuracy. There are different types of deep learning approaches, including autoencoders and recursive neural networks. The use of these two deep learning techniques to investigate TCR function has grown exponentially. In this review, we discuss the main aspects of using these networks to elucidate TCR function. We also compare various platforms that are capable of performing deep learning studies. Taken together, our review sheds more light on AI's ability to expand our knowledge of TCR interactions. It highlights the types, implementation techniques, and various advantages and disadvantages of these techniques.

Benchmarking Deep Learning for On-Board Space Applications

Remote Sensing ◽

10.3390/rs13193981 ◽

2021 ◽

Vol 13 (19) ◽

pp. 3981

Author(s):

Maciej Ziaja ◽

Piotr Bosowski ◽

Michal Myller ◽

Grzegorz Gajoch ◽

Michal Gumiela ◽

...

Keyword(s):

Deep Learning ◽

Experimental Validation ◽

State Of The Art ◽

Learning Algorithms ◽

Real Life ◽

Space Applications ◽

Deep Model ◽

Learning Techniques ◽

Standard Tool ◽

Benchmark Datasets

Benchmarking deep learning algorithms before deploying them in hardware-constrained execution environments, such as imaging satellites, is pivotal in real-life applications. Although a thorough and consistent benchmarking procedure can allow us to estimate the expected operational abilities of the underlying deep model, this topic remains under-researched. This paper tackles this issue and presents an end-to-end benchmarking approach for quantifying the abilities of deep learning algorithms in virtually any kind of on-board space applications. The experimental validation, performed over several state-of-the-art deep models and benchmark datasets, showed that different deep learning techniques may be effectively benchmarked using the standardized approach, which delivers quantifiable performance measures and is highly configurable. We believe that such benchmarking is crucial in delivering ready-to-use on-board artificial intelligence in emerging space applications and should become a standard tool in the deployment chain.

Object Recognition Using Deep Learning

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.8291 ◽

2019 ◽

Vol 16 (9) ◽

pp. 4044-4052 ◽

Cited By ~ 1

Author(s):

Rohini Goel ◽

Avinash Sharma ◽

Rajiv Kapoor

Keyword(s):

Neural Network ◽

Performance Evaluation ◽

Deep Learning ◽

Object Recognition ◽

Deep Neural Network ◽

Learning Approaches ◽

Learning Techniques ◽

Benchmark Datasets ◽

Learning Frameworks

Deep learning approaches have drawn much attention from researchers in the area of object recognition because of their inherent ability to overcome the shortcomings of classical approaches that depend on hand-crafted features. In the last few years, deep learning techniques have brought many advances in object recognition. This paper presents some recent and efficient deep learning frameworks for object recognition. An up-to-date study of recently developed deep neural network-based object recognition methods is presented. The various benchmark datasets that are used for performance evaluation are also discussed. The applications of the object recognition approach to specific types of objects (such as faces, buildings, and plants) are also highlighted. We conclude with the merits and demerits of existing methods and the future scope in this area.

Investigations of Object Detection in Images/Videos Using Various Deep Learning Techniques and Embedded Platforms—A Comprehensive Review

Applied Sciences ◽

10.3390/app10093280 ◽

2020 ◽

Vol 10 (9) ◽

pp. 3280 ◽

Cited By ~ 3

Author(s):

Chinthakindi Balaram Murthy ◽

Mohammad Farukh Hashmi ◽

Neeraj Dhanraj Bokde ◽

Zong Woo Geem

Keyword(s):

Deep Learning ◽

Object Detection ◽

Pedestrian Detection ◽

Detection Methods ◽

Art Object ◽

Current State ◽

Learning Techniques ◽

Specific Object ◽

Benchmark Datasets ◽

Speed Up

In recent years there has been remarkable progress in one computer vision application area: object detection. One of the most challenging and fundamental problems in object detection is locating a specific object among the multiple objects present in a scene. Earlier, traditional detection methods were used for detecting objects. With the introduction of convolutional neural networks, from 2012 onward, deep learning-based techniques were used for feature extraction, and that led to remarkable breakthroughs in this area. This paper presents a detailed survey of recent advancements and achievements in object detection using various deep learning techniques. Several topics are included, such as Viola–Jones (VJ), histogram of oriented gradient (HOG), one-shot and two-shot detectors, benchmark datasets, evaluation metrics, speed-up techniques, and current state-of-the-art object detectors. Detailed discussions of some important applications of object detection, including pedestrian detection, crowd detection, and real-time object detection on GPU-based embedded systems, are presented. Finally, we conclude by identifying promising future directions.

A Robust Visual Tracking Algorithm Based on Spatial-Temporal Context Hierarchical Response Fusion

Algorithms ◽

10.3390/a12010008 ◽

2018 ◽

Vol 12 (1) ◽

pp. 8 ◽

Cited By ~ 2

Author(s):

Wancheng Zhang ◽

Yanmin Luo ◽

Zhi Chen ◽

Yongzhao Du ◽

Daxin Zhu ◽

...

Keyword(s):

Visual Tracking ◽

Correlation Filter ◽

Temporal Context ◽

Visual Object ◽

Correlation Filters ◽

Visual Object Tracking ◽

Illumination Changes ◽

Model Update ◽

Benchmark Datasets ◽

Hierarchical Features

Discriminative correlation filters (DCFs) have been shown to perform very well in visual object tracking. However, visual tracking is still challenging when the target objects undergo complex scenarios such as occlusion, deformation, scale changes, and illumination changes. In this paper, we utilize the hierarchical features of convolutional neural networks (CNNs) and learn a spatial-temporal context correlation filter on convolutional layers. Then, the translation is estimated by fusing the response scores of the filters on the three convolutional layers. For scale estimation, we learn a discriminative correlation filter to estimate scale from the best confidence results. Furthermore, we propose a re-detection activation discrimination method to improve the robustness of visual tracking in the case of tracking failure and an adaptive model update method to reduce tracking drift caused by noisy updates. We evaluate the proposed tracker with DCFs and deep features on OTB benchmark datasets. The tracking results demonstrate that the proposed algorithm is superior to several state-of-the-art DCF methods in terms of accuracy and robustness.
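The fusion step described above can be sketched as a weighted sum of per-layer response maps followed by a peak search. This is only an illustration under stated assumptions: the abstract does not give the fusion rule or weights, so the weighted sum, the weight values, and the function names here are hypothetical, and all maps are assumed to have already been resized to the same grid.

```python
from typing import List, Sequence, Tuple

Grid = List[List[float]]


def fuse_responses(responses: Sequence[Grid], weights: Sequence[float]) -> Grid:
    """Weighted element-wise sum of same-sized response maps (one per layer)."""
    if not responses or len(responses) != len(weights):
        raise ValueError("need one weight per response map")
    rows, cols = len(responses[0]), len(responses[0][0])
    fused = [[0.0] * cols for _ in range(rows)]
    for response, weight in zip(responses, weights):
        for r in range(rows):
            for c in range(cols):
                fused[r][c] += weight * response[r][c]
    return fused


def peak_offset(fused: Grid) -> Tuple[int, int]:
    """Translation (d_row, d_col) of the strongest response relative to the map center."""
    rows, cols = len(fused), len(fused[0])
    cells = [(r, c) for r in range(rows) for c in range(cols)]
    best_r, best_c = max(cells, key=lambda rc: fused[rc[0]][rc[1]])
    return best_r - rows // 2, best_c - cols // 2


if __name__ == "__main__":
    def one_hot(r: int, c: int) -> Grid:
        grid = [[0.0] * 3 for _ in range(3)]
        grid[r][c] = 1.0
        return grid

    # Two layers agree on a shift of one column; the third disagrees.
    maps = [one_hot(1, 2), one_hot(1, 2), one_hot(0, 0)]
    print(peak_offset(fuse_responses(maps, [0.5, 0.5, 0.75])))
```

Combining layers this way lets agreement between layers outweigh a single layer's spurious peak, which is one plausible motivation for fusing hierarchical features.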
