An Attention-Based Recommender System to Predict Contextual Intent Based on Choice Histories across and within Sessions

Ruo Huang; Shelby McIntyre; Meina Song; Haihong E; Zhonghong Ou

doi:10.3390/app8122426

An Attention-Based Recommender System to Predict Contextual Intent Based on Choice Histories across and within Sessions

Applied Sciences ◽

10.3390/app8122426 ◽

2018 ◽

Vol 8 (12) ◽

pp. 2426 ◽

Cited By ~ 2

Author(s):

Ruo Huang ◽

Shelby McIntyre ◽

Meina Song ◽

Haihong E ◽

Zhonghong Ou

Keyword(s):

Recommender Systems ◽

State Of The Art ◽

User Profile ◽

Vital Role ◽

Short Term ◽

Novel Approach ◽

Learning Techniques ◽

Real World Datasets ◽

Near Future

Recent years have witnessed the growth of recommender systems, with the help of deep learning techniques. Recurrent Neural Networks (RNNs) play an increasingly vital role in various session-based recommender systems, since they use the user’s sequential history to build a comprehensive user profile, which helps improve the recommendation. However, a problem arises regarding how to be aware of the variation in the user’s contextual preference, especially the short-term intent in the near future, and make the best use of it to produce a precise recommendation at the start of a session. We propose a novel approach named Attention-based Short-term and Long-term Model (ASLM), to improve the next-item recommendation, by using an attention-based RNNs integrating both the user’s short-term intent and the long-term preference at the same time with a two-layer network. The experimental study on three real-world datasets and two sub-datasets demonstrates that, compared with other state-of-the-art methods, the proposed approach can significantly improve the next-item recommendation, especially at the start of sessions. As a result, our proposed approach is capable of coping with the cold-start problem at the beginning of each session.

Download Full-text

A Novel LSTM Model with Interaction Dual Attention for Radar Echo Extrapolation

Remote Sensing ◽

10.3390/rs13020164 ◽

2021 ◽

Vol 13 (2) ◽

pp. 164

Author(s):

Chuyao Luo ◽

Xutao Li ◽

Yongliang Wen ◽

Yunming Ye ◽

Xiaofeng Zhang

Keyword(s):

Short Term Memory ◽

Weather Forecast ◽

Vital Role ◽

Data Sets ◽

Short Term ◽

Learning Techniques ◽

Radar Echo ◽

Hidden States ◽

Better Than

The task of precipitation nowcasting is significant in the operational weather forecast. The radar echo map extrapolation plays a vital role in this task. Recently, deep learning techniques such as Convolutional Recurrent Neural Network (ConvRNN) models have been designed to solve the task. These models, albeit performing much better than conventional optical flow based approaches, suffer from a common problem of underestimating the high echo value parts. The drawback is fatal to precipitation nowcasting, as the parts often lead to heavy rains that may cause natural disasters. In this paper, we propose a novel interaction dual attention long short-term memory (IDA-LSTM) model to address the drawback. In the method, an interaction framework is developed for the ConvRNN unit to fully exploit the short-term context information by constructing a serial of coupled convolutions on the input and hidden states. Moreover, a dual attention mechanism on channels and positions is developed to recall the forgotten information in the long term. Comprehensive experiments have been conducted on CIKM AnalytiCup 2017 data sets, and the results show the effectiveness of the IDA-LSTM in addressing the underestimation drawback. The extrapolation performance of IDA-LSTM is superior to that of the state-of-the-art methods.

Download Full-text

Where to Go Next: Modeling Long- and Short-Term User Preferences for Point-of-Interest Recommendation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5353 ◽

2020 ◽

Vol 34 (01) ◽

pp. 214-221 ◽

Cited By ~ 3

Author(s):

Ke Sun ◽

Tieyun Qian ◽

Tong Chen ◽

Yile Liang ◽

Quoc Viet Hung Nguyen ◽

...

Keyword(s):

State Of The Art ◽

User Preferences ◽

Short Term ◽

Preference Modeling ◽

Point Of Interest ◽

Proposed Model ◽

Poi Recommendation ◽

Novel Method ◽

Real World Datasets

Point-of-Interest (POI) recommendation has been a trending research topic as it generates personalized suggestions on facilities for users from a large number of candidate venues. Since users' check-in records can be viewed as a long sequence, methods based on recurrent neural networks (RNNs) have recently shown promising applicability for this task. However, existing RNN-based methods either neglect users' long-term preferences or overlook the geographical relations among recently visited POIs when modeling users' short-term preferences, thus making the recommendation results unreliable. To address the above limitations, we propose a novel method named Long- and Short-Term Preference Modeling (LSTPM) for next-POI recommendation. In particular, the proposed model consists of a nonlocal network for long-term preference modeling and a geo-dilated RNN for short-term preference learning. Extensive experiments on two real-world datasets demonstrate that our model yields significant improvements over the state-of-the-art methods.

Download Full-text

STG2Seq: Spatial-Temporal Graph to Sequence Model for Multi-step Passenger Demand Forecasting

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/274 ◽

2019 ◽

Cited By ~ 5

Author(s):

Lei Bai ◽

Lina Yao ◽

Salil S. Kanhere ◽

Xianzhi Wang ◽

Quan Z. Sheng

Keyword(s):

State Of The Art ◽

Demand Forecasting ◽

Short Term ◽

Demand Prediction ◽

On Demand ◽

Passenger Demand ◽

Temporal Correlations ◽

Output Module ◽

Real World Datasets

Multi-step passenger demand forecasting is a crucial task in on-demand vehicle sharing services. However, predicting passenger demand is generally challenging due to the nonlinear and dynamic spatial-temporal dependencies. In this work, we propose to model multi-step citywide passenger demand prediction based on a graph and use a hierarchical graph convolutional structure to capture both spatial and temporal correlations simultaneously. Our model consists of three parts: 1) a long-term encoder to encode historical passenger demands; 2) a short-term encoder to derive the next-step prediction for generating multi-step prediction; 3) an attention-based output module to model the dynamic temporal and channel-wise information. Experiments on three real-world datasets show that our model consistently outperforms many baseline methods and state-of-the-art models.

Download Full-text

A Review-Driven Neural Model for Sequential Recommendation

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/397 ◽

2019 ◽

Cited By ~ 5

Author(s):

Chenliang Li ◽

Xichuan Niu ◽

Xiangyang Luo ◽

Zhenzhong Chen ◽

Cong Quan

Keyword(s):

Performance Improvement ◽

State Of The Art ◽

Neural Model ◽

Sequential Patterns ◽

Short Term ◽

User Reviews ◽

Individual Level ◽

Significant Performance ◽

Real World Datasets

Writing review for a purchased item is a unique channel to express a user's opinion in E-Commerce. Recently, many deep learning based solutions have been proposed by exploiting user reviews for rating prediction. In contrast, there has been few attempt to enlist the semantic signals covered by user reviews for the task of collaborative filtering. In this paper, we propose a novel review-driven neural sequential recommendation model (named RNS) by considering user's intrinsic preference (long-term) and sequential patterns (short-term). In detail, RNS is devised to encode each user or item with the aspect-aware representations extracted from the reviews. Given a sequence of historical purchased items for a user, we devise a novel hierarchical attention over attention mechanism to capture sequential patterns at both union-level and individual-level. Extensive experiments on three real-world datasets of different domains demonstrate that RNS obtains significant performance improvement over uptodate state-of-the-art sequential recommendation models.

Download Full-text

PLASTIC: Prioritize Long and Short-term Information in Top-n Recommendation using Adversarial Training

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/511 ◽

2018 ◽

Cited By ~ 1

Author(s):

Wei Zhao ◽

Benyou Wang ◽

Jianbo Ye ◽

Yongqiang Gao ◽

Min Yang ◽

...

Keyword(s):

Reinforcement Learning ◽

Recommender Systems ◽

Real World ◽

Short Term ◽

Plastic Model ◽

The Real ◽

Adversarial Training ◽

Real World Datasets

Recommender systems provide users with ranked lists of items based on individual's preferences and constraints. Two types of models are commonly used to generate ranking results: long-term models and session-based models. While long-term models represent the interactions between users and items that are supposed to change slowly across time, session-based models encode the information of users' interests and changing dynamics of items' attributes in short terms. In this paper, we propose a PLASTIC model, Prioritizing Long And Short-Term Information in top-n reCommendation using adversarial training. In the adversarial process, we train a generator as an agent of reinforcement learning which recommends the next item to a user sequentially. We also train a discriminator which attempts to distinguish the generated list of items from the real list recorded. Extensive experiments show that our model exhibits significantly better performances on two widely used real-world datasets.

Download Full-text

Density Guarantee on Finding Multiple Subgraphs and Subtensors

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3446668 ◽

2021 ◽

Vol 15 (5) ◽

pp. 1-32

Author(s):

Quang-huy Duong ◽

Heri Ramampiaro ◽

Kjetil Nørvåg ◽

Thu-lan Dam

Keyword(s):

Lower Bound ◽

State Of The Art ◽

The State ◽

The Other ◽

Exact Methods ◽

Practical Solution ◽

Novel Approach ◽

Wide Range ◽

Real World Datasets ◽

Tensor Data

Dense subregion (subgraph & subtensor) detection is a well-studied area, with a wide range of applications, and numerous efficient approaches and algorithms have been proposed. Approximation approaches are commonly used for detecting dense subregions due to the complexity of the exact methods. Existing algorithms are generally efficient for dense subtensor and subgraph detection, and can perform well in many applications. However, most of the existing works utilize the state-or-the-art greedy 2-approximation algorithm to capably provide solutions with a loose theoretical density guarantee. The main drawback of most of these algorithms is that they can estimate only one subtensor, or subgraph, at a time, with a low guarantee on its density. While some methods can, on the other hand, estimate multiple subtensors, they can give a guarantee on the density with respect to the input tensor for the first estimated subsensor only. We address these drawbacks by providing both theoretical and practical solution for estimating multiple dense subtensors in tensor data and giving a higher lower bound of the density. In particular, we guarantee and prove a higher bound of the lower-bound density of the estimated subgraph and subtensors. We also propose a novel approach to show that there are multiple dense subtensors with a guarantee on its density that is greater than the lower bound used in the state-of-the-art algorithms. We evaluate our approach with extensive experiments on several real-world datasets, which demonstrates its efficiency and feasibility.

Download Full-text

Visions of Human Futures in Space and SETI

10.31235/osf.io/93nc2 ◽

2017 ◽

Author(s):

Jason T. Wright ◽

Michael P. Oman-Reagan

Keyword(s):

Solar System ◽

Science Fiction ◽

Dark Matter Particle ◽

Short Term ◽

National Science ◽

The Galaxy ◽

Near Future ◽

Far Future ◽

Made In

We discuss how visions for the futures of humanity in space and SETI are intertwined, and are shaped by prior work in the fields and by science fiction. This appears in the language used in the fields, and in the sometimes implicit assumptions made in discussions of them. We give examples from articulations of the so-called Fermi Paradox, discussions of the settlement of the Solar System (in the near future) and the Galaxy (in the far future), and METI. We argue that science fiction, especially the campy variety, is a significant contributor to the ‘giggle factor’ that hinders serious discussion and funding for SETI and Solar System settlement projects. We argue that humanity's long-term future in space will be shaped by our short-term visions for who goes there and how. Because of the way they entered the fields, we recommend avoiding the term ‘colony’ and its cognates when discussing the settlement of space, as well as other terms with similar pedigrees. We offer examples of science fiction and other writing that broaden and challenge our visions of human futures in space and SETI. In an appendix, we use an analogy with the well-funded and relatively uncontroversial searches for the dark matter particle to argue that SETI's lack of funding in the national science portfolio is primarily a problem of perception, not inherent merit.Also on arXiv: https://arxiv.org/abs/1708.05318Please cite this version:Wright, Jason T., and Michael P. Oman-Reagan. “Visions of Human Futures in Space and SETI.” International Journal of Astrobiology, 2017, 1–12. doi:10.1017/S1473550417000222.

Download Full-text

Reviewing Autoencoders for Missing Data Imputation: Technical Trends, Applications and Outcomes

Journal of Artificial Intelligence Research ◽

10.1613/jair.1.12312 ◽

2020 ◽

Vol 69 ◽

pp. 1255-1285

Author(s):

Ricardo Cardoso Pereira ◽

Miriam Seoane Santos ◽

Pedro Pereira Rodrigues ◽

Pedro Henriques Abreu

Keyword(s):

Missing Data ◽

Missing Values ◽

State Of The Art ◽

Data Imputation ◽

Tabular Data ◽

Missing Data Imputation ◽

Learning Techniques ◽

Real World Datasets ◽

And Training ◽

Machine Learning Models

Missing data is a problem often found in real-world datasets and it can degrade the performance of most machine learning models. Several deep learning techniques have been used to address this issue, and one of them is the Autoencoder and its Denoising and Variational variants. These models are able to learn a representation of the data with missing values and generate plausible new ones to replace them. This study surveys the use of Autoencoders for the imputation of tabular data and considers 26 works published between 2014 and 2020. The analysis is mainly focused on discussing patterns and recommendations for the architecture, hyperparameters and training settings of the network, while providing a detailed discussion of the results obtained by Autoencoders when compared to other state-of-the-art methods, and of the data contexts where they have been applied. The conclusions include a set of recommendations for the technical settings of the network, and show that Denoising Autoencoders outperform their competitors, particularly the often used statistical methods.

Download Full-text

Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6503 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9571-9578 ◽

Cited By ~ 1

Author(s):

Wei Zhang ◽

Yue Ying ◽

Pan Lu ◽

Hongyuan Zha

Keyword(s):

State Of The Art ◽

Natural Extension ◽

Target Image ◽

Short Term ◽

Image Representations ◽

High Level ◽

Image Descriptions ◽

Shed Light ◽

Image Caption

Personalized image caption, a natural extension of the standard image caption task, requires to generate brief image descriptions tailored for users' writing style and traits, and is more practical to meet users' real demands. Only a few recent studies shed light on this crucial task and learn static user representations to capture their long-term literal-preference. However, it is insufficient to achieve satisfactory performance due to the intrinsic existence of not only long-term user literal-preference, but also short-term literal-preference which is associated with users' recent states. To bridge this gap, we develop a novel multimodal hierarchical transformer network (MHTN) for personalized image caption in this paper. It learns short-term user literal-preference based on users' recent captions through a short-term user encoder at the low level. And at the high level, the multimodal encoder integrates target image representations with short-term literal-preference, as well as long-term literal-preference learned from user IDs. These two encoders enjoy the advantages of the powerful transformer networks. Extensive experiments on two real datasets show the effectiveness of considering two types of user literal-preference simultaneously and better performance over the state-of-the-art models.

Download Full-text

Learning from Interventions Using Hierarchical Policies for Safe Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i06.6602 ◽

2020 ◽

Vol 34 (06) ◽

pp. 10352-10360

Author(s):

Jing Bi ◽

Vikas Dhiman ◽

Tianyou Xiao ◽

Chenliang Xu

Keyword(s):

Reaction Time ◽

State Of The Art ◽

The State ◽

Policy Framework ◽

Asymptotic Performance ◽

Short Term ◽

Learning From Demonstrations ◽

Hierarchical Levels ◽

Long Term Behavior

Learning from Demonstrations (LfD) via Behavior Cloning (BC) works well on multiple complex tasks. However, a limitation of the typical LfD approach is that it requires expert demonstrations for all scenarios, including those in which the algorithm is already well-trained. The recently proposed Learning from Interventions (LfI) overcomes this limitation by using an expert overseer. The expert overseer only intervenes when it suspects that an unsafe action is about to be taken. Although LfI significantly improves over LfD, the state-of-the-art LfI fails to account for delay caused by the expert's reaction time and only learns short-term behavior. We address these limitations by 1) interpolating the expert's interventions back in time, and 2) by splitting the policy into two hierarchical levels, one that generates sub-goals for the future and another that generates actions to reach those desired sub-goals. This sub-goal prediction forces the algorithm to learn long-term behavior while also being robust to the expert's reaction time. Our experiments show that LfI using sub-goals in a hierarchical policy framework trains faster and achieves better asymptotic performance than typical LfD.

Download Full-text