Burstiness in Query Log: Web Search Analysis by Combining Global and Local Evidences

Enhancing Web Search through Query Log Mining

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch083 ◽

2011 ◽

pp. 438-442

Author(s):

Ji-Rong Wen

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Web Mining ◽

Web Search ◽

Information Source ◽

Query Log ◽

Additional Information ◽

Query Logs ◽

Query Log Mining ◽

The Web

Web query log is a type of file keeping track of the activities of the users who are utilizing a search engine. Compared to traditional information retrieval setting in which documents are the only information source available, query logs are an additional information source in the Web search setting. Based on query logs, a set of Web mining techniques, such as log-based query clustering, log-based query expansion, collaborative filtering and personalized search, could be employed to improve the performance of Web search.

Download Full-text

ObSecure Logging (OSLo): A Framework to Protect and Evaluate the Web Search Privacy in Health Care Domain

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2019.2708 ◽

2019 ◽

Vol 9 (6) ◽

pp. 1181-1190 ◽

Cited By ~ 1

Author(s):

Mohib Ullah ◽

Muhammad Arshad Islam ◽

Rafiullah Khan ◽

Muhammad Aleem ◽

Muhammad Azhar Iqbal

Keyword(s):

Web Search ◽

Medical Information ◽

Privacy Preserving ◽

Exposure Level ◽

Query Log ◽

Search Queries ◽

Internet Users ◽

Privacy Analysis ◽

The Web ◽

Better Than

Users around the world send queries to the Web Search Engine (WSE) to retrieve data from the Internet. Users usually take primary assistance relating to medical information from WSE via search queries. The search queries relating to diseases and treatment is contemplated to be the most personal facts about the user. The search queries often contain identifiable information that can be linked back to the originator, which can compromise the privacy of a user. In this work, we are proposing a distributed privacy-preserving protocol (OSLo) that eliminates limitation in the existing distributed privacy-preserving protocols and a framework, which evaluates the privacy of a user. The OSLo framework asses the local privacy relative to the group of users involved in forwarding query to the WSE and the profile privacy against the profiling of WSE. The privacy analysis shows that the local privacy of a user directly depends on the size of the group and inversely on the number of compromised users. We have performed experiments to evaluate the profile privacy of a user using a privacy metric Profile Exposure Level. The OSLo is simulated with a subset of 1000 users of the AOL query log. The results show that OSLo performs better than the benchmark privacy-preserving protocol on the basis of privacy and delay. Additionally, results depict that the privacy of a user depends on the size of the group.

Download Full-text

Exploiting Query’s Temporal Patterns for Query Autocompletion

Mathematical Problems in Engineering ◽

10.1155/2017/7490879 ◽

2017 ◽

Vol 2017 ◽

pp. 1-8 ◽

Cited By ~ 2

Author(s):

Danyang Jiang ◽

Honghui Chen ◽

Fei Cai

Keyword(s):

Fourier Transform ◽

Web Search ◽

Temporal Patterns ◽

Cyclic Behavior ◽

Current Time ◽

Query Log ◽

Sudden Rise ◽

Query Logs ◽

Interactive Feature ◽

Ranking Query

Query autocompletion (QAC) is a common interactive feature of web search engines. It aims at assisting users to formulate queries and avoiding spelling mistakes by presenting them with a list of query completions as soon as they start typing in the search box. Existing QAC models mostly rank the query completions by their past popularity collected in the query logs. For some queries, their popularity exhibits relatively stable or periodic behavior while others may experience a sudden rise in their query popularity. Current time-sensitive QAC models focus on either periodicity or recency and are unable to respond swiftly to such sudden rise, resulting in a less optimal QAC performance. In this paper, we propose a hybrid QAC model that considers two temporal patterns of query’s popularity, that is, periodicity and burst trend. In detail, we first employ the Discrete Fourier Transform (DFT) to identify the periodicity of a query’s popularity, by which we forecast its future popularity. Then the burst trend of query’s popularity is detected and incorporated into the hybrid model with its cyclic behavior. Extensive experiments on a large, real-world query log dataset infer that modeling the temporal patterns of query popularity in the form of its periodicity and its burst trend can significantly improve the effectiveness of ranking query completions.

Download Full-text

Temporal query log profiling to improve web search ranking

Proceedings of the 19th ACM international conference on Information and knowledge management - CIKM '10 ◽

10.1145/1871437.1871583 ◽

2010 ◽

Cited By ~ 2

Author(s):

Alexander Kotov ◽

Pranam Kolari ◽

Lei Duan ◽

Yi Chang

Keyword(s):

Web Search ◽

Query Log ◽

Temporal Query ◽

Web Search Ranking

Download Full-text

Privacy in Web Search Query Log Mining

Machine Learning and Knowledge Discovery in Databases - Lecture Notes in Computer Science ◽

10.1007/978-3-642-04180-8_4 ◽

2009 ◽

pp. 4-4 ◽

Cited By ~ 2

Author(s):

Rosie Jones

Keyword(s):

Web Search ◽

Search Query ◽

Query Log ◽

Log Mining ◽

Query Log Mining

Download Full-text

Enhancing Web Search through Query Log Mining

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch117 ◽

2011 ◽

pp. 758-763 ◽

Cited By ~ 2

Author(s):

Ji-Rong Wen

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Web Mining ◽

Web Search ◽

Information Source ◽

Query Log ◽

Additional Information ◽

Query Logs ◽

Query Log Mining ◽

The Web

Web query log is a type of file keeping track of the activities of the users who are utilizing a search engine. Compared to traditional information retrieval setting in which documents are the only information source available, query logs are an additional information source in the Web search setting. Based on query logs, a set of Web mining techniques, such as log-based query clustering, log-based query expansion, collaborative filtering and personalized search, could be employed to improve the performance of Web search.

Download Full-text

Multi-Agent-Based Information Retrieval System Using Information Scent in Query Log Mining for Effective Web Search

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch012 ◽

2018 ◽

pp. 266-291

Author(s):

Suruchi Chawla

Keyword(s):

Information Retrieval ◽

Web Search ◽

Multi Agent System ◽

Information Need ◽

Agent System ◽

Query Log ◽

Information Scent ◽

Multi Agent ◽

Log Mining ◽

Query Log Mining

This chapter explains the multi-agent system for effective information retrieval using information scent in query log mining. The precision of search results is low due to difficult to infer the information need of the small size search query and therefore information need of the user is not satisfied effectively. Information Scent is used for modeling the information need of user web search session and clustering is performed to identify the similar information need sessions. Hyper Link-Induced Topic Search (HITS) is executed on clusters to generate the Hubs and authorities for web page recommendations to users who search with similar intents. This multi-agent system based on clustered query sessions uses query operations like expansion and recommendation to infer the information need of user search queries and recommends Hubs and authorities for effective web search.

Download Full-text

Analysis of a very large web search engine query log

ACM SIGIR Forum ◽

10.1145/331403.331405 ◽

1999 ◽

Vol 33 (1) ◽

pp. 6-12 ◽

Cited By ~ 572

Author(s):

Craig Silverstein ◽

Hannes Marais ◽

Monika Henzinger ◽

Michael Moricz

Keyword(s):

Search Engine ◽

Web Search ◽

Query Log ◽

Web Search Engine

Download Full-text

Query log driven web search results clustering

Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval - SIGIR '14 ◽

10.1145/2600428.2609583 ◽

2014 ◽

Cited By ~ 12

Author(s):

Jose G. Moreno ◽

Gaël Dias ◽

Guillaume Cleuziou

Keyword(s):

Web Search ◽

Query Log ◽

Search Results ◽

Search Results Clustering

Download Full-text

Quantitative Common Sense Estimation System and its Application for Membership Function Generation

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2014.p0856 ◽

2014 ◽

Vol 18 (5) ◽

pp. 856-864 ◽

Cited By ~ 1

Author(s):

Yuta Hayakawa ◽

◽

Masafumi Hagiwara

Keyword(s):

Membership Function ◽

Common Sense ◽

Web Search ◽

Estimation Methods ◽

Web Pages ◽

Threshold Values ◽

Generation System ◽

Function Generation ◽

Estimation System ◽

Global And Local

Systems capable of autonomous thinking are sometimes required to cope with unanticipated situations. An important issue in this context is knowledge – especially common sense – acquisition. In this paper, we propose novel quantitative common sense estimation methods and apply them to an automatic membership function generation system. Our proposed system estimates threshold values corresponding tolargeandsmallfor various kinds of objectattribute sets to form membership functions, where it attempts to relate each object to its corresponding impression. Two methods are proposed in this paper. The first, Method-1, obtains data from the top 1,000 snippets through a web search and estimates the global and local tendencies by clustering them. The second, Method-2, uses the number of hits from a web search together with parts of the results obtained through Method-1. In addition, we devise several techniques to eliminate unnecessary information in the retrieved web pages. We also carried out experiments that verified the effectiveness of our proposed methods and the method combining those two.

Download Full-text