• No results found

Peer-to-peer information retrieval

N/A
N/A
Protected

Academic year: 2021

Share "Peer-to-peer information retrieval"

Copied!
1
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

DOCTORAL ABSTRACT

Peer-to-Peer Information Retrieval

Almer S. Tigelaar

Database Group, Faculty of Electrical

Engineering, Mathematics and Computer Science,

University of Twente, P.O. Box 217, 7500 AE,

Enschede, The Netherlands.

a.s.tigelaar@utwente.nl

T

he Internet has become an integral part of our daily lives. However, the essential task of finding information is dominated by a handful of large centralised search engines. In this thesis we study an alternative to this approach. Instead of using large data centres, we propose using the machines that we all use every day: our desktop, laptop and tablet computers, to build a peer web search engine. We provide a definition of the associated research field: peer-to-peer information retrieval. We examine what separates it from related fields, give an overview of the work done so far and provide an economic perspective on peer-to-peer search. Furthermore, we introduce our own architecture for peer-to-peer search systems, inspired by BitTorrent.

Distributing the task of providing search results for queries introduces the problem of query routing: a query needs to be sent to a peer that can provide relevant search results. We investigate how the content of peers can be represented so that queries can be directed to the best ones in terms of relevance. While cooperative peers can provide their own representation, the content of uncooperative peers can be accessed only through a search interface and thus they can not actively provide a description of themselves. We look into representing these uncooperative peers by probing their search interface to construct a representation. Finally, the capacity of the machines in peer-to-peer networks differs considerably, making it challenging to provide search results quickly. To address this, we present an approach where copies of search results for previous queries are retained at peers and used to serve future requests and show participation can be incentivised using reputations.

There are still problems to be solved before a real-world peer-to-peer web search engine can be built. This thesis provides a starting point for this ambitious goal and also provides a solid basis for reasoning about peer-to-peer information retrieval systems in general.

Thesis links:

http://almer.tigelaar.net/2012/09/25/peer-to-peer-information-retrieval/ http://dx.doi.org/10.3990/1.9789036534000

Supervisor: Peter Apers (University of Twente, NL), Assistant Supervisor: Djoerd Hiemstra (University of Twente, NL). Committee: Jamie Callan (Carnegie Mellon University, US), Fabio Crestani (Universit`a della Svizzera Italiana, CH), Arjen de Vries (Delft University of Technology, NL), Franciska de Jong (University of Twente, NL), Dirk Heylen (University of Twente, NL), Johan Pouwelse (Delft University of Technology, NL).

Defended: Wednesday, September 26th 2012 at the University of Twente, Enschede, The Netherlands.

Referenties

GERELATEERDE DOCUMENTEN

In kolom vier, antwoorden afkomstig uit enquête 1, is goed te zien dat studenten aan het begin van de cursus een grote verscheidenheid laten zien in de kwaliteiten die zij

H2: The level of absorptive capacity of an energy incumbent positively moderates the relationship between green acquisitions and firm performance, such that incumbents with

It also presupposes some agreement on how these disciplines are or should be (distinguished and then) grouped. This article, therefore, 1) supplies a demarcation criterion

PARTICIPATIE GEZONDHEID VEILIGHEID DRIE PREVENTIENIVEAUS pagina 19 GEWENSTE SITUATIE MENSEN ZONDER BEKENDE RISICOFACTOR(EN) / PROBLEEM MENSEN MET. RISICOFACTOR(EN) MENSEN MET

We research two peer reviews in the OECD, namely the WGB and the Economic and Development Review Committee (EDRC); two UN peer reviews, namely the UPR of human rights and

In this paper, we presented a new method to obtain a fully gap-free time series of gridded daily surface temperature from MODIS collection 6 LST products (tiled MOD11A1/MYD11A1 data

Enkele van de benodigde gegevens kunnen niet volgens de eerste methode worden verzameld, daar zij van dynamische aard zijn.. Hiervoor wordt dan de tweede methode

Party political competition could be strengthened if a majority in the directly elected European Parliament would have stronger control over legislative decision-making in