Gipp, B; Taylor, A; Beel, J. 2010. Link Proximity Analysis - Clustering Websites by Examining Link Proximity. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES 6273: 449-452

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Fri Apr 1 14:18:29 EDT 2011


Gipp, B; Taylor, A; Beel, J. 2010. Link Proximity Analysis - Clustering Websites 
by Examining Link Proximity. RESEARCH AND ADVANCED TECHNOLOGY FOR 
DIGITAL LIBRARIES 6273: 449-452. edited by Lalmas, M; Jose, J; Rauber, A; 
Sebastiani, F; Frommholz, I.presented at 14th European Conference on 
Research and Advanced Technology for Digital Libraries in Glasgow, SCOTLAND, 
SEP 06-10, 2010.

Author Full Name(s): Gipp, Bela; Taylor, Adriana; Beel, Joeran
Book series title: Lecture Notes in Computer Science
Language: English
Document Type: Proceedings Paper

Author Keywords: Web page; Website; clustering; Network Analysis; Link 
Analysis; Citation Proximity Analysis
KeyWords Plus: COCITATION

Abstract: This research-in-progress paper presents a new approach called Link 
Proximity Analysis (LPA) for identifying related web pages based on link 
analysis. In contrast to current techniques, which ignore intra-page link 
analysis, the one put forth here examines the relative positioning of links to 
each other within websites. The approach uses the fact that a clear correlation 
between the proximity of links to each other and the subject-relatedness of 
the linked websites can be observed on nearly every web page. By statistically 
analyzing this relationship and measuring the amount of sentences, paragraphs, 
etc. between two links, related websites can be automatically, identified as a 
first study has proven.

Addresses: [Gipp, Bela; Taylor, Adriana; Beel, Joeran] UC Berkeley, Berkeley, 
CA USA
E-mail Address: gipp at berkeley.edu; aitaylor at berkeley.edu; beel at berkeley.edu
ISSN: 0302-9743
ISBN: 978-3-642-15463-8
pre-print PDF: http://www.sciplore.org/publications/2010-
Link_Proximity_Analysis_-_preprint.pdf



More information about the SIGMETRICS mailing list