Research
Seminars
Department Series
[an error occurred while processing this directive]
Reliable Web-crawling for estimating node indegree in a Web-graph |
|
Dr. Viv Cothey Post-Doctoral Researcher School of Computing and IT Wolverhampton University Wolverhampton, UK |
|
The principles of Web-crawling appear well known but practice receives little attention. Yet understanding the practice of Web-crawling becomes important when the object of study is the resulting Web-graph and its structural properties. Web-crawling can be regarded as a sampling procedure where different selection methods can be adopted. For example in "content-crawling" the goal is to sample the Web's content in order to support Web content discovery. However it can be seen that the heuristics used by content-crawlers do not reliably represent the Web's link structure.
Different Web-crawling heuristics have been used to construct an alternative Web link-crawler that is designed to reliably sample the Web's link structure. This work is part of the European Union WISER project and is a work in progress.
In this presentation I discuss:
Viv is a Postdoctoral research fellow in the Statistical Cybermetrics Research Group in the School of Computing and IT, University of Wolverhampton. He completed his Ph.D. from the University of Bristol in 2002 in Web information strategies.
[an error occurred while processing this directive]