This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

World Wide Web Robot for Extreme Datamining with Swiss-Tx Supercomputers

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
A.S.A. Roehrl
M. Frey
R.A. Roehrl

Additional information is available for the following registered author(s):

Abstract

This paper discusses the software and hardware issues of designing a highly parallel robot for extreme datamining on the Internet. As a sample application, a World Wide Web server count experiment for Switzerland and Thailand is presented. Our platform of choice is the SwissTx, a supercomputer built from commodity components that runs NT and COMPAQ tru64 UNIX. Hardware and software of this machine are discussed and benchmark results presented. They show that NT is a feasible choice even under the given extreme conditions. Using statistical modelling for optimizing the search process, the inevitable bandwidth problem is reduced to some extent to a computation problem. We suggest that our approach to Web robots is a robust bet for a multitude of future Internet applications which might lead to a large-scale and cost-efficient usage of Web robots.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.iiasa.ac.at/Publications/Documents/IR-99-020.pdf
File Format: application/pdf
File Function:
Download Restriction: no
File URL: http://www.iiasa.ac.at/Publications/Documents/IR-99-020.ps
File Format: application/postscript
File Function:
Download Restriction: no

Publisher Info
Paper provided by International Institute for Applied Systems Analysis in its series Working Papers with number ir99020.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: Jun 1999
Date of revision:
Handle: RePEc:wop:iasawp:ir99020

Contact details of provider:
Postal: A-2361 Laxenburg
Phone: +43-2236-807-0
Fax: +43-2236-71313
Email:
Web page: http://www.iiasa.ac.at/Publications/Catalog/PUB_ONLINE.html
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Thomas Krichel).

Related research
Keywords:

This paper has been announced in the following NEP Reports:

Statistics
Access and download statistics

Did you know? About 2700 working paper series are listed on RePEc.

This page was last updated on 2009-12-2.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.