In partial fulfillment of the Requirements for the Degree of
Master of Science
Bing Zheng
will defend his thesis
Web Information Archival System
Abstract
The World-Wide Web now contains about
800 million pages, encompassing about 6 terabytes of text data on about 3
million servers. New
technologies are urgently needed to capture and manage the overwhelming and
easily accessible Web information. This thesis presents a new kind of
client-side archival system, “Web Information Archival System,” or WiAS.
It is a program that uses local disk storage to archive the Web pages
containing useful information for future references. WiAS
is linked directly to the Microsoft Internet Explorer to simplify the archiving
process; it provides a windows explorer like interface to browse and organize
the archived information; it utilizes an underlying database management system
to handle the archive-related data; and most importantly, it introduces a
powerful housekeeping mechanism to automatically remove outdated information
from the archive, therefore, ensuring enough storage capacity to deal with the
continuously growing Web information.
Date: Wednesday,
August 21, 2002
Time: 2:00 PM
Place: 550-PGH
Faculty, students, and the general public are invited.
Thesis Advisor: Dr. Venkat Subramaniam