June 15 - 17, 2003   
Wequassett Inn, Cape Cod   
Chatham, Massachusetts   
NSF/JISC Workshop
 
General
Welcome
Background
Agenda
References
Important Dates
Participants List
   
OUTREACH
China 2004
Bangalore 2005
   
For Contributors
Call For Papers
Papers
Breakout Reports
Final Report
Opening Plenary Session
Supplementary Contributions
   
For Participants
Expense Form
Accommodation
Tourist info
Travel
   
Organization
Sponsors
Contacts
   

 

   
Papers  
   
Towards Building Digital Library as an Institution of Knowledge
 
   
   
NSF Post-DL Futures Workshop Short Paper
Hsinchun Chen, Professor, University of Arizona
Download: PDF Version    WORD Version
  1. What are we trying to do? What is the problem we’re trying to solve?

    Digital library research aims to develop the engineering and science for generating, capturing, accessing, and utilizing data, information, and knowledge in various digital formats, for a variety of applications, and in a global, collaborative human and system network.

  2. How is it done today, and what are the limitations of current practice?

    Data and information are captured and represented in various digital formats and are proliferating rapidly. However, the techniques for accessing data and information are rudimentary and imprecise, mostly based on simple keyword indexes, relational queries, and/or low-level image or audio features (i.e., research results of the 70s and 80s).

    Digital library should become an institution for capturing and transferring human knowledge, instead of simply for accessing data and information.

  3. What is new in our approach/technology, and why do we think it will be successful? What gives evidence that it will work?

    A knowledge discovery approach, based on both top-down knowledge creation (e.g., ontologies, subject headings, user modeling) and bottom-up automated knowledge extraction (e.g., data mining, text mining, web mining), promises to help transfer digital library from an institution of data and information to an institution of knowledge.

    The rapid proliferation of digital data and information, the increased sophistication and robustness in machine learning and data/text/web mining techniques, and the ever-increasing power of modern networks and computers will help contribute to the new science of digital library.

  4. Assuming we are successful, what difference does it make?

    Digital libraries will soon become a ubiquitous, global knowledge resource for education, training, and (international) collaboration. Its impacts will be felt in all aspects of human activities, from industries to governments, and from education to research.

  5. How long will it take, how much will it cost, and what are the (measurable) milestones, mid-term, and final exams?

    Many fundamental technical, social, and policy issues remain to be developed and researched. It calls for a long-term (ten-year), systematic research agenda to develop digital libraries into an institution of human knowledge. A long-term NSF research program with an annual funding level of $10M-$30M is needed to promote the field and develop the science.

    In 3-5 years (mid-term), we expect to see significant advancement in knowledge creation for various digital library applications and domains based on more advanced algorithmic techniques and proper human-computer interaction principles. Instead of accessing low-level, fragmented data and information pieces, high-level, abstract and decision-relevant knowledge will be accessed in a seamless manner.

    In 8-10 years (final exams), we expect the systematic science and theory for digital library knowledge creation to be developed and validated. We will be able to access human knowledge in a multi-lingual, multi-media, mobile, and semantics-based digital library knowledge network.