April 17 - 19, 2007   
Hyatt Regency Phoenix   
Phoenix, Arizona   

 

Breakout Groups
( As of: April 25, 2007 )

 
Data Driven Scholarship  

Some academic leaders are beginning to recognize that data-driven science is becoming a new scientific paradigm – ranking with theory, experimentation, and computational science.  As yet, fewer people appreciate that the combination of large-scale digitization of books, scholarly journals online, and huge data sets provides opportunities for new methodologies for scholarship and research in all academic disciplines.  Is this really a fourth paradigm of science or is it new wine in old bottles?  How can the community articulate the importance of this area, so that university presidents (in the US) and vice-chancellors (in Britain) understand the potential and challenges?

 
   
  1. Fran Berman, The Need for Formalized Trust in Digital Repository Collaborative Infrastructure
  2. Paolo Galluzzi
  3. Steve Griffin
  4. Michael Nelson, I Don’t Know and I Don’t Care
  5. Malcolm Read
  6. Jürgen Renn, From Research Challenges of the Humanities to the Epistemic Web (Web 3.0)
  7. Alexander Szalay
  8. Howard Wactlar
 
   
Scale and Complexity  

The complexity of information is largely uncorrelated with the scale of the phenomena of interest.  What are the opportunities and challenges to managing scale and complexity through automation and the potential barriers to be overcome?  Data-driven scholarship is technically difficult.  Many of the collections are huge by any standards – hundreds of terabytes today even petabytes.  Moreover, they often have complex internal structure.  Organizations such as the National Virtual Observatory, the Internet Archive, and the Shoah Foundation have demonstrated the challenges in reconciling these two parameters, scale and complexity, particularly when the research questions to be asked of the data are not known in advance.  Recently there has been interest in studying the various workflows that are encountered in managing large collections of digital information.  Is this a good framework for planning?

 
   
  1. Bill Arms, Repositories for Large-scale Digital Libraries
  2. Matthew Dovey
  3. Jerry Goldman, Complexity and scale in audio archives
  4. Andrew Gruen (observer)
  5. Babak Hamidzadeh, Scale: A repository challenge
  6. Mark Liberman
  7. Brian Schottlaender, The Need for Formalized Trust in Digital Repository Collaborative Infrastructure
  8. Abby Smith, Thoughts on Scale and Complexity
 
   
Organizations  

What are the issues confronting organizations in a new digital world (public good, sustainability, …)? Collaboration, cooperation, and standards are needed to exploit heterogeneous sources of data, but the difficulties of cooperation are often ignored and the benefits often fall short of expectations.  Many types of organization have expertise in some aspects of data-driven scholarship: research centers, libraries, supercomputing centers, archives, Internet companies, and so on.  But in almost every instance such expertise is incidental to the major expertise of the organization.  What is the role of these organizations and how might they change?  We anticipate that new hybrid organizations will emerge.  What is the role of government agencies, such as the NSF and JISC, in stimulating such developments?

 
   
  1. Laura Campbell, How Digital Technologies Have Changed the Library of Congress: Inside and Outside
  2. Dave Cook
  3. Ian Dolphin
  4. Rick Luce, eDatabase Lessons for an eData World
  5. Peter Murray-Rust, Data-driven science - a scientist's view
  6. David Rosenthal, Engineering Issues In Preserving Large Databases
  7. Eric Van de Velde, Workshop on Data-Driven Science & Scholarship: Organizations
  8. Brad Wheeler
 
   
Individuals  

What are the enabling conditions, both human and technical (workflow, transparency, interoperability, standards,), for wide adoption by individuals? Large-scale developments in data-driven science and scholarship depend on the enthusiasm of individuals.  Recent years have seen rapid changes in the behavior of researchers in some matters (e.g., adoption of email, dissemination of research papers and data from personal web sites), and strong resistance to change in others (e.g., conservatism in publishing practices, low contributions to institutional repositories).  What are the barriers and incentives to change and how can the NSF and JISC influence them?

 
   
  1. Linda Frueh, Access Tools: Bridging Individuals to Information
  2. Ken Hamma, Professionally Indisposed to Change
  3. Mark Kornbluh
  4. Ron Larsen
  5. Janet Murray, Genre creation as cognition and collective knowledge making
  6. Joyce Ray, Discussion Group on Individual Users
  7. Beth Stewart, NEH and Digital Humanities
  8. Norman Wiseman
 
   
Scholarly Communications  

This is a period of change in scholarly communication.  While social forces are resisting fundamental changes in the process of peer-reviewed publication, other areas are changing rapidly. Researchers are publishing their work on private web sites or discipline repositories, such as arXiv.org.  Conventional indexes are being supplemented by and even eclipsed by web search engines and tools such as Google Scholar. Mass digitization of printed books is creating huge volumes of data to be stored online and new partnerships between libraries and corporations.  What, if anything, should the NSF and JISC do to steer the developments of new repositories for scholarly communication?

 
   
  1. Sheila Anderson
  2. Sayeed Choudhury, The Relationship between Data and Scholarly Communication
  3. Bas Cordewener, Institutional Repositories in the Netherlands, a national and international perspective
  4. Greg Crane, Repositories, Cyberinfrastructure and the Humanities
  5. Carl Lagoze
  6. Don Waters, Doing much more than we have so far attempted