• Dr. Milind Bhandarkar, Chief Architect of Greenplum Labs, EMC will give a talk titled "OpenChorus : Building a Toolchest for Big Data Analytics" as part of the industry session
  • C-Big 2012 will feature a best paper award (The paper will be selected by the program committee).
  • Prof. Christos Faloutsos from Carnegie Mellon University will give the keynote talk at C-Big 2012.
  • Selected papers from the workshop will be invited for a special issue in Elsevier's Information Systems journal
  • To comply with the new deadline of CollaborateCom we extend the paper submission deadline of C-Big 2012 until Aug. 15.

Call for Papers

The ability to collect, integrate, and analyze data from large number of diverse data sources has increased the amount of data collected and processed by individual organizations on the order of several tens to hundreds of TB. This data can be efficiently utilized for better decision making, improved business intelligence, as well as for enabling new knowledge and services.

Collaborative generation and utilization of large quantities of data, either through crowdsourcing or through efforts of different organizations and groups creates notable research opportunities. The potential benefits of Collaborative Big Data, as well as new and unexpected challenges are still emerging. Managing, processing, and making sense of this data pose new challenges in storage, networking, database management, data mining, knowledge discovery, information security and privacy.

C-Big 2012 brings together researchers and practitioners from around the world to share their experiences on creating, managing, and handling Collaborative Big Data and its benefits.

The authors of selected papers from the workshop will be invited to submit extended versions of the papers to a special issue in Elsevier's Information Systems journal.

Topics of Interest

Topics of interest include (but are not limited to):

Important Dates

Paper Submission

Submitted manuscripts should closely reflect the final papers as they will appear in the Proceedings, and should not exceed 8 pages in two-column IEEE proceeding format. We urge the authors to prepare their papers according to the Latex or Microsoft Word templates found at IEEE Author Digital Tool Box, under the "Template for Transactions" section.

All papers are refereed through a single blind process. All papers must be submitted online.


All submitted papers will be rigorously reviewed. All accepted papers will be made available in IEEE Xplore and external indexing services (DBLP database, ZB1Math/CompuServe, IO-Port, EI, Scopus, INSPEC, ISI proceeding - pending approval).


General Co-Chairs

Program Co-Chairs

Technical Program Committee

Tentative Program

Sunday, October 14, 2012
8:00 AM – 8:45 AM Registration
8:50 AM – 9:00 AM Opening Remarks
9:00 AM – 10:00 AM Session 1: Keynote Address
  • Mining Billion-Node Graphs - Patterns and Scalable Algorithms (slides)
    Prof. Christos Faloutsos
10:00 AM – 10:15 PM Coffee Break
10:15 AM – 12:15 PM Session 2: Industry Session
  • OpenChorus : Building a Toolchest for Big Data Analytics (slides)
    Milind Bhandarkar, Chief Architect of Greenplum Labs, EMC
  • Real-time analytics using Vertica Analytics Platform (slides)
    Shilpa Lawande and Stephen Walkauskas, Vertica Group, HP
12:15 PM – 1:30 PM Lunch
1:30 PM – 3:00 PM Session 3: Algorithms
  • Using a Distributed Search Engine to Identify Optimal Product Sets for Use in an Outbreak Detection System
    Ruhsary Rexit, Fuchiang Tsui, Jeremy Espino, Sahawut Wesaratchakit, Ye Ye and Panos Chrysanthis
  • Towards Efficient Query Processing on Massive Time-Evolving Graphs (slides)
    Arash Fard, Amir Abdolrashidi, Lakshmish Ramaswamy and John A. Miller
  • Delta Extraction Optimization for View Maintenance in a Limited Collaborative Environment
    Zheng Lu, Haijun Liu and Peter Hyland
3:00 PM – 3:30 PM Coffee Break
3:30 PM – 5:00 PM Session 4: Architecture and Applications
  • Biologically-inspired Network “Memory” for Smarter Networking
    Bassem Mokhtar and Mohamed Eltoweissy
  • Efficient Processing of Models for Large-scale Shotgun Proteomics Data (slides)
    Himanshu Grover and Vanathi Gopalakrishnanp
  • Robust Decision Engineering: Collaborative Big Data and its Application to International Development/Aid
    Steve Chan, Wesley Rhodes, Charles Atencio, Brent Ranalli, Anna Miao, Simone Sala, Stephen Serene, Sarah Rumbley, Lisa Sokol, Loren Gary, Caroline Kuo and Marc Clement


Keynote Speaker: Prof. Christos Faloutsos, Carnegie Mellon University.

Mining Billion-Node Graphs - Patterns and Scalable Algorithms


How do graphs look like? How do they evolve over time? How do rumors and viruses propagate on real graphs? We review some static and temporal 'laws', fast algorithms to spot deviations and outliers, and recent developments on virus propagation and scalable tensor analysis.


Prof. Faloutsos, ACM Fellow, has received the Presidential Young Investigator Award from the National Science Foundation (1989), the Research Contributions Award in ICDM 2006, the SIGKDD Innovations Award (2010), eighteen "best paper" awards (including two "test of time" awards), and four teaching awards. He has served as a member of the executive committee of SIGKDD, he has published over 200 refereed articles, 11 book chapters, and one monograph. He holds six patents and he has given over 30 tutorials and over 10 invited distinguished lectures. His research interests include data mining for graphs and streams, fractals, database performance, and indexing for multimedia and bio-informatics data. He is ranked among the top 50 nurturers in information technology.