BDGMM: The 1st IEEE International Workshop on Big Data Governance and Metadata and Management

Call for Papers

Big Data is a collection of data so large, so complex, so distributed, and growing so fast (or 5Vs- volume, variety, velocity, veracity, and value). It has been known for unlocking new sources of economic values, providing fresh insights into sciences, and assisting on policy making. However, Big Data is not practically consumable until it can be aggregated and integrated into a manner that a computer system can process. For instance, in the Internet of Things (IoT) environment, there is a great deal of variation in the hardware, software, coding methods, terminologies and nomenclatures used among the data generation systems. Given the variety of data locations, formats, structures and access policies, data aggregation has been extremely complex and difficult. More specifically, a health researcher was interested in finding answers to a series of questions, such as “How is the gene ‘myosin light chain 2’ associated with the chamber type hypertrophic cardiomyopathy? What is the similarity to a subset of the genes’ features? What are the potential connections among pairs of genes”? To answer these questions, one may retrieve information from databases he knows, such as the NCBI Gene database or PubMed database. In the Big Data era, it is highly likely that there are other repositories also storing the relevant data. Thus, we are wondering

  • Is there an approach to manage such big data, so that a single search engine available to obtain all relevant information drawn from a variety of data sources and to act as a whole?
  • How do we know if the data provided is related to the information contained in our study?

To achieve this objective, we need a mechanism to help us describe a digital source so well that allows it to be understood by both human and machine. Metadata is “data about data”. It is descriptive information about a particular dataset, object or resource, including how it is formatted, and when and by whom it is collected. With those information, the finding of and the working with particular instances of Big Data would become easier. Besides, the Big Data must be managed effectively. This has partially manifested in data models a.k.a. “NoSQL”. The goal of this multidisciplinary workshop is to gather both researchers and practitioners to discuss methodological, technical and standard aspects for Big Data management. Papers describing original research on both theoretical and practical aspects of metadata for Big Data management are solicited.

Scope of the Workshop

Topics include, but are not limited to:

  • Metadata standard(s) development for Big Data management
  • Methodologies, architecture and tools for metadata annotation, discovery, and interpretation
  • Case study on metadata standard development and application
  • Metadata interoperability (crosswalk)
  • Metadata and Data Privacy
  • Metadata for Semantic Webs
  • Human Factors on Metadata
  • Innovations in Big Data management
  • Opportunities in standardizing Big Data management
  • Digital object architectures and infrastructures for Big Data management
  • Best practices and standard based persistent identifiers, data types registry structures and representations for Big Data management
  • Query languages and ontology in Big Data
  • NoSQL databases and Schema-less data modeling
  • Multimodal resource and workload management
  • Availability, reliability and Fault tolerance
  • Frameworks for parallel and distributed information retrieval
  • Domain standardization for Big Data management
  • Big Data governance for data integrity, quality, provenance, retention, asset management, and business intelligence

In addition to the accepted papers, the workshop intends to have an industry focus through a keynote speaker and hackathon challenges. The hackathon session will explore interoperable data infrastructure for Big Data Governance and Metadata Management that is scalable and can enable the Findability, Accessibility, Interoperability, and Reusability between heterogeneous datasets from various domains without worrying about data source and structure.

Paper Submission Site

Deadlines and due dates are available on the Important Dates page.

Paper templates and additional information for authors is available on the Information for Authors page.

The BDGMM Workshop is also sponsoring the special session, Hackathon: 24 hours on Data Mashup (Varieties Problem) Big Data Analytics.

BDGMM Workshop Organizers

General Co-Chairs

Wo Chang, National Institute of Standards and Technology, USA
Convenor, ISO/IEC JTC 1/WG 9 Working Group on Big Data
Chair, IEEE Big Data Governance and Metadata Management

Mahmoud Daneshmand, Stevens Institute of Technology, USA
Co-Chair, IEEE Big Data Governance and Metadata Management
Co-founder, IEEE BDIs

Program Co-Chairs

Kathy Grise, Senior Program Director, Future Directions, IEEE Technical Activities, USA

Claire Austin, Big Data Research Scientist, S&T Strategies,Environment & Climate Change Canada, Canada

Publicity Chairs

Cherry Tom, Emerging Technologies Intelligence Manager, IEEE Standards Association

BDGMM Program Committee

Frederic Andres, National Institute of Informatics, Japan
Paventhan Arumugam, ERNET, India
Claire Austin, S&T Strategies, Environment & Climate Change Canada, Canada
Ismael Caballero, UCLM, Spain
Yue-Shan Chang, National Central University, Taiwan
Periklis Chatzimisios, Alexander TEI of Thessaloniki, Greece
Hung-Ming Chen, National Taichung University of Science & Technology, Taiwan
Miyuru Dayarathna, WSO2 Inc., Sri Lanka
Jacob Dilles, Acuant Corp., USA
Robert Hsu, Chung Hua University, Taiwan
Wei Hu, Nanjing University, China
Carson Leung, University of Manitoba, Canada
Sian Lun Lau, Sunway University, Malaysia
Christian Camilo Urcuqui Lóepz, Icesi University, Columbia
Neil Miller, The Bioinformatics for Children’s Mercy Hospital, USA
Jinghua Min, China Electronic Cyberspace Great Wall Co., Ltd., China
Carlos Monroy, Rice University, USA
Huangsheng Ning, USTB, China
Arindam Pal, TCS Research, India
Lijun Qian, Prairie View A&M University, USA
Weining Qianx, East China Normal University, China
Yufei Ren, IBM, USA
Robby Robson, Eduworks Corporation, USA
Angelo Simone Scotto, European Food Safety Authority, Italy
Priyaa Thavasimani, Newcastle University, UK
Alex Thomo, University of Victoria, Canada
Chongang Want, InterDigital Communications, USA
Jianwu Wang, University of Maryland, Baltimore County, USA
Shu-Lin Wang, National Taichung University of Science & Technology, Taiwan
Jens Weber, University of Victoria, Canada
Lingfei Wu, IBM Research, USA
Hao Xu, University of North Carolina at Chapel Hill, USA
Godwin Yeboah, University of Warwick, UK
Tim Zimmerlin, Automation Technologies, USA