Provenance in Data Science

Provenance in Data Science
Author: Leslie F. Sikos,Oshani W. Seneviratne,Deborah L. McGuinness
Publsiher: Springer Nature
Total Pages: 110
Release: 2021-04-26
Genre: Computers
ISBN: 9783030676810

Download Provenance in Data Science Book in PDF, Epub and Kindle

RDF-based knowledge graphs require additional formalisms to be fully context-aware, which is presented in this book. This book also provides a collection of provenance techniques and state-of-the-art metadata-enhanced, provenance-aware, knowledge graph-based representations across multiple application domains, in order to demonstrate how to combine graph-based data models and provenance representations. This is important to make statements authoritative, verifiable, and reproducible, such as in biomedical, pharmaceutical, and cybersecurity applications, where the data source and generator can be just as important as the data itself. Capturing provenance is critical to ensure sound experimental results and rigorously designed research studies for patient and drug safety, pathology reports, and medical evidence generation. Similarly, provenance is needed for cyberthreat intelligence dashboards and attack maps that aggregate and/or fuse heterogeneous data from disparate data sources to differentiate between unimportant online events and dangerous cyberattacks, which is demonstrated in this book. Without provenance, data reliability and trustworthiness might be limited, causing data reuse, trust, reproducibility and accountability issues. This book primarily targets researchers who utilize knowledge graphs in their methods and approaches (this includes researchers from a variety of domains, such as cybersecurity, eHealth, data science, Semantic Web, etc.). This book collects core facts for the state of the art in provenance approaches and techniques, complemented by a critical review of existing approaches. New research directions are also provided that combine data science and knowledge graphs, for an increasingly important research topic.

Provenance and Annotation of Data and Processes

Provenance and Annotation of Data and Processes
Author: Khalid Belhajjame,Ashish Gehani,Pinar Alper
Publsiher: Springer
Total Pages: 272
Release: 2018-09-05
Genre: Computers
ISBN: 9783319983790

Download Provenance and Annotation of Data and Processes Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the 7th International Provenance and Annotation Workshop, IPAW 2018, held in London, UK, in July 2018. The 12 revised full papers, 19 poster papers, and 2 demonstration papers presented were carefully reviewed and selected from 50 submissions. The papers feature a variety of provenance-related topics ranging from the capture and inference of provenance to its use and application.They are organized in topical sections on reproducibility; modeling, simulating and capturing provenance; PROV extensions; scientific workflows; applications; and system demonstrations.

Active Conceptual Modeling of Learning

Active Conceptual Modeling of Learning
Author: Peter P. Chen,Leah Y. Wong
Publsiher: Springer
Total Pages: 227
Release: 2008-01-04
Genre: Computers
ISBN: 9783540775034

Download Active Conceptual Modeling of Learning Book in PDF, Epub and Kindle

This volume is a collection of papers presented during the first International ACM-L Workshop, which was held in Tucson, Arizona, during the 25th International Conference on Conceptual Modeling, ER 2006. Included in this state-of-the-art survey are 11 revised full papers, carefully reviewed and selected from the workshop presentations. These are rounded off with four invited lectures and an introductory overview, and represent the current thinking in conceptual modeling research.

Encyclopedia of Database Systems

Encyclopedia of Database Systems
Author: Ling Liu,M. Tamer Özsu
Publsiher: Unknown
Total Pages: 135
Release: 2024
Genre: Database management
ISBN: 148997993X

Download Encyclopedia of Database Systems Book in PDF, Epub and Kindle

Principles of Data Integration

Principles of Data Integration
Author: AnHai Doan,Alon Halevy,Zachary Ives
Publsiher: Elsevier
Total Pages: 522
Release: 2012-06-25
Genre: Computers
ISBN: 9780123914798

Download Principles of Data Integration Book in PDF, Epub and Kindle

Principles of Data Integration is the first comprehensive textbook of data integration, covering theoretical principles and implementation issues as well as current challenges raised by the semantic web and cloud computing. The book offers a range of data integration solutions enabling you to focus on what is most relevant to the problem at hand. Readers will also learn how to build their own algorithms and implement their own data integration application. Written by three of the most respected experts in the field, this book provides an extensive introduction to the theory and concepts underlying today's data integration techniques, with detailed, instruction for their application using concrete examples throughout to explain the concepts. This text is an ideal resource for database practitioners in industry, including data warehouse engineers, database system designers, data architects/enterprise architects, database researchers, statisticians, and data analysts; students in data analytics and knowledge discovery; and other data professionals working at the R&D and implementation levels. Offers a range of data integration solutions enabling you to focus on what is most relevant to the problem at hand Enables you to build your own algorithms and implement your own data integration applications

Provenance in Databases

Provenance in Databases
Author: James Cheney,Laura Chiticariu,Wang-Chiew Tan
Publsiher: Now Publishers Inc
Total Pages: 111
Release: 2009-06-02
Genre: Computers
ISBN: 9781601982322

Download Provenance in Databases Book in PDF, Epub and Kindle

Reviews research over the past ten years on why, how, and where provenance, clarifies the relationships among these notions of provenance, and describes some of their applications in confidence computation, view maintenance and update, debugging, and annotation propagation

Encyclopedia of Big Data

Encyclopedia of Big Data
Author: Laurie A. Schintler,Connie L. McNeely
Publsiher: Springer
Total Pages: 0
Release: 2022-02-23
Genre: Business & Economics
ISBN: 3319320092

Download Encyclopedia of Big Data Book in PDF, Epub and Kindle

This encyclopedia will be an essential resource for our times, reflecting the fact that we currently are living in an expanding data-driven world. Technological advancements and other related trends are contributing to the production of an astoundingly large and exponentially increasing collection of data and information, referred to in popular vernacular as “Big Data.” Social media and crowdsourcing platforms and various applications ― “apps” ― are producing reams of information from the instantaneous transactions and input of millions and millions of people around the globe. The Internet-of-Things (IoT), which is expected to comprise tens of billions of objects by the end of this decade, is actively sensing real-time intelligence on nearly every aspect of our lives and environment. The Global Positioning System (GPS) and other location-aware technologies are producing data that is specific down to particular latitude and longitude coordinates and seconds of the day. Large-scale instruments, such as the Large Hadron Collider (LHC), are collecting massive amounts of data on our planet and even distant corners of the visible universe. Digitization is being used to convert large collections of documents from print to digital format, giving rise to large archives of unstructured data. Innovations in technology, in the areas of Cloud and molecular computing, Artificial Intelligence/Machine Learning, and Natural Language Processing (NLP), to name only a few, also are greatly expanding our capacity to store, manage, and process Big Data. In this context, the Encyclopedia of Big Data is being offered in recognition of a world that is rapidly moving from gigabytes to terabytes to petabytes and beyond. While indeed large data sets have long been around and in use in a variety of fields, the era of Big Data in which we now live departs from the past in a number of key respects and with this departure comes a fresh set of challenges and opportunities that cut across and affect multiple sectors and disciplines, and the public at large. With expanded analytical capacities at hand, Big Data is now being used for scientific inquiry and experimentation in nearly every (if not all) disciplines, from the social sciences to the humanities to the natural sciences, and more. Moreover, the use of Big Data has been well established beyond the Ivory Tower. In today’s economy, businesses simply cannot be competitive without engaging Big Data in one way or another in support of operations, management, planning, or simply basic hiring decisions. In all levels of government, Big Data is being used to engage citizens and to guide policy making in pursuit of the interests of the public and society in general. Moreover, the changing nature of Big Data also raises new issues and concerns related to, for example, privacy, liability, security, access, and even the veracity of the data itself. Given the complex issues attending Big Data, there is a real need for a reference book that covers the subject from a multi-disciplinary, cross-sectoral, comprehensive, and international perspective. The Encyclopedia of Big Data will address this need and will be the first of such reference books to do so. Featuring some 500 entries, from "Access" to "Zillow," the Encyclopedia will serve as a fundamental resource for researchers and students, for decision makers and leaders, and for business analysts and purveyors. Developed for those in academia, industry, and government, and others with a general interest in Big Data, the encyclopedia will be aimed especially at those involved in its collection, analysis, and use. Ultimately, the Encyclopedia of Big Data will provide a common platform and language covering the breadth and depth of the topic for different segments, sectors, and disciplines.

Provenance and Annotation of Data and Process

Provenance and Annotation of Data and Process
Author: Deborah L. McGuinness,James R. Michaelis,Luc Moreau
Publsiher: Springer Science & Business Media
Total Pages: 318
Release: 2011-01-04
Genre: Business & Economics
ISBN: 9783642178184

Download Provenance and Annotation of Data and Process Book in PDF, Epub and Kindle

The 7 revised full papers, 11 revised medium-length papers, 6 revised short, and 7 demo papers presented together with 10 poster/abstract papers describing late-breaking work were carefully reviewed and selected from numerous submissions. Provenance has been recognized to be important in a wide range of areas including databases, workflows, knowledge representation and reasoning, and digital libraries. Thus, many disciplines have proposed a wide range of provenance models, techniques, and infrastructure for encoding and using provenance. The papers investigate many facets of data provenance, process documentation, data derivation, and data annotation.