Integrating Data

Integrating Data, by Bill Inmon, Patty Haines, and David Rapien

Overcome the challenges, appreciate the varieties, and apply the process of data integration.

Bill Inmon, the “father of the data warehouse,” has written 60 books published in nine languages. ComputerWorld named Bill one of the ten most influential people in the history of the computer profession.

Topics

Chapter 1: Integration

Inaccuracy of data
Lack of integration
Spider web systems
Reasons for complexity
Transformation of data
Summary

Chapter 2: Integrating Structured Data

Silos of data
Types of integration
Transforming data
Summary

Chapter 3: Integrating Textual Data

Components of textual integration
Textual data architecture
Preparing textual data for analytics
Performing analytics on textual data
Summary

Chapter 4: Mechanics of Integration

Summary

Chapter 5: Combining Structured and Textual Data

An intersection of data
Universal common connectors
Summary

Chapter 6: A Project Plan for Integrating Structured Data

Step 1: Scope
Step 2: Model
Step 3: Map
Step 4: Create a central pool of shared data
Advantages of the plan
Summary

Chapter 7: A Project Plan for Integrating Textual Data

Step 1: Select the scope
Step 2: Find ontologies/taxonomies
Step 3: Load the taxonomies
Step 4: Ingest raw text
Step 5: Determining analytical processes
An iterative process
Summary

Chapter 8: Integration Best Practices

Aim for true data integration
Identify the fans of data integration
Determine the data integration roles
Stress the benefits of data integration
Deploy a reusable process for new sources
Update data often
Define milestones
Summary

Chapter 9: Taxonomies and the Data Model

Taxonomies and ontologies
The purpose of data models and taxonomies
Data model and taxonomy differences
Summary

Chapter 10: Data Science and Integration

Levels of commonality
Analog/IoT data
Summary

Chapter 11: Documentation and Integration

Documentation components
Summary

Chapter 12: An Example of Integration

A merger
Challenges
Structured data
Textual data
Summary

Chapter 13: Integration Considerations

Plan
Educate
Management support

Learn all about data integration and become a data integration hero instead of following the masses and running in the opposite direction at the mere mention of the word “integration.” Understand why organizations avoid data integration and often wind up with spider web environments containing siloed applications instead of an enterprise database that excites analysts and data scientists.

Distinguish the different types of integration: database, attribute, key, index, encoding, measurement, format, definition, KPI, calculations, summarization, selection criteria, data exclusion, lineage, and timing. Apply identification, equivocation, and physical conversion levels of integration for both structured and textual data. Leverage deidentification, proximity analysis, alternate spelling, stop word resolution, homographic resolution, stemming, taxonomical resolution, inline contextualization, classification, and acronym resolution. Learn how to combine structured and textual data in the context of three levels of interaction.

Follow the steps of scope, model, and map in integrating structured data. Follow the steps of scope, connect taxonomies, ingest raw text, and determine analytical processes in integrating textual data. Apply integration best practices, including identifying integration roles, developing a reusable data integration process, and documenting the integration benefits. Compare taxonomies with data models. Know how data integration helps data science.

To reinforce all of the concepts within the book, we include a detailed case study on data integration.

About Patty and Dave

Patty Haines is a senior advisor and data practitioner who provides expertise in managing and integrating data, ensuring that data is transformed and available to deliver value to the business community.

David Rapien is an Associate Professor – Educator of Information Systems and Business Analytics at the University of Cincinnati’s Lindner College of Business.
