Data Architecture: A Primer for the Data Scientist - 2nd Edition - ISBN: 9780128169162

Data Architecture: A Primer for the Data Scientist

2nd Edition

Big Data, Data Warehouse and Data Vault

Authors: W.H. Inmon, Dan Linstedt, Mary Levins
Paperback ISBN: 9780128169162
Imprint: Academic Press
Published Date: 1st June 2019
Page Count: 450

The first edition of Data Architecture was written more than five years ago. Since then, the concept of Big Data has matured, data science has grown exponentially, and data architecture has become a standard part of organizational decision making. Throughout all this change, the basic principles that shape the architecture of data have remained the same. There remains a need for people to look at the “bigger picture” and to understand where their data fits into the grand scheme of things.

The Second Edition of Data Architecture: A Primer for the Data Scientist addresses the larger architectural picture of how Big Data fits within the existing information infrastructure or data warehousing systems. This is an essential topic not only for data scientists, analysts, and managers but also for researchers and engineers who increasingly need to deal with large and complex sets of data. Until data is gathered and can be placed into an existing framework or architecture, it cannot be used to its full potential. Drawing upon years of practical experience and using numerous examples and case studies from across various industries, the authors seek to explain this larger picture into which Big Data fits, giving data scientists the necessary context for how the pieces of the puzzle should fit together.

Key Features

  • Reviews the exponential growth of Big Data integration and applications across industries – from healthcare to finance
  • Places new emphasis on end state architecture as a lens for understanding the architecture of big data
  • Explains how Big Data fits within an existing systems environment, as well as the value of data transformation and redundancy
  • Includes new chapters on data lakes, ponds, and landing zones; IoT and edge computing; and data modelling and taxonomies


Data analysts, data managers, researchers, and engineers who need to deal with large and complex sets of data; master's-level students in data analytics programs

Table of Contents

  1. Introduction to architecture
  2. “Diagram of the world”, end state architecture
  3. Transformation and redundancy
  4. Big Data
  5. Siloed applications
  6. Data vault
  7. Data lake, ponds, landing zone
  8. IoT, Edge computing
  9. Operational environment
  10. The evolution of data architecture
  11. Repetitive data, the sandbox
  12. Non-repetitive data, contextualization
  13. Operational performance
  14. Integration of data
  15. Personal computing
  16. Managing text, taxonomies
  17. System of record
  18. The intellectual roadmap – data modelling, taxonomies, etc.
  19. Business value across the architecture
  20. Virtualization, streaming
  21. The end of evolution


© Academic Press 2019

About the Authors

W.H. Inmon

Best known as the “Father of Data Warehousing,” Bill Inmon has become the most prolific and well-known author worldwide in the big data analysis, data warehousing, and business intelligence arena. In addition to authoring more than 50 books and 650 articles, Bill has been a monthly columnist with the Business Intelligence Network, EIM Institute, and Data Management Review. In 2007, Bill was named by Computerworld as one of the “Ten IT People Who Mattered in the Last 40 Years” of the computer profession. With 35 years of experience in database technology and data warehouse design, he is known globally for his seminars on developing data warehouses and information architectures. Bill has been an in-demand keynote speaker for numerous computing associations, industry conferences, and trade shows. Bill also has an extensive entrepreneurial background: he founded Pine Cone Systems (later renamed Ambeo) in 1995, and founded and took public Prism Solutions in 1991. Bill consults with a large number of Fortune 1000 clients and leading IT executives on data warehousing, business intelligence, and database management. He offers data warehouse design and database management services, and produces methodologies and technologies that advance the enterprise architectures of large and small organizations worldwide. He has worked for American Management Systems and Coopers & Lybrand. Bill received his Bachelor of Science degree in Mathematics from Yale University and his Master of Science degree in Computer Science from New Mexico State University.

Affiliations and Expertise

Inmon Data Systems, Castle Rock, CO, USA

Dan Linstedt

Dan has more than 25 years of experience in the Data Warehousing and Business Intelligence field and is internationally known for inventing the Data Vault 1.0 model and the Data Vault 2.0 System of Business Intelligence. He helps business and government organizations around the world to achieve BI excellence by applying his proven knowledge in Big Data, unstructured information management, agile methodologies, and product development. He has held training classes and presented at TDWI, Teradata Partners, DAMA, Informatica, Oracle user groups, and the Data Modeling Zone conference. He has a background in SEI/CMMI Level 5 and has contributed architecture efforts to petabyte-scale data warehouses. He also offers high-quality online training and consulting services for Data Vault.

Affiliations and Expertise

Founder and Principal of Empowered Holdings, LLC, St. Albans, VT, USA

Mary Levins

Mary Levins is recognized as a leader in Data Governance, with over 20 years of experience working with organizations to bring value through data strategies that drive business results. Mary has a BS and MS in Industrial Engineering, and her experience spans many industries, including manufacturing, healthcare, energy/utilities, automotive, electronics, and financial (including Consumer Credit Bureaus and Credit Unions). Today, Mary is the founder of Sierra Creek Consulting, a specialized firm delivering Data Governance, Data Management, and Data Solutions to help companies bring value through data.

Affiliations and Expertise

Sierra Creek Consulting LLC, Dacula, GA, USA
