Data Architecture: A Primer for the Data Scientist

Big Data, Data Warehouse and Data Vault
Author: W.H. Inmon,Dan Linstedt
Publisher: Morgan Kaufmann
ISBN: 0128020911
Category: Computers
Page: 378
View: 9520

Continue Reading →

Today, the world is trying to create and educate data scientists because of the phenomenon of Big Data. And everyone is looking deeply into this technology. But no one is looking at the larger architectural picture of how Big Data needs to fit within the existing systems (data warehousing systems). Taking a look at the larger picture into which Big Data fits gives the data scientist the necessary context for how pieces of the puzzle should fit together. Most references on Big Data look at only one tiny part of a much larger whole. Until data gathered can be put into an existing framework or architecture it can’t be used to its full potential. Data Architecture a Primer for the Data Scientist addresses the larger architectural picture of how Big Data fits with the existing information infrastructure, an essential topic for the data scientist. Drawing upon years of practical experience and using numerous examples and an easy to understand framework. W.H. Inmon, and Daniel Linstedt define the importance of data architecture and how it can be used effectively to harness big data within existing systems. You’ll be able to: Turn textual information into a form that can be analyzed by standard tools. Make the connection between analytics and Big Data Understand how Big Data fits within an existing systems environment Conduct analytics on repetitive and non-repetitive data Discusses the value in Big Data that is often overlooked, non-repetitive data, and why there is significant business value in using it Shows how to turn textual information into a form that can be analyzed by standard tools. Explains how Big Data fits within an existing systems environment Presents new opportunities that are afforded by the advent of Big Data Demystifies the murky waters of repetitive and non-repetitive data in Big Data

Building a Scalable Data Warehouse with Data Vault 2.0


Author: Dan Linstedt,Michael Olschimke
Publisher: Morgan Kaufmann
ISBN: 0128026480
Category: Computers
Page: 684
View: 1770

Continue Reading →

The Data Vault was invented by Dan Linstedt at the U.S. Department of Defense, and the standard has been successfully applied to data warehousing projects at organizations of different sizes, from small to large-size corporations. Due to its simplified design, which is adapted from nature, the Data Vault 2.0 standard helps prevent typical data warehousing failures. "Building a Scalable Data Warehouse" covers everything one needs to know to create a scalable data warehouse end to end, including a presentation of the Data Vault modeling technique, which provides the foundations to create a technical data warehouse layer. The book discusses how to build the data warehouse incrementally using the agile Data Vault 2.0 methodology. In addition, readers will learn how to create the input layer (the stage layer) and the presentation layer (data mart) of the Data Vault 2.0 architecture including implementation best practices. Drawing upon years of practical experience and using numerous examples and an easy to understand framework, Dan Linstedt and Michael Olschimke discuss: How to load each layer using SQL Server Integration Services (SSIS), including automation of the Data Vault loading processes. Important data warehouse technologies and practices. Data Quality Services (DQS) and Master Data Services (MDS) in the context of the Data Vault architecture. Provides a complete introduction to data warehousing, applications, and the business context so readers can get-up and running fast Explains theoretical concepts and provides hands-on instruction on how to build and implement a data warehouse Demystifies data vault modeling with beginning, intermediate, and advanced techniques Discusses the advantages of the data vault approach over other techniques, also including the latest updates to Data Vault 2.0 and multiple improvements to Data Vault 1.0

Super Charge Your Data Warehouse

Invaluable Data Modeling Rules to Implement Your Data Vault
Author: Dan Linstedt,Kent Graziano
Publisher: CreateSpace
ISBN: 9781463778682
Category: Computers
Page: 126
View: 3977

Continue Reading →

Do You Know If Your Data Warehouse Flexible, Scalable, Secure and Will It Stand The Test Of Time And Avoid Being Part Of The Dreaded "Life Cycle"? The Data Vault took the Data Warehouse world by storm when it was released in 2001. Some of the world's largest and most complex data warehouse situations understood the value it gave especially with the capabilities of unlimited scaling, flexibility and security. Here is what industry leaders say about the Data Vault "The Data Vault is the optimal choice for modeling the EDW in the DW 2.0 framework" - Bill Inmon, The Father of Data Warehousing "The Data Vault is foundationally strong and an exceptionally scalable architecture" - Stephen Brobst, CTO, Teradata "The Data Vault should be considered as a potential standard for RDBMS-based analytic data management by organizations looking to achieve a high degree of flexibility, performance and openness" - Doug Laney, Deloitte Analytics Institute "I applaud Dan's contribution to the body of Business Intelligence and Data Warehousing knowledge and recommend this book be read by both data professionals and end users" - Howard Dresner, From the Foreword - Speaker, Author, Leading Research Analyst and Advisor You have in your hands the work, experience and testing of 2 decades of building data warehouses. The Data Vault model and methodology has proven itself in hundreds (perhaps thousands) of solutions in Insurance, Crime-Fighting, Defense, Retail, Finance, Banking, Power, Energy, Education, High-Tech and many more. Learn the techniques and implement them and learn how to build your Data Warehouse faster than you have ever done before while designing it to grow and scale no matter what you throw at it. Ready to "Super Charge Your Data Warehouse"?

Scalable Big Data Architecture

A practitioners guide to choosing relevant Big Data architecture
Author: Bahaaldine Azarmi
Publisher: Apress
ISBN: 1484213262
Category: Computers
Page: 141
View: 5529

Continue Reading →

This book highlights the different types of data architecture and illustrates the many possibilities hidden behind the term "Big Data", from the usage of No-SQL databases to the deployment of stream analytics architecture, machine learning, and governance. Scalable Big Data Architecture covers real-world, concrete industry use cases that leverage complex distributed applications , which involve web applications, RESTful API, and high throughput of large amount of data stored in highly scalable No-SQL data stores such as Couchbase and Elasticsearch. This book demonstrates how data processing can be done at scale from the usage of NoSQL datastores to the combination of Big Data distribution. When the data processing is too complex and involves different processing topology like long running jobs, stream processing, multiple data sources correlation, and machine learning, it’s often necessary to delegate the load to Hadoop or Spark and use the No-SQL to serve processed data in real time. This book shows you how to choose a relevant combination of big data technologies available within the Hadoop ecosystem. It focuses on processing long jobs, architecture, stream data patterns, log analysis, and real time analytics. Every pattern is illustrated with practical examples, which use the different open sourceprojects such as Logstash, Spark, Kafka, and so on. Traditional data infrastructures are built for digesting and rendering data synthesis and analytics from large amount of data. This book helps you to understand why you should consider using machine learning algorithms early on in the project, before being overwhelmed by constraints imposed by dealing with the high throughput of Big data. Scalable Big Data Architecture is for developers, data architects, and data scientists looking for a better understanding of how to choose the most relevant pattern for a Big Data project and which tools to integrate into that pattern.

Data Lake for Enterprises


Author: Tomcy John,Pankaj Misra
Publisher: Packt Publishing Ltd
ISBN: 1787282651
Category: Computers
Page: 596
View: 7248

Continue Reading →

A practical guide to implementing your enterprise data lake using Lambda Architecture as the base About This Book Build a full-fledged data lake for your organization with popular big data technologies using the Lambda architecture as the base Delve into the big data technologies required to meet modern day business strategies A highly practical guide to implementing enterprise data lakes with lots of examples and real-world use-cases Who This Book Is For Java developers and architects who would like to implement a data lake for their enterprise will find this book useful. If you want to get hands-on experience with the Lambda Architecture and big data technologies by implementing a practical solution using these technologies, this book will also help you. What You Will Learn Build an enterprise-level data lake using the relevant big data technologies Understand the core of the Lambda architecture and how to apply it in an enterprise Learn the technical details around Sqoop and its functionalities Integrate Kafka with Hadoop components to acquire enterprise data Use flume with streaming technologies for stream-based processing Understand stream- based processing with reference to Apache Spark Streaming Incorporate Hadoop components and know the advantages they provide for enterprise data lakes Build fast, streaming, and high-performance applications using ElasticSearch Make your data ingestion process consistent across various data formats with configurability Process your data to derive intelligence using machine learning algorithms In Detail The term "Data Lake" has recently emerged as a prominent term in the big data industry. Data scientists can make use of it in deriving meaningful insights that can be used by businesses to redefine or transform the way they operate. Lambda architecture is also emerging as one of the very eminent patterns in the big data landscape, as it not only helps to derive useful information from historical data but also correlates real-time data to enable business to take critical decisions. This book tries to bring these two important aspects — data lake and lambda architecture—together. This book is divided into three main sections. The first introduces you to the concept of data lakes, the importance of data lakes in enterprises, and getting you up-to-speed with the Lambda architecture. The second section delves into the principal components of building a data lake using the Lambda architecture. It introduces you to popular big data technologies such as Apache Hadoop, Spark, Sqoop, Flume, and ElasticSearch. The third section is a highly practical demonstration of putting it all together, and shows you how an enterprise data lake can be implemented, along with several real-world use-cases. It also shows you how other peripheral components can be added to the lake to make it more efficient. By the end of this book, you will be able to choose the right big data technologies using the lambda architectural patterns to build your enterprise data lake. Style and approach The book takes a pragmatic approach, showing ways to leverage big data technologies and lambda architecture to build an enterprise-level data lake.

Data Lake Architecture

Designing the Data Lake and Avoiding the Garbage Dump
Author: Bill Inmon
Publisher: Technics Publications
ISBN: 1634621190
Category: Computers
Page: 166
View: 2426

Continue Reading →

Organizations invest incredible amounts of time and money obtaining and then storing big data in data stores called data lakes. But how many of these organizations can actually get the data back out in a useable form? Very few can turn the data lake into an information gold mine. Most wind up with garbage dumps. Data Lake Architecture will explain how to build a useful data lake, where data scientists and data analysts can solve business challenges and identify new business opportunities. Learn how to structure data lakes as well as analog, application, and text-based data ponds to provide maximum business value. Understand the role of the raw data pond and when to use an archival data pond. Leverage the four key ingredients for data lake success: metadata, integration mapping, context, and metaprocess. Bill Inmon opened our eyes to the architecture and benefits of a data warehouse, and now he takes us to the next level of data lake architecture.

Architecture of a Database System


Author: Joseph M. Hellerstein,Michael Stonebraker,James Hamilton
Publisher: Now Publishers Inc
ISBN: 1601980787
Category: Computers
Page: 123
View: 1029

Continue Reading →

Architecture of a Database System presents an architectural discussion of DBMS design principles, including process models, parallel architecture, storage system design, transaction system implementation, query processor and optimizer architectures, and typical shared components and utilities.

Agile Data Warehouse Design

Collaborative Dimensional Modeling, from Whiteboard to Star Schema
Author: Lawrence Corr,Jim Stagnitto
Publisher: DecisionOne Consulting
ISBN: 0956817203
Category: Business & Economics
Page: 304
View: 1797

Continue Reading →

Agile Data Warehouse Design is a step-by-step guide for capturing data warehousing/business intelligence (DW/BI) requirements and turning them into high performance dimensional models in the most direct way: by modelstorming (data modeling ] brainstorming) with BI stakeholders. This book describes BEAM, an agile approach to dimensional modeling, for improving communication between data warehouse designers, BI stakeholders and the whole DW/BI development team. BEAM provides tools and techniques that will encourage DW/BI designers and developers to move away from their keyboards and entity relationship based tools and model interactively with their colleagues. The result is everyone thinks dimensionally from the outset! Developers understand how to efficiently implement dimensional modeling solutions. Business stakeholders feel ownership of the data warehouse they have created, and can already imagine how they will use it to answer their business questions. Within this book, you will learn: Agile dimensional modeling using Business Event Analysis & Modeling (BEAM ) Modelstorming: data modeling that is quicker, more inclusive, more productive, and frankly more fun! Telling dimensional data stories using the 7Ws (who, what, when, where, how many, why and how) Modeling by example not abstraction; using data story themes, not crow's feet, to describe detail Storyboarding the data warehouse to discover conformed dimensions and plan iterative development Visual modeling: sketching timelines, charts and grids to model complex process measurement - simply Agile design documentation: enhancing star schemas with BEAM dimensional shorthand notation Solving difficult DW/BI performance and usability problems with proven dimensional design patterns LawrenceCorr is a data warehouse designer and educator. As Principal of DecisionOne Consulting, he helps clients to review and simplify their data warehouse designs, and advises vendors on visual data modeling techniques. He regularly teaches agile dimensional modeling courses worldwide and has taught dimensional DW/BI skills to thousands of students. Jim Stagnitto is a data warehouse and master data management architect specializing in the healthcare, financial services, and information service industries. He is the founder of the data warehousing and data mining consulting firm Llumino.

Practical Data Science

A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets
Author: Andreas François Vermeulen
Publisher: Apress
ISBN: 148423054X
Category: Computers
Page: 805
View: 3311

Continue Reading →

Learn how to build a data science technology stack and perform good data science with repeatable methods. You will learn how to turn data lakes into business assets. The data science technology stack demonstrated in Practical Data Science is built from components in general use in the industry. Data scientist Andreas Vermeulen demonstrates in detail how to build and provision a technology stack to yield repeatable results. He shows you how to apply practical methods to extract actionable business knowledge from data lakes consisting of data from a polyglot of data types and dimensions. What You'll Learn Become fluent in the essential concepts and terminology of data science and data engineering Build and use a technology stack that meets industry criteria Master the methods for retrieving actionable business knowledge Coordinate the handling of polyglot data types in a data lake for repeatable results Who This Book Is For Data scientists and data engineers who are required to convert data from a data lake into actionable knowledge for their business, and students who aspire to be data scientists and data engineers

DW 2.0: The Architecture for the Next Generation of Data Warehousing


Author: W.H. Inmon,Derek Strauss,Genia Neushloss
Publisher: Elsevier
ISBN: 9780080558332
Category: Computers
Page: 400
View: 2442

Continue Reading →

DW 2.0: The Architecture for the Next Generation of Data Warehousing is the first book on the new generation of data warehouse architecture, DW 2.0, by the father of the data warehouse. The book describes the future of data warehousing that is technologically possible today, at both an architectural level and technology level. The perspective of the book is from the top down: looking at the overall architecture and then delving into the issues underlying the components. This allows people who are building or using a data warehouse to see what lies ahead and determine what new technology to buy, how to plan extensions to the data warehouse, what can be salvaged from the current system, and how to justify the expense at the most practical level. This book gives experienced data warehouse professionals everything they need in order to implement the new generation DW 2.0. It is designed for professionals in the IT organization, including data architects, DBAs, systems design and development professionals, as well as data warehouse and knowledge management professionals. * First book on the new generation of data warehouse architecture, DW 2.0. * Written by the "father of the data warehouse", Bill Inmon, a columnist and newsletter editor of The Bill Inmon Channel on the Business Intelligence Network. * Long overdue comprehensive coverage of the implementation of technology and tools that enable the new generation of the DW: metadata, temporal data, ETL, unstructured data, and data quality control.

Client-Side Data Storage

Keeping It Local
Author: Raymond Camden
Publisher: "O'Reilly Media, Inc."
ISBN: 1491935081
Category: Computers
Page: 118
View: 6675

Continue Reading →

One of the most useful features of today’s modern browsers is the ability to store data right on the user’s computer or mobile device. Even as more people move toward the cloud, client-side storage can still save web developers a lot of time and money, if you do it right. This hands-on guide demonstrates several storage APIs in action. You’ll learn how and when to use them, their plusses and minuses, and steps for implementing one or more of them in your application. Ideal for experienced web developers familiar with JavaScript, this book also introduces several open source libraries that make storage APIs easier to work with. Learn how different browsers support each client-side storage API Work with web (aka local) storage for simple things like lists or preferences Use IndexedDB to store nearly anything you want on the user’s browser Learn how support web apps that still use the discontinued Web SQL Database API Explore Lockr, Dexie, and localForage, three libraries that simplify the use of storage APIs Build a simple working application that makes use of several storage techniques

Modeling the Agile Data Warehouse with Data Vault


Author: Hans Hultgren
Publisher: N.A
ISBN: 9780615723082
Category: Data warehousing
Page: 434
View: 1560

Continue Reading →

Data Modeling for Agile Data Warehouse using Data Vault Modeling Approach. Includes Enterprise Data Warehouse Architecture. This is a complete guide to the data vault data modeling approach. The book also includes business and program considerations for the agile data warehousing and business intelligence program. There are over 200 diagrams and figures concerning modeling, core business concepts, architecture, business alignment, semantics, and modeling comparisons with 3NF and Dimensional modeling.

The Data Model Toolkit

Simple Skills To Model The Real World
Author: Dave Knifton
Publisher: Paragon Publishing
ISBN: 1782224734
Category: Computers
Page: 348
View: 1427

Continue Reading →

Adopting the latest technological and data related innovations has caused many organisations to realise they don’t have a firm grasp on their basic operational data. This is a problem that Logical Data Models are uniquely qualified to help them solve. The realisation of the need to define a Logical Data Model may be driven by any number of reasons including; trying to link Big Data Analytics to operational data, plunging into Digital Marketing, choosing the best SaaS solution, carrying out a core Data Migration, developing a Data Warehouse, enhancing Data Governance processes, or even just trying to get everyone to agree on their Product specifications! This book will provide you with the skills required to start to answer these and many similar types of questions. It is not written with a focus on IT development, so you don’t need a technical background to get the most from it. But for any professional working in an organisation’s data landscape, this book will provide the skills they need to define high quality and beneficial data models quickly and easily. It does this using a wealth of practical examples, tips and techniques, as well as providing checklists and templates. It is structured into three parts: The Foundations: What are the solid foundations necessary for building effective data models? The Tools: What Tools are required to enable you to specify clear, precise and accurate data model definitions? The Deliverables: What processes will you need to successfully define the models, what will they deliver, and how can we make them beneficial to the organisation? “In this data-rich era, it is even more critical for organisations to answer the question of what their data means and the value it can bring. Those who can, will gain a competitive advantage through their use of data to streamline their operations and energise their strategies. Core to revealing this meaning, is the data model that is now, more than ever, the lynchpin of success. The Data Model Toolkit provides the essential knowledge and skills that will ensure this success.” – Reem Zahran, Global IT Platform Director, TNS “We work with many enterprise customers to help them transform their technology and it always starts with data. The key is a clear definition of their data quality, completeness and governance. This book shows you step by step how to define and use Data Models as powerful tools to define an organisation’s data and maximise its business benefit.” – John Casserly, CEO, Xceed Group

Privacy in the Age of Big Data

Recognizing Threats, Defending Your Rights, and Protecting Your Family
Author: Theresa Payton,Ted Claypoole
Publisher: Rowman & Littlefield
ISBN: 1442225467
Category: Computers
Page: 276
View: 7204

Continue Reading →

Digital data collection and surveillance is pervasive and no one can protect your privacy without your help. Before you can help yourself, you need to understand the new technologies, what benefits they provide, and what trade-offs they require. Some of those trade-offs – privacy for convenience – could be softened by our own behavior or be reduced by legislation if we fight for it. This book analyzes why privacy is important to all of us, and it describes the technologies that place your privacy most at risk, starting with modern computing and the Internet.

Microsoft Azure

Planning, Deploying, and Managing Your Data Center in the Cloud
Author: Marshall Copeland,Julian Soh,Anthony Puca,Mike Manning,David Gollob
Publisher: Apress
ISBN: 1484210433
Category: Computers
Page: 426
View: 9800

Continue Reading →

Written for IT and business professionals, this book provides the technical and business insight needed to plan, deploy and manage the services provided by the Microsoft Azure cloud. Find out how to integrate the infrastructure-as-a-service (IaaS) and platform-as-a-service (PaaS) models with your existing business infrastructure while maximizing availability, ensuring continuity and safety of your data, and keeping costs to a minimum. The book starts with an introduction to Microsoft Azure and how it differs from Office 365—Microsoft’s ‘other’ cloud. You'll also get a useful overview of the services available. Part II then takes you through setting up your Azure account, and gets you up-and-running on some of the core Azure services, including creating web sites and virtual machines, and choosing between fully cloud-based and hybrid storage solutions, depending on your needs. Part III now takes an in-depth look at how to integrate Azure with your existing infrastructure. The authors, Anthony Puca, Mike Manning, Brent Rush, Marshall Copeland and Julian Soh, bring their depth of experience in cloud technology and customer support to guide you through the whole process, through each layer of your infrastructure from networking to operations. High availability and disaster recovery are the topics on everyone’s minds when considering a move to the cloud, and this book provides key insights and step-by-step guidance to help you set up and manage your resources correctly to optimize for these scenarios. You’ll also get expert advice on migrating your existing VMs to Azure using InMage, mail-in and the best 3rd party tools available, helping you ensure continuity of service with minimum disruption to the business. In the book’s final chapters, you’ll find cutting edge examples of cloud technology in action, from machine learning to business intelligence, for a taste of some exciting ways your business could benefit from your new Microsoft Azure deployment.

The Nimble Elephant

Agile Delivery of Data Models using a Pattern-based Approach
Author: John Giles
Publisher: Technics Publications
ISBN: 1634620259
Category: Computers
Page: 254
View: 9277

Continue Reading →

“Get it done well and get it done fast” are twin, apparently opposing, demands. Data architects are increasingly expected to deliver quality data models in challenging timeframes, and agile developers are increasingly expected to ensure that their solutions can be easily integrated with the data assets of the overall organization. If you need to deliver quality solutions despite exacting schedules, “The Nimble Elephant” will help by describing proven techniques that leverage the libraries of published data model patterns to rapidly assemble extensible and robust designs. The three sections in the book provide guidelines for applying the lessons to your own situation, so that you can apply the techniques and patterns immediately to your current assignments. The first section, Foundations for Data Agility, addresses some perceived aspects of friction between “data” and “agile” practitioners. As a starting point for resolving the differences, pattern levels of granularity are classified, and their interdependencies exposed. A context of various types of models is established (e.g. conceptual / logical / physical, and industry / enterprise / project), and you will learn how to customize patterns within specific model types. The second section, Steps Towards Data Agility, shares guidelines on generalizing and specializing, with cautions on the dangers of going too far. Creativity in using patterns beyond their intended purpose is encouraged. The short-term “You Ain’t Gonna Need It” (YAGNI) philosophy of agile practitioners, and the longer-term strategic perspectives of architects, are compared and evaluated. Consideration is given to the potential of enterprise views contributing to project-specific models. Other topics include industry models, iterative modeling, creation of patterns when none exist, and patterns for rules-in-data. The section ends with a perspective on the modeler’s possible role in agile projects, followed by a case study. The final section, A Bridge to the Land of Object Orientation, provides a pathway for re-skilling traditional data modelers who want to expand their options by actively engaging with the ranks of object-oriented developers. I’m delighted to see that John has put his extensive experience and broad knowledge of data modeling into print! John’s ability to simplify the complex, and to share his knowledge and enthusiasm – and humor – with colleagues, comes through in this very useful and readable book. I recommend it to anyone working with data. — Monika Remenyi, Senior Data Architect, Telstra John Giles has written a compelling and engaging book about the importance of data modeling patterns in the world of agile computing. His book is clearly and simply written, and it is full of excellent examples drawn from his extensive experience as a practitioner. You will see the enthusiasm and passion that John clearly has for his work in data modeling. And you will see in his book that any interchange with John will always have its fair share of good humor and wisdom! — Professor Ron Weber, Dean, Faculty of IT, Monash University

Big Data Governance

Modern Data Management Principles for Hadoop, Nosql & Big Data Analytics
Author: Peter Ghavami, Ph.d.
Publisher: Createspace Independent Publishing Platform
ISBN: 9781519559722
Category:
Page: 202
View: 1897

Continue Reading →

Data is the new Gold and Analytics is the machinery to mine, mold and mint it. Data analytics has become core to business and decision making. The rapid increase in data volume, velocity and variety, known as big data, offers both opportunities and challenges. While open source solutions to store big data, like Hadoop and NoSQL offer platforms for exploring value and insight from big data, they were not originally developed with data security and governance in mind. Organizations that are launching big data initiatives face significant challenges for managing this data effectively. In this book, the author has collected best practices from the world's leading organizations who have successfully implemented big data platforms. He offers the latest techniques and methods for managing big data effectively. The book offers numerous policies, strategies and recipes for managing big data. It addresses many issues that are prevalent with data security, privacy, controls and life cycle management offering modern principles and open source architectures for successful governance of big data. Topics that cover the entire data management life cycle, data quality, data stewardship, regulatory considerations, data council, architectural and operational models are presented for successful management of big data. The book is a must-read for data scientists, data engineers and information technology leaders who are implementing big data platforms in their organizations.

Data Architecture

From Zen to Reality
Author: Charles Tupper
Publisher: Elsevier
ISBN: 9780123851277
Category: Computers
Page: 448
View: 7018

Continue Reading →

Data Architecture: From Zen to Reality explains the principles underlying data architecture, how data evolves with organizations, and the challenges organizations face in structuring and managing their data. Using a holistic approach to the field of data architecture, the book describes proven methods and technologies to solve the complex issues dealing with data. It covers the various applied areas of data, including data modelling and data model management, data quality, data governance, enterprise information management, database design, data warehousing, and warehouse design. This text is a core resource for anyone customizing or aligning data management systems, taking the Zen-like idea of data architecture to an attainable reality. The book presents fundamental concepts of enterprise architecture with definitions and real-world applications and scenarios. It teaches data managers and planners about the challenges of building a data architecture roadmap, structuring the right team, and building a long term set of solutions. It includes the detail needed to illustrate how the fundamental principles are used in current business practice. The book is divided into five sections, one of which addresses the software-application development process, defining tools, techniques, and methods that ensure repeatable results. Data Architecture is intended for people in business management involved with corporate data issues and information technology decisions, ranging from data architects to IT consultants, IT auditors, and data administrators. It is also an ideal reference tool for those in a higher-level education process involved in data or information technology management. Presents fundamental concepts of enterprise architecture with definitions and real-world applications and scenarios Teaches data managers and planners about the challenges of building a data architecture roadmap, structuring the right team, and building a long term set of solutions Includes the detail needed to illustrate how the fundamental principles are used in current business practice