I was recently reminded that even the best technology is not always enough to implement a successful solution. Purview may be a great data governance tool, but the key to success is to also implement a well-planned data governance program around it.
Within the last year, I had the unique opportunity to experience two Purview implementation projects with very different approaches to data governance. One organization started from the perspective of implementing a data governance program and finding the best tool to support that program. The other started from the perspective of implementing the technology and then trying to figure out how to use it to provide data governance.
Organization One |
Organization Two |
Established a formal Data Governance Office and dedicated staff. |
Assigned responsibility to an existing data analytics group. |
Business stakeholder involvement from the start of planning, through tool assessment, pilot and into MVP launch. |
No business stakeholder input. |
User focused. |
Technology/administration focused. |
As you may have guessed, organization two had many challenges. Without a clear understanding of the value to be provided to users, it was difficult to obtain support from the business and executive management and get dedicated resources assigned to the project. The result was that the initiative was always a low priority and progress was slow.
Below are the keys to successfully implementing data governance with Purview from my experience. You’ll notice that the technical platform and tools are just one out of seven key components for success.
In the following approach, I will incorporate these keys to success with the lessons learned from the two projects I mentioned earlier, as well as others, to guide you toward a successful data governance and Purview implementation and help you avoid the common pitfalls.
The main goal of this phase is to ensure core data governance objectives can be met and the right stakeholders become engaged with the project. In order to do this, you first need to know what your objectives are.
Start with Key #1 and define your enterprise-wide data governance vision, goals, and objectives. There are many reasons you may be starting this program, from wanting to get to the next level of a data-driven organization to recovering from a data breach, and your goals and objectives should align with these reasons. You will want to take the time to gather requirements for the program. This is also a good opportunity to start working on Keys 2 & 4, stakeholder buy-in and domain specific use cases.
Microsoft provides some example objectives to get you started with thinking about what you want to achieve. The following is my favorite vision statement and example objectives:
“Our vision for the Data Governance Program is to improve the use and usefulness of data across the company to promote the development of actionable insights. We will drive the evolution of data asset management and empower business units to derive value from their data through policies, standards, and collaboration.”
Goals & Objectives:
As you can see, you do not need to have all the details worked out at this phase, just an understanding of the direction you need to go.
While you are working on your data governance goals and objectives, you can start on Key #5 by standing up Purview and getting some technical resources trained on how to use it. They should learn how to:
The goal for this phase is to configure Purview to demonstrate some critical end-to-end scenarios for your data governance objectives.
By the end of this phase, you will need to have Key #2 started. Your Data Governance Program might be an IT program, but it isn't just about the technology. It's about fostering an enterprise-wide "data as an asset" mindset and creating a foundation in which data assets can be effectively managed. For this, you want to have the support of a core group of executives and key business stakeholders who understand the value data governance will provide the organization and will champion the program and grow support.
In this phase, you will launch Purview with a small set of users, maybe a department or other business unit. You should start with Key #4 and develop in-depth requirements for each domain’s specific end-to-end scenarios, such as lineage from Azure Data Lake Storage to Azure Synapse DW to Power BI, as well as key scenarios that will provide value across your organization, such as glossary terms. Then continue with your Purview implementation.
The goal of this phase is to show how you will be able to successfully roll-out data governance across the organization. A successful pilot phase will solidify Keys #2 & 3 by strengthening executive and stakeholder support and convincing them of the need for dedicated data governance resources and assigned ownership of data governance tasks across the organization. By end of this phase, you should be able to answer these questions:
It’s time to plan your production milestones!
Based on what you learned from your Purview Pilot, create a Center of Excellence and onboard the team of data stewards/curators, technical owners, and business data owners to finish the work needed to make Purview useful across your whole organization. They will then work with your data governance organization to define the domains, what the implementation of data governance for each domain will accomplish, and jointly determine prioritization.
A business domain is a grouping of data with similar characteristics that is needed to satisfy business process requirements. Within the domains, there will be data products that harness the value of the business domain’s data. A data product is a curated and self-contained combination of data, metadata, semantics, and templates. This means it is data that can be easily used for analytics use cases and can be tailored to the business domain’s specific needs.
The Data Product Operating Model is a federated data governance operating model. This means the data in each domain is managed, maintained, and operated by a specific business domain team.
To successfully launch your data governance and Purview MVP solution, you should complete Keys 6 & 7. Data governance runbooks and supporting documentation should provide teams with the resources needed to implement data governance into their domains. Define the required processes for run-the-business policies and procedures and to manage issues. Issue Management is a crucial process for addressing, managing, and resolving data issues formally by defining ownership of data issues, tracking their resolution, and ensuring others are informed. Then develop your training and communications program for both pre- and post-launch.
By the time you reach this phase you should have effective data governance processes in place. In this phase you will start to integrate the data and business domains that were not included in the MVP.
The delivery of data governance and the Purview Data Catalog at scale provides a source of truth rather than just granting access to the multiple systems of record. Data products should be built and highly governed to provide purpose-built data sets for specific use cases. These products should be aligned to business scenarios rather than technical or administrative requirements.
Data governance is an ongoing process. Regular assessments and adjustments are key to maintaining effective governance. And remember, this is just the start for many things data and analytics; data governance will also help your organization prepare for growing trends such as AI.
If you want to discuss more about what you can do to get started on data governance with Purview and take advantage of new AI capabilities, please don’t hesitate to contact us.