The Curse of doing data right
Photo by Jude Beck on Unsplash
Part 1: What is the curse?
Very few people will disagree that having a solid foundation for data is a good idea. This includes many of those things that we hear all the time, such as good data quality, common standards, or a good catalogue where we can find all our data assets. This is the same for more technical foundational topics, such as metadata, master / reference data, data modelling, and so on. Once we, data professionals, can explain what these things mean, the general response tends to be positive: essentially, everybody agrees these are good things to have.
Getting to implement this foundation, however, is a different story. Because data is the backbone of most organisations these days, any serious attempt to review and modify how data is managed will affect users and could fundamentally change their ways of working. And it is then, when users realise that they are at the core of implementing changes across the organisation. It is their responsibility too, not just the data professional job and here are where the problems start. The same organisation that has probably invested in hiring data professionals to “fix” its data problems, will start making excuses for why many of the measures can’t be implemented. Users and management will argue that operations will be disrupted, deadlines will be missed or that it is not the right time. Some organisations will agree to proceed with the less disruptive proposals, but of course, those will also be the ones with less impact or will incur technical debt that will need to be fixed later. Chances are that months and years will pass and little will change; at best, there will be an improvement of awareness and some minor initiatives will be carried out. At worse, the organisation will dismiss advice or try to solve the problems by investing in technology products, which will never work because the foundations were never fixed.
Data in an interesting technical field. It is in so much demand, but so misunderstood. There are a wide variety of areas of expertise with radically different skill sets, and yet often the field gets oversimplified by putting all data professionals in the same bucket. For example, data scientists / analysts are many times thought of as holding the key to solve many data issues. But the issues normally stem from poor data quality or interoperability, which is not something that data scientist or analysist can (or should) address. Along the same lines, data problems are something to be resolved upstream i.e. once the data has been collected and stored. But, to solve quality issues one needs to start at source i.e. where data is collected, stored and shared. This misunderstanding is one of the main reasons why there is such a disparity between the willingness to invest in data professionals and the reluctance to carry out their advice.
Organisations are ill-prepared to undertake the scale of changes that are being advised. They put in place obstacles to attempt managing the disruption, and so the main reasons why these change projects take such a long time are not technical, but cultural. Work is carried out to justify investment, but there is unwillingness to fix the root cause of the problem. Data scientists / analysts are hired to produce results, which invariably means creating inefficient and inconsistent shortcuts to put the data in a form that can be analysed. In turn, the data professionals that provided the advice for change in the first case, see that advice dismissed or underappreciated, and fall victims of the contradiction in which the organisation finds itself. As those professional become frustrated, and as they continue pointing out how the organisation is doing things in the wrong way, they sometimes can become disliked and even ostracised, which increases their frustration as, in their view, they are just trying to achieve what they were hired to do.
The cycle continues, with investment in data infrastructure but without the willingness to implement the necessary changes. For those responsible for the change, sometimes this situation might feel like a curse, the curse of doing right for data. But it’s not all doom and gloom; there is a growing community of data professionals who are supporting each other with doing data right. Which is exactly what our next blog will talk about!
This is the first blog of a 3-part series looking at the challenges of data professionals within their organisations. Join us next month where Lisa Allen will talk about what data management changes are commonly required and why organisations find them challenging.
Tomas is currently the Chief Data Architect at the Office for National Statistics, where he is leading ONS’s data strategy as well as being responsible for a number of data products. Tomas also regularly gets involved in forums and initiatives to foster the use of good data management practices across government.
Data Quality – a Multidimensional Approach
In my blogs and articles over the lockdown period I’ve avoided talking about the impact of the Covid 19 pandemic and the heavy reliance on good quality data to support the models needed to combat and mitigate its effects. I have decided to break my silence in this blog as a major data story recently hit the headlines in my part of the world, Wales in the United Kingdom. This story was literally so close to home that I felt impelled to highlight and comment on it, and use it to stress why the need for good data quality is more important than ever. Click here to read the blog in full
Data Governance on a ‘shoestring’ budget - yes, it’s possible
Demand for data governance is increasing as the result of Coronavirus, and at a time where resources are scarce. There’s a huge focus on data because people want to be sure what they have is of good enough quality to make decisions about the future of their business and its survival.
However, at the same time demand’s going up, budgets are being cut. People are not spending money on data governance but, they want more of it! So, we've got this conundrum. If we're going to deliver data governance, it has to deliver some benefits. There's no point doing it for just the fun of it but how are you going to do that if you've got little or no budget? We need to deliver data governance of a shoestring budget.
So, the question I asked myself is what can we really do that's useful on that basis? Well, after 17 years in data governance I've learned that, practically, you can’t do data governance over everything and that it's also not useful to as all data is not of the same value to your organisation. We always need to consider carefully where we put our Data Governance focus on and now, more so than ever we need to be pragmatic - that’s where ‘minimal data governance’ comes in.
But what does this really mean in practice?
Well, it’s not the bare minimum to keep your regulator happy. It is not ‘just enough’ so you can say you are doing it. Minimal data governance has to deliver real value. If it doesn't, there's absolutely no point in doing it.
But just because it's minimal it doesn't mean it's going to take less time. Data Governance takes a long time and I'm afraid the bad news is that minimal data governance also takes a long time.
Apart from anything else, you won’t get any value from it by trying to do it quickly because you won't do it properly. And therefore, you won't get the value.
So, can take minimal data governance be effective?
Yes, I think it can, because I think it's probably the way I've been increasingly approaching Data Governance over recent years. What I'm encouraging you to do is to be even more pragmatic and focused than I usually am, but I think if you do that, you should be able to deliver something on an inadequate budget that can deliver some real value to your organisation.
And, the secret to ‘minimal data governance’ is to identify one priority benefit because if you get Data Governance in place to deliver that correctly, some of the other benefits will start coming through anyway, and you'll be in a good position to then focus on delivering more of them. Benefits can include:
Once you’ve focussed on what you want to get out of your minimal approach, you will need to define your scope - are we dealing with customer data, finance data or a subset of one of these categories? Really take this opportunity to identify a very limited scope. I think the best way of thinking about it is of doing data governance incrementally. What we're doing is our first phase is just going to be very tightly defined. Then when we deliver that, we'll be in a good place to roll it out further.
So, to be truly effective, you need to really bear three things in mind:
Nicola Askham - Data Governance Coach - https://www.nicolaaskham.com/
Data Quality and Data Governance Frameworks
What are they and do I need both?
"How do a data quality and data governance framework relate to each other?”I get asked this question quite frequently and I think it’s a really interesting one, so I’d really like to help you get to the bottom of it. I think the reason it comes up is because people have been doing data quality and worrying about data quality for many more years than they have data governance.And so, they feel very strongly that there are two different frameworks in action. Another common misconception is that the two are the same. This may come from a lack of understanding of what data governance really is, so let’s break it down…..
Is data governance the same as data quality?
The very short answer is no. Data quality is the degree to which data is accurate, complete, timely, and consistent with your business’s requirements. Data governance, in very basic terms, is a framework to proactively manage your data in order to help your organisation achieve its goals and business objectives by improving the quality of your data.
Data governance helps protect your business, but also helps streamline your business's efficiency. It ensures that trusted information is used for critical business processes, decision making, and accounting. And so, if you think about it, data governance vastly provides a fabulous foundation for many data management disciplines, its primary purpose is to manage and improve your data quality.
To put it in much simpler terms, if data was water then…
- Data Quality would ensure the water was clean and prevent contamination
- Data Governance would make sure the right people had the right tools to maintain the plumbing.
So, why would you want two frameworks relating to data quality?
The simple answer is you wouldn’t. This really isn't a question about how you align two frameworks. You should only have one framework and data quality and data governance should be working in harmony with one another – not against or in opposition.
Data governance and data quality rely very much on each other, I usually describe the relationship between them as symbiotic, as their relationship is based on mutual interdependence. Therefore, of course, you need both! You would not want to do one without the other if you want to successfully manage and improve the quality of your data in a sustainable manner.
Sadly, in my experience, some organisations do not yet fully understand that you do need to do both. Whilst you rarely (if ever) come across a company that is implementing a data governance framework without the intention to improve data quality, it is fairly common for organisations to commence data quality initiatives without implementing a data governance framework to support them. Unfortunately, this leaves many data quality initiatives as merely tactical solutions that only have short-term results.
And, it doesn’t matter whether you call it data quality or data governance (because let's face it, some people really react badly to the term data governance) as long as it gets your business users engaged and understanding what that framework is about.
So, let's just have one data quality framework which encompasses the roles and responsibilities around data, and then there is nothing to go wrong, no duplication, no gaps between two different frameworks. Make this simple and make it sustainable.
You can see the video I originally did on this topic here and if you've got any questions you’d like me to address in future videos or blogs, please just email them in to email@example.com.