Something interesting happened the other day. Our company experienced a minor data incident that our data observability platform didn’t catch for the simple reason that the data was trapped in a silo.
This got me thinking about other places data hides, not just from data observability platforms but from the scrutiny of the data team itself — and how these risks can impact our ability to successfully implement more modern or even experimental programs like data mesh. We sometimes refer to these as data silos, but really it’s the data equivalent of shadow IT.
Shadow data (or data silos) arise when:
Autonomy and speed are prioritized over technical standards;
Data access or resources are limited, forcing teams to work around existing systems and processes; or
Data consumers decide to deploy their own point solutions rather than work with you.
But instead of waving the white flag, consumers often find a way to get what they need outside of the watchful eye of the data team — and that’s risky.
When done wrong, these can be the hardest silos to reign in.
Much of the marketing tech stack, like email service providers and marketing automation platforms, rose to prominence ahead of the advances in modern data warehouses. This meant the most expedient way to drive impact was with an all-in-one solution that collected, managed, and served data directly to the marketer. Admittedly, I was once selling the dream of a “single line of JavaScript” for running experiments that allows you to avoid any further conversations with “IT.”
Some of the most pressing problems created by this silo are that these systems quickly experience an ungoverned sprawl of customer segments (product page repeat visitors Oct 2020), and the segmented campaigns often suffer from a lack of measurement.
Back in my consulting days, I assessed that one large telco was “micro-targeting” less than 1% of their customer base because they were unknowingly filtering campaigns down to customers that satisfied every attribute within their complex segmentation.
But we turned a corner a few years ago — data teams and technologies are now capable of being fast and flexible enough to operate at the speed of business while also unlocking new opportunities with the richness of data in the warehouse (“hey, want to optimize for LTV instead of click-through?”). Marketers and other business partners see the value in going through, rather than around, the data team.
These marketer-first solutions are adapting. Whether you opt for a modern CDP or reverse ETL to transport data into the hands of marketers, the must-have feature is to have a collection, transformation, and basic segmentation operating on your enterprise data warehouse.
Solution: I’ve found the best way to get marketing teams onboard is to create systems that respect and address their need for speed and autonomy while partnering to ensure strong governance and measurement are part of the solution.
My Recommendation? Proactively Remove Silos
It can be tempting to let sleeping dogs lie, but, in my opinion, data teams should proactively work to remove data silos or any “shadow data” systems.
I’ve typically opted for the Field of Dreams approach — “if you build it, they will come.” (The approach is probably more akin to gathering requirements, scoping the project, getting approvals, building a minimum viable product, getting feedback, iterate and they will come — but that is not nearly as pithy).
But if you build it and they don’t come, then you need to address whether that’s a failure of your technology solution, a lack of organizational buy-in, or something else altogether. Then you need to find a solution that puts you on the right path to breaking down silos.
After all, data is ultimately the data team’s responsibility, and consumers will do what they need to do to access it. Our best path forward as data leaders is to accept this reality and take steps to mitigate it.
Data (computing)AnalyticsData lossData managementObservability
Comments