With the explosion of Data Mesh and the Data Product concept, data architecture, data platform, and SDLC teams have been working to understand how to enable the adoption of these new patterns within their organizations.
The adoption of data products promises to solve three critical challenges within data organizations:
By increasing levels of ownership and autonomy, connecting the software development lifecycle (SDLC) with the way data use cases are developed, and providing platform infrastructure as a service, it is possible to improve the quality of data products at scale and accelerate the speed at which data connects to business problems.
However, adopting data products and increasing autonomy in the development of data-driven business solutions introduces several challenges:
Many companies have decided to develop an internal platform in-house to pursue these objectives. The reason is clear: these are not just technological problems but soft problems that involve processes and people. The real challenge is how people work, ensuring they follow best practices and produce high-quality artifacts.
Once organizations recognize these challenges and understand that solving them is critical for successfully adopting and scaling a Data Product-centric model within complex and diverse enterprises, IT will have a huge opportunity.
For IT, this is a great chance to build something valuable for the business and position itself as an enabler of transformation. However, this opportunity, combined with the natural inclination of technical teams to build things, often leads to the decision to develop a completely in-house platform for managing data products.
We call these platforms DIY (Do-It-Yourself) Data Product Management Platforms.
Before creating Witboost, I developed several DIY platforms for Agile Lab customers. Over the following years, I carefully observed how these platforms evolved and what happened to them. Additionally, these experiences provided me with the information necessary to build the business case for Witboost, which we will share and analyze below.
Now, letβs examine how DIY platforms are typically built.
DIY platforms almost always start with a strong reliance on CI/CD and Terraform (or other Infrastructure-as-Code tools). This happens for several reasons:
Another common approach is to create a Data Product Descriptor, which is stored in Git and processed by CI/CD pipelines that trigger Terraform scripts accordingly.
Whenever a change is made to the Data Product Descriptor:
This approach seems logical and functional at first glance. After an initial MVP, teams begin expanding their capabilities.
However, as complexity grows, a series of problems start appearing. Let's take a look at them all.
1. How can you control what the user is creating?
2. How do you allow sufficient flexibility in Data Products?
3. Who manages CI/CD updates and testing?
4. As complexity increases, debugging becomes harder.
Creating resource groups, workspaces, or databases works fine.
However, managing Data Products requires much more than infrastructure provisioning, such as:
Terraform was never designed for this, leading to unstable and fragile workarounds.
2. Terraform scripts and variables must reside within the Data Product repository.
3. Pre-existing Data Products cannot be imported.
π¨ Just to be clear:
One of the pillars of Data Product Management is creating cross-functional teams, which means integrating:
If we truly want this model to succeed, we cannot:
β Force non-technical users to edit YAML and Terraform files.
β Make them work with Git commits and CI/CD debugging cycles.
π¨ Without a strong UX, business users will disengage, pushing all responsibility back to IT, and defeating the entire purpose of Data Products.
DIY platforms often fail because they are designed for hard-core engineers rather than for other users.
Before making any evaluations, it is crucial to recognize that technical teams are strongly biased toward underestimating the effort required to build any platform or IT system. The stronger the internal technical team, the greater this bias tends to be.
Below is a list of the main capabilities that a Data Product Management Platform should have. I will divide them into Platform Foundation and Data Product Management-specific features.
Platform Foundation:
Data Product Management-Specific Features:
To avoid biases, we asked ChatGPT to estimate the development cost for a platform with these characteristics.
Here is its response:
π‘ Estimated total effort: ~3,800 man-days
In our experience, this number is very optimistic, letβs assume it is a valid baseline.
If we assume a team structure consisting of:
After one year of focused work (without distractions or external dependencies), this team could deliver the platform's foundation. Consider that a team with 17 people is not easy to manage and coordinate; very often, companies start with way smaller teams before realizing they need more power to speed up the process.
However, at this stage, a significant amount of work would still remain, including:
This creates two major problems:
What do business teams do while waiting for the platform to become fully operational?
Delaying the official platform may result in the unstructured development of Data Products, which contradicts the very purpose of governance. Also, it will require spending money to reduce the technical and data debt created while the platform reaches a decent state.
2. Long-Term Maintenance Costs
In the software industry, the ongoing maintenance cost of developed code is estimated at 15% per year's initial development cost.
So, even assuming that no new features are added (an unrealistic assumption), letβs do some calculations:
Developing the core functionalities of the platform will require between β¬1.5M and β¬3M.
The developed platform will generate an annual maintenance cost of β¬225Kββ¬450K.
π‘ However, platforms do not remain static β they require new features and improvements.
Thus, we must also budget at least β¬500K annually for evolutive development.
These new features will, in turn, generate even higher maintenance costs over time. Progressively, after 3β4 years, the maintenance cost alone will easily exceed 1M Euro.
Is This Estimate Too High?
No β if you want a platform that can evolve.
This estimate is very conservative if the goal is to build a system that does not overfit current requirements and can expand in the future.
Why?
π‘ From our experience, a more realistic estimate is 2β3x higher than 3,800 man-days to reach a high-quality level.
Wardley Maps are a well-established tool for strategizing internal investments and identifying where it makes sense to develop internal tools β and where it does not.
Data Product Management is a domain with an emerging but rapidly maturing market. Several companies have already invested over β¬5 million in R&D, making it highly unlikely that an internal effort could achieve better results, faster time to market, or any meaningful differentiation.
In the 1990s, companies built custom in-house databases. Today, databases are fully commoditized, and developing one internally does not offer a competitive advantage. A similar shift is happening in Data Product Management.
π¨ Why does this matter?
Because many companies underestimate the complexity of building and maintaining a Data Product Management Platform.
In many cases, an internal DIY approach:
β Takes too long to reach adoption
β Fails to meet real business needs
β Becomes a long-term maintenance burden
π‘ This is why many companies are now considering hybrid models:
β
Buying a mature platform as a foundation
β
Extending it with adapters, webhooks, policies, plugins
This approach significantly reduces risk, accelerates time to market, and controls long-term costs. π