5/07/2018

Who Owns the Data - the Chief Data Officer

Who owns the data? 

The data gets captured from front-end systems, web log files, downloads off the web, Hadoop clusters, CSV or JSON files, streaming analytics data sent from IoT devices in mini-burst packets, OData feeds, archives and backups, or good old legacy data.


So it would appear IT owns the data, because it resides in files, databases and mainframes that sit on a shelf in the on-site data center or in a centralized facility at another location.


Or perhaps it resides in the Cloud.  If so, the vendor stores the data and is responsible for backups and concurrency across the globe, so does the vendor own it?  Well, they only capture or store the data.  So does the organization that owns the Cloud account actually own the data?  Or the vendor?


Yet the data ends up in ETL jobs and gets converted into Data Warehouses, Data Models, Reports, Visualizations, Machine Learning models, etc.  So does the developer who cleanses and aggregates the data and pushes it to new systems, models and reports own the data?


How about the Business Units?  They know the business model, or at least their piece of the puzzle.  Does the Business own the data?  What about data residing on file shares across the network?  Does IT own that, or the Business?


What about insights derived from the data?  Who owns those?


I'd say it needs to roll up to the Chief Data Officer, a fairly new role that sits at the intersection of IT (the CIO), the Business and everything in between, and that reports to the CFO or CEO.  Or to the Data Competency Center, which performs a similar if not identical role.


The CDO is responsible for the entire data stack, from data creation to data ingestion to data storage to data mashing to reporting to data science.  He or she can matrix in other departments for skills, domain knowledge and assistance as needed, including the hiring of consultants.  The CDO works with IT and accounting to purchase software, align on cost savings, and document data across the entire org, as well as how and when data flows through the entire ecosystem.
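
To make the "document data across the entire org" part a bit more concrete, here is a minimal sketch of what a catalog of datasets and their flows might look like.  Everything in it is an assumption for illustration: the Dataset record, the lineage helper, and the dataset and team names are all hypothetical and not taken from any particular tool.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Dataset:
    """One hypothetical catalog entry: what the data is and who looks after it."""
    name: str
    stage: str        # e.g. "ingestion", "storage", "reporting"
    steward: str      # team that handles the data day to day
    upstream: List[str] = field(default_factory=list)  # datasets it is built from


def lineage(catalog: Dict[str, Dataset], name: str) -> List[str]:
    """Return the named dataset and everything upstream of it, sources first."""
    ordered: List[str] = []

    def visit(current: str) -> None:
        if current in ordered:
            return
        for parent in catalog[current].upstream:
            visit(parent)
        ordered.append(current)

    visit(name)
    return ordered


# Illustrative entries only; the names are made up.
catalog = {
    d.name: d
    for d in [
        Dataset("web_logs", "ingestion", "IT"),
        Dataset("sales_warehouse", "storage", "ETL developers", ["web_logs"]),
        Dataset("revenue_report", "reporting", "Finance business unit",
                ["sales_warehouse"]),
    ]
}

for step in lineage(catalog, "revenue_report"):
    print(step, "->", catalog[step].steward)
```

Each stage here has a different steward (IT, the developers, a business unit), yet the catalog itself, and the answer to how a report traces back to its sources, sits in one place under one accountable owner.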





Who owns the data?  I venture to say the Chief Data Officer owns the data.