Techno Blender
Digitally Yours.
Browsing Tag

Kinsvater

How to Reduce Your Master Data Management Bill | by Paul Kinsvater | Apr, 2023

Grab the low-hanging fruit with open-source

Master Data Management, or MDM, is commercial vendors' buzzword for an entity resolution framework. I talked to several vendors; most offer SaaS and price it by the total number of records ingested from sources. For larger enterprises, that adds up to a 6- to 7-digit dollar amount per year.

The target audience for this article

Are you planning to implement MDM soon? Have you asked vendors for a quote? Or did your company already invest in an MDM SaaS? For sure, it is not a small…

Metrics Store in Action. With a tutorial using MetricFlow… | by Paul Kinsvater | Feb, 2023

With a tutorial using MetricFlow, Python, DuckDB, dbt, and Streamlit

The metrics layer defines all critical business metrics and dimensions centrally. It translates metric requests into SQL, abstracting away implementation details. Image created by the author.

There is a lot of literature on Modern Data Stacks (MDS); most discussions are around storage, ingestion, transformation, and presentation. Today we focus on metrics, one of the many other MDS categories.

Some say that metrics are one component of the semantic layer…

Deduplicate and clean up millions of location records | by Dr. Paul Kinsvater | Sep, 2022

How record linkage and geocoding combined improve data quality

Photo by Ralph (Ravi) Kayden on Unsplash

Big companies store data in several systems for different purposes (ERPs, CRMs, local files). Each potentially holds customer data, and not all of them, if any, are in sync. In addition, links across sources either do not exist or are not appropriately maintained. The consequence is duplicate records, inconsistencies, and poor data quality in general. That’s a perfect opportunity for us to shine with an algorithmic…