More data, more tools, more people = more data catalogs
Companies are deploying their analytics to more people in the company. Now, regardless of data literacy, most departments of large companies are using data. For that reason, there's a need to improve trust and understanding in data resources and infrastructure.
This explains the recent explosion in the past five years of data catalogs (internal, open-source, and SaaS). This new trend is not going to stop, and we'd rather bring visibility and structure soon.
At Coalesce Catalog, we believe the first step to structure the data catalog market, is more transparency. For that reason, we put up a list of all the catalog tools we heard of.
Feature definition
**This is an attempt at classifying the tools on the market. If anything seems wrong, the feature list seems off, or if you don't see your data catalog and want to have it placed, please reach out: xavier.deboisredon@coalesce.io
Feature | Classification | Collibra | Alation | Atlan | Coalesce Catalog | Informatica | Data World | Dataedo | OvalEdge | Purview | Octopai | Acryl | Secoda | Select Star | Metaphor |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Role-Based Access Controls | Data Governance | ||||||||||||||
Metadata Analytics | Data Governance | ||||||||||||||
Metadata Bulk Edit | Data Governance | ||||||||||||||
Automated PII tagging | Data Governance | ||||||||||||||
Advanced Tag Management | Data Governance | ||||||||||||||
Policy and Workflow | Data Governance | ||||||||||||||
Multi-Tenant Infrastructure | Data Governance | ||||||||||||||
Social Data Discovery | Data Discovery | ||||||||||||||
Advanced Search Filtering | Data Discovery | ||||||||||||||
Table Popularity & Frequent Users | Data Discovery | ||||||||||||||
Column Lineage | Data Lineage | ||||||||||||||
Cross Platform Lineage (ETL → Data Warehouse → BI tools) | Data Lineage | ||||||||||||||
Definition Propagation | Data Lineage | ||||||||||||||
Personalized Views | User Experience | ||||||||||||||
Chrome Extension | User Experience | ||||||||||||||
Rich Text | User Experience | ||||||||||||||
SQL Editor | User Experience | ||||||||||||||
Two-Way Sync | Integrations | ||||||||||||||
Slack Integration | Integrations | ||||||||||||||
API Based Ingestion | Integrations | ||||||||||||||
On Premise Metadata Extractor | Integrations | ||||||||||||||
Data Quality Integration | Integrations | ||||||||||||||
Natural Language Search | AI features | ||||||||||||||
AI Documentation | AI features | ||||||||||||||
AI for SQL | AI features | ||||||||||||||
AI Assistant | AI features | ||||||||||||||
Business Glossary | Knowledge Management | ||||||||||||||
Knowledge Map | Knowledge Management |
More Ressources
Data Catalog Pricing Guide:
Data Catalog Template:
Data Catalog RFI template:
Data Catalog ROI calculator:
F.A.Q
Do You Need a Data Catalog?
Additional comparisons and benchmark resources
How to Make Your Data Catalog Successful
There are only 2 goals that matter when it comes to measuring the success of a data catalog: 1) adoption, and 2) customer satisfaction. If you nail these two, you are successful. I'm the co-creator of the leading open-source data catalog, Amundsen, which is used by 35+ companies including Instacart, Square, Brex, Asana, and many more.
towardsdatascience.com
The Ultimate Guide to Evaluating a Data Catalog - CastorDoc Blog
Make informed decisions when choosing a data catalog with CastorDoc's comprehensive evaluation guide.
www.castordoc.com
https://towardsdatascience.com/defining-data-ownership-3fbe95fd0125
In the first paragraph of a post I had written earlier this month, I referred to data engineers as producers of data. Someone immediately replied something to the extent of, " You lost me at the first sentence. Data Engineers can't be data owners."
towardsdatascience.com
Castor Data Catalog
Castor Data Catalog is a platform that enables organizations to quickly and easily find, understand, and use their data.
www.castordoc.com
Collibra vs. Alation: Points of Difference Between the Two
It is the data that drives the businesses. Managing it and getting the best out of it is of great importance in any organization. Collibra and Alation are two such platforms helping in removing the barriers for managing data efficiently.
wisdomplexus.com
Data Discovery for Business Intelligence
Dashboards and reports are the lingua franca in the world of business. Simple as they may seem, behind each KPI dashboard are data analysts who are responsible for keeping dashboards working, accurate, and fresh. For small teams with a handful of data analysts, building dashboards is easy.
towardsdatascience.com