Logo
  • Data Stack Guide
  • GPT Prompts
  • KPI Map
Get Demo
Catalog of Data Modeling Tools
Catalog of Data Modeling Tools

Catalog of Data Modeling Tools

By Catalog from Coalesce (https://coalesce.io)

The raw data collected by companies is usually messy and unusable for data analysis. Data has to be transformed, so it can be made conducive to value-generating data analysis.

This explains the recent explosion of data transformation tools (internal, open-source, and SaaS). This new trend is not going to stop, and we'd rather bring visibility and structure soon.

image

At Coalesce Catalog, we believe the first step to structure the data transformation tools market, is more transparency. For that reason, we put up a list of all the data modeling tools we heard of.

Get started on Data Modeling Tools

πŸ’‘
This list is still exploratory, may contain errors, or lack information. Please reach out to us, if you notice anything wrong: xavier.deboisredon@coalesce.io
πŸ“’
In-depth analysis and evolution Read the full breakdown by generation and market analysis of data transformation here
image

Deeper dive into SQL Editors

β€£
What does each column in the benchmark below mean?

Deployment: Is the tool SaaS or open-source?

Classification: Is the tool exclusively used for transforming data (such as coalesce) or is the transformation part of a larger offering? For example, ETL tools transform data, but they also take care of the extract and loading steps.

Security: This criteria notes whether the solution is compliant with any specific regulatory law like GDPR, HIPAA, etc.

Language: What is the scripting language used for data transformations? Scala, Python, SQL? Is the solution no-code?

Community: Is there a community built around the solution? Communities tend to be especially important with open-source tools, as they provide a great amount of support.

Data sources supported: Where are the transformations operated with the solution? Does it support transformations in data warehouses? Databases?

Add data quality checks: test data quality with assertions checks for uniqueness or null values, or write a custom assertion in SQL to check any property of your data.

version control: You can easily track changes and restore version histories of datasets.

Real-time query validation: solution validates compiled queries against BigQuery in real-time, enabling users to identify issues before running queries.

Real-time data transformation: Run SQL search, aggregations and joins just as data is generated.

Benchmark data transformation tools

Name
Website
Deployment
Classification
Security
language
features
Community
data sources supported
Coalesce πŸ‡ΊπŸ‡Έ
coalesce.io
SaaS
Transformation only
Hosted on google cloud platform
No-codeSQL
Column-aware transformationsData modeling automationVersion controlOrchestration integration
SnowflakeRedshiftBigQuery
Dataform πŸ‡¬πŸ‡§
dataform.co
SaaS
Transformation only
Hosted on google cloud platform
No-codeSQL
dataform.co
RedshiftBigQuerySnowflakeAzure SQLPostgresSQL
Modlr πŸ‡¦πŸ‡Ί
modlr.co
SaaS
Transformation part of larger offering
No-code
Dashboard customisation
Trifacta πŸ‡ΊπŸ‡Έ
www.trifacta.com
SaaS
Transformation only
SOC 2 compliant
SQLNo-code
Data visualization
community.trifacta.com
RedshiftSnowflakeBigQuery
Rudderstack πŸ‡ΊπŸ‡Έ
rudderstack.com
Open-source
Transformation part of larger offering
SQL
Real-time transformations
resources.rudderstack.com
RedshiftBigQuerySnowflakePostgresSQLClickhouse
Matillion πŸ‡¬πŸ‡§
www.matillion.com
SaaS
Transformation part of larger offering
SOC 2 compliantHIPAACSA STAR
SQLNo-code
RedshiftBigQuerySnowflake
Easymorph πŸ‡¨πŸ‡¦
easymorph.com
SaaS
Transformation part of larger offeringETL tool
No-code
120 built-in transforms
Paxata πŸ‡ΊπŸ‡Έ
www.paxata.com
SaaS
Transformation part of larger offering
No-code
Rockset πŸ‡ΊπŸ‡Έ
rockset.com
SaaS
Transformation only
SQL
Real-time transformations
MongoDBDynamoDBPostgresSQLMySQLS3GCSKafka
Beam
beam.apache.org
Open-source
Transformation part of larger offering
SQL
beam.apache.org
Mara
github.com
Open-source
Zillion
totalhack.github.io
Open-source
Glue πŸ‡ΊπŸ‡Έ
aws.amazon.com
SaaS
Transformation part of larger offering
ScalaPython
Airflow
airflow.apache.org
Open-source
Transformation part of larger offering
Python
Quality checks