By Louise de Leyritz from Castor (www.castordoc.com)
The raw data collected by companies is usually messy and unusable for data analysis. Data has to be transformed, so it can be made conducive to value-generating data analysis.
This explains the recent explosion of data transformation tools (internal, open-source, and SaaS). This new trend is not going to stop, and we'd rather bring visibility and structure soon.
At CastorDoc, we believe the first step to structure the data transformation tools market, is more transparency. For that reason, we put up a list of all the data modeling tools we heard of.
Get started on Data Modeling Tools
This list is still exploratory, may contain errors, or lack information. Please reach out to us, if you notice anything wrong: louise@castordoc.com.
In-depth analysis and evolution
Read the full breakdown by generation and market analysis of data transformation here
Deeper dive into SQL Editors
β£
Name | Website | Deployment | Classification | Security | language | features | Community | data sources supported |
---|---|---|---|---|---|---|---|---|
Dataform π¬π§ | SaaS | Transformation only | Hosted on google cloud platform | No-codeSQL | RedshiftBigQuerySnowflakeAzure SQLPostgresSQL | |||
Modlr π¦πΊ | SaaS | Transformation part of larger offering | No-code | Dashboard customisation | ||||
Trifacta πΊπΈ | SaaS | Transformation only | SOC 2 compliant | SQLNo-code | Data visualization | RedshiftSnowflakeBigQuery | ||
Rudderstack πΊπΈ | Open-source | Transformation part of larger offering | SQL | Real-time transformations | RedshiftBigQuerySnowflakePostgresSQLClickhouse | |||
Matillion π¬π§ | SaaS | Transformation part of larger offering | SOC 2 compliantHIPAACSA STAR | SQLNo-code | RedshiftBigQuerySnowflake | |||
Easymorph π¨π¦ | SaaS | Transformation part of larger offeringETL tool | No-code | 120 built-in transforms | ||||
Paxata πΊπΈ | SaaS | Transformation part of larger offering | No-code | |||||
Rockset πΊπΈ | SaaS | Transformation only | SQL | Real-time transformations | MongoDBDynamoDBPostgresSQLMySQLS3GCSKafka | |||
Beam | Open-source | Transformation part of larger offering | SQL | |||||
Dbt πΊπΈ | Open-source | Transformation only | SOC 2 compliant | SQL | Git integrationsversion controlloggingmodularityreference one data model within another | RedshiftPostgresSQLSnowflake | ||
Mara | Open-source | |||||||
Zillion | Open-source | |||||||
Glue πΊπΈ | SaaS | Transformation part of larger offering | ScalaPython | |||||
Airflow | Open-source | Transformation part of larger offering | Quality checks |