top of page

Tobiko Data: Revolutionizing Data Transformation With SQLMesh

Discover how Tobiko Data's SQLMesh transforms the data landscape, enabling data scientists and analysts to build efficient and correct pipelines.

The Power of SQLMesh

Tobiko Data, an innovative company founded by industry veterans from Netflix, Apple, Airbnb, and Google, is revolutionizing the data transformation landscape with their flagship product, SQLMesh. As an open-source data transformation platform, SQLMesh empowers data scientists and analysts to build correct and efficient pipelines, addressing the common pain points faced in the current data ecosystem.

Tobias Mao, Co-founder and CTO of Tobiko Data, explained to the 56th IT Press Tour, "Tobiko Data is an open-source data transformation platform. We primarily write open-source software, and our flagship product is called SQLMesh. SQLMesh is a framework that allows data scientists and analysts to work with data in an efficient and reproducible way, thereby reducing costs and increasing productivity."

Addressing the Pain Points

The current data landscape is plagued with challenges, including unmaintainable spaghetti code, data accuracy issues, and inefficiencies that lead to skyrocketing costs. Tobias highlights these pain points: "You have no idea what's going in and around the data. The checks and balances around data are very immature. This causes many problems and trust issues with data accuracy and outages. In terms of deficiencies, the industry, as a whole, doesn't like to think about that. And, until now, did have ways to address these contradictions."

SQLMesh tackles these issues by providing an efficient, correct, and operationally complete solution. By leveraging virtual data environments, building tables once, and reducing warehouse costs, SQLMesh saves organizations time and money. Its semantic understanding, column-level DAG resolution, and support for simple SQL in any dialect ensure correctness without compromising functionality.

The SQLMesh Advantage

One of SQLMesh's key differentiators is its virtual data environments. Tobias explains, "The way SQLMesh works is, imagine you have two data models with a straightforward plug. SQLMesh will have two layers. One is a physical layer. This is where all the actual tables are stored, and then the virtual layer is where you interact with the data. The virtual layer is views pointing to the physical layers."

This innovative approach allows for instantaneous, no-downtime deployments and easy rollbacks, making SQLMesh the first data platform to offer actual blue-green deployments with data. Tobias emphasizes, "This is unique to SQL versions, the first time that any data platform can get true blue/green deployments with data."

Leveraging SQLGlot for Unparalleled Understanding

At the core of SQLMesh's capabilities lies SQLGlot, an open-source SQL parser built by Tobias during his time at Netflix. SQLGlot enables SQLMesh to understand various SQL dialects, a crucial aspect in achieving correctness and efficiency.

Tobias explains, "Until I created SQLGlot, that power didn't exist. There's Snowflake, BigQuery, Databricks, Postgres, and MySQL. All are quite different from each other. This makes it challenging for tools to understand SQL queries written for one database and run them on another. How can anybody understand all these different SQL guidelines? The answer is SQLGlot."

Seamless Integration and Vendor Independence

SQLMesh integrates with existing data ecosystems, including popular platforms like Snowflake and Databricks. Its ability to transpile code from one dialect to another ensures vendor independence, allowing organizations to own their data and avoid vendor lock-in.

Tobias emphasizes, "Using SQLMesh in combination with a data lake like Apache Iceberg really gives you the ability to work on any vendor platform."

The Future of Data Transformation

Tobiko Data's SQLMesh is poised to shape the future of data transformation. With its efficient, correct, and operationally complete approach, SQLMesh addresses the critical challenges data scientists and analysts face. As organizations increasingly rely on data-driven decision-making, tools like SQLMesh become indispensable in building scalable and reliable data pipelines.

Tobias sums it up perfectly: "SQLMesh is one of the premier platforms to generate scalable and correct data.  This platform helps your data teams build those AI models today."

Embracing the Open-Source Community

Tobiko Data's commitment to open-source software is evident in its approach to building and maintaining SQLMesh. By fostering a thriving community and leveraging the power of collaboration, Tobiko Data ensures SQLMesh remains at the forefront of data transformation innovation.

Tobias highlights the importance of the open-source community: "We have a large and growing community of over 2,000 people in our Slack channel. When we first launched, people said, 'Oh, wow, this is solving so many problems we've had for years. We just didn't think solving those kinds of things was possible.'"

The Road Ahead

As Tobiko Data continues to innovate and expand, the future looks bright for SQLMesh and organizations that adopt it. With a strong focus on efficiency, correctness, and operational completeness, SQLMesh is well-positioned to revolutionize the data transformation landscape.

Tobias concludes, "Right now, I'm having much fun. I didn't do this for any financial outcome. I wanted to start a company, work with my friends, and build something great. So we'll see where that takes us."

For developers, engineers, and architects seeking a powerful and user-friendly data transformation solution, Tobiko Data's SQLMesh is a game-changer. By leveraging the power of SQLGlot, virtual data environments, and an unwavering commitment to open-source innovation, SQLMesh is set to redefine how organizations handle data transformation.


bottom of page