About Trino
The distributed SQL engine that queries the lake where the data already lives.
Trino is the open-source distributed SQL query engine that grew out of Presto, the project Martin Traverso, David Phillips and Dain Sundstrom started inside Facebook in 2012 to run interactive analytics on a Hadoop warehouse the size of the company itself. The three founders left Facebook in 2018, kept building the engine outside as PrestoSQL, and renamed the project Trino in December 2020 after a trademark dispute with Facebook over the Presto name. The code stays Apache 2.0; the Trino Software Foundation governs it; Starburst, the company co-founded by the same Presto creators alongside Justin Borgman, ships the commercial managed version.
The architecture is coordinator and workers, with one SQL plan that fans out across as many machines as the cluster has. What makes Trino different from a warehouse is the connector layer: Iceberg, Delta Lake, Hudi, Hive, Postgres, MySQL, SQL Server, Snowflake, BigQuery, Cassandra, MongoDB, Kafka and roughly thirty more all sit behind the same SQL dialect, joinable in one query. That is why Netflix, LinkedIn, Goldman Sachs, Salesforce, Stripe, Shopify and Lyft built their interactive analytics on Trino: one engine reads Parquet on S3 next to Postgres rows next to Snowflake tables, without a copy step in between. Trino is not an OLTP database and not a replacement for Snowflake or BigQuery; it is the SQL layer that lets you query the lake and federate across systems without lifting the data out first.