by Joseph Brady, Director of Business Development at Treehouse Software, Inc.

Since 1982, Treehouse Software has been serving enterprises worldwide with industry-leading mainframe software products and outstanding technical support. Today, Treehouse Software brings you Treehouse Dataflow Toolkit (TDT), a serverless (Lambda-based) application that goes beyond basic data transfer. It is a fully automated solution that prepares the full infrastructure needed to automatically prepare the staging infrastructure for the massive data loading to targets, such as Amazon Redshift, Snowflake, Amazon Athena/S3, Amazon S3 Express One Zone Buckets, and Amazon Aurora PostgreSQL. TDT supports data replication between mainframe and non-mainframe sources—without disrupting existing critical work on customers’ legacy systems.

The Treehouse solutions utilizes Rocket Data Replicate and Sync (RDRS) to pull data from the mainframe, where an agent (with a very small footprint) extracts data (either bulk-load or CDC processing). The raw data is then securely passed from the mainframe by RDRS, which transforms and publishes the data to a Kafka topic (in our example above, a topic in an Amazon MSK cluster). The TDT microservices consume the data from MSK/Kafka and land it in S3 buckets, where TDT’s proprietary crawler technology is used to automatically prepare landing tables, views, and additional infrastructure for various analytics friendly targets. Then the mainframe data is loaded into Redshift, Snowflake, S3, or PostgreSQL (all the while adhering to AWS’s and Snowflake’s recommended “best practices” for massive data loading, thus assuring shortest and surest loads). The inherent reliability and scalability of the entire pipeline infrastructure assures near-real-time synchronization between mainframe sources and the target tables, even with very large bulk-loads or transaction-heavy CDC processing.
What about non-mainframe data?
For customers who have non-mainframe data sources, Treehouse offers TDT-DIRECT which pulls data directly from PostgreSQL, SQL Server, Oracle, MySql, and Db2 for bulk-load and CDC into a variety of targets on AWS.

Instantaneous auto scaling…
For massive amounts of data, TDT takes advantage of the auto scaling and parallelizing of the Lambda framework. This allows many parallel selects to all run at once, thus loading large tables with minimal latency. Additionally, all TDT Lambda microservices are fully customizable (they will be YOUR Lambdas) to add extra monitoring capabilities, and any other functionalities for future needs.

TDT’s innovative Lambda-based microservices approach enables faster data flow than any conceivable ODBC-based solution, which is the standard tool used for most “roll your own” approaches, or “we have a connector for that” offerings. TDT offers several key differentiators from standard “connectors” on the market, including:
- Automatic creation of target resources – TDT automatically prepares landing tables, views, and additional staging infrastructure for the target. Without TDT’s fully automated approach, a customer can spend months designing and creating target resources, such as delta tables, views, schemas, etc.
- Ease of delivery/implementation – TDT is delivered via CloudFormation templates, which automate and accelerate the process of installing and configuring the complete TDT application (including AWS Lambda functions and numerous other AWS resources, all wrapped in a well-architected security framework) in your AWS account. This allows your site to be up and running with a fully preconfigured implementation of your new data transfer pipeline in minutes.
- Adherence to best practices – TDTis built in alignment with AWS and Snowflake best practices, ensuring proper security and performance. The fault-tolerant design of the Cloud-native application provides for a robust, future-proof architecture.
- Adaptability to evolving Cloud ecosystems – In today’s fast-evolving cloud world, TDT’s flexible design ensures lasting compatibility with emerging technologies. As AWS and Snowflake introduce new features, the application readily integrates them, staying ahead of the curve, keeping your data pipelines modern and efficient.
TDT and TDT-DIRECT are designed to deliver:
- rapid mainframe and non-mainframe data bulk-loading and CDC to Snowflake and AWS targets
- access to the latest Analytics, AI, and ML tools and services
- swift ROI
Contact us today to discuss your needs, or to book a free demo.
Visit Treehouse Software on the AWS Marketplace for all of our Cloud offerings…
Treehouse Dataflow Toolkit (TDT) and TDT-DIRECT are Copyright © Treehouse Software, Inc. All rights reserved.
Contact us today to schedule a demo!






















