Accelerating Data Migration to Google Cloud with the Datametica Bird Product Suite

Posted by

In October 2023, Onix announced the release of new AI and data solutions after the company’s acquisition of Datametica. With this announcement, Onix is ready with next-gen solutions in data migration and modernization that deliver the following capabilities:

  • Data migration from legacy and cloud-powered warehouses
  • Data integration
  • Business intelligence (BI) 
  • Vertex AI and Gemini (formerly Duet AI) deployments

Datametica has its proprietary suite of data transformation products – also referred to as Datametica Birds, namely:

  • Eagle – The Planner
  • Raven – The Transformer
  • Pelican – The Validator

In this blog, we will delve deeper into the capabilities and benefits of each of the Datametica Birds.

Eagle – The Planner

With the Eagle product, companies can perform the first step of any data migration process – namely planning. They can accurately assess the data warehouse requirements of the existing legacy systems for advanced systems like Oracle, Teradata, Netezza, and Google BigQuery.

How does the Eagle planning tool benefit organizations planning to migrate their data to the cloud? This tool provides them with a customized cloud migration plan, along with the estimated budget and migration time. With data warehouse assessment, they can detect unknown complexities before the migration process.

Here are some of the core features of the Eagle tool:

  1. Detailed assessment of data warehouses
    Organizations can generate an accurate and detailed assessment of their data warehouses within a few weeks. They can also identify and prioritize data workloads fit for cloud migration. Data warehouse assessment also recommends the best data models for migration. 
  2. Cloud migration planning
    With this Eagle feature, organizations can develop their cloud migration strategy, along with identifying the data to be migrated, the detailed migration steps, and the onboarding plan for existing users. Additionally, they can define the scope of the data migration project and how it aligns with their business objective.
  3. Migration budget and time estimation
    This Eagle feature accurately estimates the budget and duration of data migration by analyzing:
    • Volumetrics, data workloads, and data models (for the duration)Cloud migration and consumption data (for the cost)
  4. Data lineage
    Data lineage enables organizations to unlock use cases in data governance, data privacy, and cloud computing. Some of its in-built features include code lineage, data lineage in table and column format, and end-to-end visibility into data flow.

Raven – The Transformer

The Raven product is the data transformation tool, which converts SQL- and ETL-based code workloads from an on-premises environment to a cloud-native system.

The supported on-premises systems include Oracle, Netezza, and Teradata and the supported cloud-native platforms include Google BigQuery, MS Azure Synapse, AWS Redshift, and Snowflake.

The benefit of this data transformation tool is that it can convert long and complex code workloads in minutes, thus saving both transformation time and costs.

Here are some of the core features of the Raven tool:

  1. SQL code conversion
    With the Raven tool, organizations can automatically translate complex SQL codes and scripts. It also provides smart emulations for unsupported (or incomplete) SQL clauses using an SQL-like language. Among other capabilities, it can convert embedded SQL and incomplete SQL statements, along with parameterized SQL code.

    This feature also supports translation for database utilities like Bteq, Load-Store, and Macros. It has built-in support for DDL definitions for objects like stored procedures, macros, and view definitions.

  1. ELT/ETL conversion
    With this feature, organizations can convert ETL to ELT scripts according to the target cloud environment. Additionally, Eagle supports custom expressions written in ETL code. It is also capable of converting non-SQL components that are part of ETL workflows.

  1. Code optimization
    With Eagle’s code optimization feature, companies can achieve consistent optimization in generated code and scripts. They can benefit from cost and time savings due to SQL support and workflow optimization. During code transformation, this feature considers the data lineage and removes job redundancy.

Pelican – The Validator

Powered by AI technology, Pelican is a data validation tool that can compare and reconcile datasets across two (or more) heterogeneous data sources. By using Pelican, organizations can ensure optimum data quality and data reliability between the source and target platform.

Some of the benefits of the Pelican tool include:

  • 100% data accuracy and quality
  • Time and cost efficiency due to automatic validation
  • Nearly 100% network savings
  • 100% data storage savings with zero data movement between data stores

Here are some of the core features of the Pelican validator tool:

  1. AI-powered automated data validation and reconciliation with
    • AI-powered mapping engine and Soundex algorithm for initial setup
    • Expression engine for system configuration
    • Execution engine for data validation
  2. Zero coding functionality – based on a no-code/ low-code development approach
  3. No data movement (or duplication) across data sources during validation
  4. End-to-end data validation comprising of
    • Cell-level validation
    • Metadata validation
    • Various validation modes
  1. Data validation runs parallel to the data migration process – thus eliminating wait time for completion of the migration process.
  2. Data lineage-powered triaging that
    • Validates all data load pipeline tables at the same time.
    • Traverses the entire lineage of data load pipelines and identifies the root cause of any data error.
  1. Built-in reporting comprising
    • Summary dashboards.
    • Data validation reports.
    • Sample reports of data mismatch
    • Mismatch viewer that displays the text, hex, and the Unicode
  1. Enterprise-wide data validation tool with
    • AES 256 encryption for data persistence.
    • Compatibility with SSL and TLS network standards.
    • Automated backups of metadata.
    • Work segmentation.
  1. Integration with CI/CD pipelines with the ability to Automatically create mappings on multiple tables. Upload CSV files to quickly establish equivalence between renamed tables and table columns.
    • Automatically create mappings on multiple tables.
    • Upload CSV files to quickly establish equivalence between renamed tables and table columns.

In summary

By using the data migration suite of products from the Datametica Birds, organizations can accelerate their cloud migration initiatives. With the Datametica acquisition, Onix has empowered its customers to streamline data migration to Google Cloud – with improved speed and lower costs.

Besides, organizations can accelerate their cloud migration without any human intervention – and reduce the time and cost of migration by 50%. Here’s a case study of a U.S.-based retailer who leveraged the Pelican tool to save 90% of their data validation costs when migrating to Google Cloud.

Are you interested in knowing more about our data migration product suite? Just get in touch with us!

Related blogs

Subscribe to stay in the know

Your trusted guide to everything cloud

No matter where you are on your journey, trusted Onix experts can support you every step of the way.