Products

Developer data solutions for secure software and AI development
Structural
Ephemeral
Textual
Validate

Tonic Structural

Safe, realistic structured and semi-structured synthetic data generation for hydrating staging environments, enabling local development, and fueling secure analytics.
Annual commitment / volume discounts available. Contact Us
De-identify your data without losing its utility.
Shrink down your database while maintaining referential integrity.
Accelerate your release cycles and ensure regulatory compliance.
Pricing based on source data volume.
Commit annually and save.
Pay-as-you-go
For developers and small teams with simple use cases who need to protect their sensitive data.
Adapt to changing business needs without overcommitting budgets.
Starting at $199/month for 20 tables

Usage
Source Data
Up to 2TB
Workspaces
Unlimited
Generated Data
Unlimited
Databases
Unlimited
Deployment
Tonic Cloud
Features
Comprehensive Generator Library
Privacy Scan
Cross-Table Consistency
Schema Change Alerts
Virtual Foreign Keys
Concurrent Generations
Subsetting with Referential Integrity
Upsert
w/o Schema Differences
Encryption Support
Workspace Inheritance
Generator Presets
Data Sources
Data Connector Types
Supported data types
Choose 2 Types
Application Databases
PostgreSQL
MySQL / MariaDB
MongoDB
DocumentDB
DynamoDB
Vertica
Db2 LUW
Oracle
Data Warehouses
Snowflake
BigQuery
Redshift
Data Lakes (on Spark)
Databricks
Amazon EMR w/ Glue
Spark
SaaS Applications
Salesforce
Flat Files
Plain Text
JSON
CSV / TSV
XML
HTML
Don’t see a data source listed? Chances are we’re working on it. Reach out for additional details.
Collaboration
Users
10
Workspace Sharing
Commenting and Annotation
Role Based Permissions
Automation
REST API
Basic
Schema Change Alerts
Webhooks
Post Job Actions
Compliance & Security
Authentication
Tonic Auth or Google SSO
License Tier
Professional
Certifications
SOC 2 Type II


AWS Qualified Software

HIPAA Compliant
Privacy Report
Audit Report
Professional
The simplest and fastest way to generate highly realistic, privacy-preserving synthetic data that reflects production.
Access committed spend discounts and a customer success manager with an annual contract.

Book a demo
Usage
Source Data
Up to 10TB
Workspaces
Unlimited
Generated Data
Unlimited
Databases
Unlimited
Deployment
Tonic Cloud
Features
Comprehensive Generator Library
Privacy Scan
Cross-Table Consistency
Schema Change Alerts
Virtual Foreign Keys
Concurrent Generations
Subsetting with Referential Integrity
Upsert
w/o Schema Differences
Encryption Support
Workspace Inheritance
Generator Presets
Data Sources
Data Connector Types
Supported data types
Choose 2 Types
Application Databases
PostgreSQL
MySQL / MariaDB
MongoDB
DocumentDB
DynamoDB
Vertica
Db2 LUW
Oracle
Data Warehouses
Snowflake
BigQuery
Redshift
Data Lakes (on Spark)
Databricks
Amazon EMR w/ Glue
Spark
SaaS Applications
Salesforce
Flat Files
Plain Text
JSON
CSV / TSV
XML
HTML
Don’t see a data source listed? Chances are we’re working on it. Reach out for additional details.
Collaboration
Users
10
Workspace Sharing
Commenting and Annotation
Role Based Permissions
Automation
REST API
Basic
Schema Change Alerts
Webhooks
Post Job Actions
Compliance & Security
Authentication
Tonic Auth or Google SSO
License Tier
Professional
Certifications
SOC 2 Type II


AWS Qualified Software

HIPAA Compliant
Privacy Report
Audit Report
Enterprise
The complete, self-managed solution to enable consistent, unified compliance and streamlined productivity across the organization, including all of your data and use cases at scale.

Usage
Source Data
Unlimited
Workspaces
Unlimited
Generated Data
Unlimited
Databases
Unlimited
Deployment
Tonic Cloud or Self Hosted
Features
Comprehensive Generator Library
Privacy Scan
Cross-Table Consistency
Schema Change Alerts
Virtual Foreign Keys
Concurrent Generations
Subsetting with Referential Integrity
Upsert
w/o Schema Differences
Encryption Support
Workspace Inheritance
Generator Presets
Data Sources
Data Connector Types
Supported data types
Choose Unlimited
Application Databases
PostgreSQL
MySQL / MariaDB
MongoDB
DocumentDB
DynamoDB
Vertica
Db2 LUW
Oracle
Data Warehouses
Snowflake
BigQuery
Redshift
Data Lakes (on Spark)
Databricks
Amazon EMR w/ Glue
Spark
SaaS Applications
Salesforce
Flat Files
Plain Text
JSON
CSV / TSV
XML
HTML
Don’t see a data source listed? Chances are we’re working on it. Reach out for additional details.
Collaboration
Users
Unlimited
Workspace Sharing
Commenting and Annotation
Role Based Permissions
Automation
REST API
Advanced
Schema Change Alerts
Webhooks
Post Job Actions
Compliance & Security
Authentication
SSO / SAML
License Tier
Enterprise
Certifications
SOC 2 Type II


AWS Qualified Software

HIPAA Compliant
Privacy Report
Audit Report
Plan comparison (values listed in the order Pay-as-you-go, Professional, Enterprise)
Usage
Source Data: Up to 2TB (Pay-as-you-go), Up to 10TB (Professional), Unlimited (Enterprise)
Workspaces: Unlimited (all plans)
Generated Data: Unlimited (all plans)
Databases: Unlimited (all plans)
Deployment: Tonic Cloud (Pay-as-you-go), Tonic Cloud (Professional), Tonic Cloud or Self Hosted (Enterprise)
Features
Comprehensive Generator Library
Privacy Scan
Cross-Table Consistency
Schema Change Alerts
Virtual Foreign Keys
Concurrent Generations
Subsetting with Referential Integrity
Upsert: w/o Schema Differences (Pay-as-you-go), w/o Schema Differences (Professional), with Schema Differences (Enterprise)
Encryption Support
Workspace Inheritance
Generator Presets
Custom Sensitivity Rules
Data Sources
Data Connector Types: Choose 2 Types (Pay-as-you-go), Choose 2 Types (Professional), Choose Unlimited (Enterprise)
Application Databases
PostgreSQL
MySQL / MariaDB
SQL Server
MongoDB
DocumentDB
DynamoDB
YugabyteDB
Db2
Oracle
SAP ASE
Data Warehouses
Snowflake
BigQuery
Redshift
Data Lakes (on Spark)
Databricks
Amazon EMR w/ Glue
Spark
SaaS Applications
Salesforce
Flat Files
Plain Text
JSON
CSV / TSV
XML
HTML
Avro
Parquet
Don’t see a data source listed? Chances are we’re working on it. Reach out for additional details.
Collaboration
Users: 10 (Pay-as-you-go), 10 (Professional), Unlimited (Enterprise)
Workspace Sharing
Commenting & Annotation
Role-Based Access Controls
Automation
REST API: Basic (Pay-as-you-go), Basic (Professional), Advanced (Enterprise)
Schema Change Alerts
Webhooks
Post Job Actions
Compliance & Security
Authentication: Tonic Auth or Google SSO (Pay-as-you-go), Tonic Auth or Google SSO (Professional), SSO / SAML (Enterprise)
License Tier: Professional (Pay-as-you-go), Professional (Professional), Enterprise (Enterprise)
Certifications: SOC 2 Type II, AWS Qualified Software, HIPAA Compliant (all plans)
Privacy Report
Audit Report
Not ready to commit? Start your free trial to experience Tonic Structural before you decide.

Sign up here
Common questions
How does the Pay-As-You-Go plan work?

The Pay-As-You-Go plan offers rapid access to our cloud offering of Tonic Structural, making it ideal for simple use cases and small data sizes. The plan entails a monthly credit card subscription that is billed automatically and includes up to 20 unique tables processed per month, as many times as you need. If you exceed 20 unique tables, we will issue a separate charge to your credit card on file at our on-demand table rate of $19 per table per month.

Each billing period, we count the number of unique tables that are processed across all generations (jobs), workspaces, and databases connected to Tonic. We only count tables that are processed by Tonic to the destination database. This includes:



  1. Tables that are configured using the following modes: De-Identify, Scale, Preserve Destination, Incremental;

  2. Tables that are included in a subsetting run to maintain referential integrity (in-subset tables); or,

  3. Any table with generators applied, including those that contain foreign keys that are de-identified in the parent table.

We do not count tables without generators applied (“passthrough”) or those in Truncated mode.
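
As a rough illustration of the billing arithmetic described in this answer, the Python sketch below computes a monthly Pay-As-You-Go total from a list of processed tables. The $199 base price, the 20 included tables, and the $19 on-demand rate are taken from this page. The function names, the use of a plain string as a table identifier, and the idea of adding the base and overage into one total are assumptions made for illustration; this is not Tonic's billing code.

```python
BASE_MONTHLY_PRICE = 199    # USD, "Starting at $199/month for 20 tables"
INCLUDED_TABLES = 20        # unique tables covered by the base subscription
ON_DEMAND_TABLE_RATE = 19   # USD per extra unique table per month


def unique_table_count(processed_tables):
    """Count each table once, no matter how many jobs processed it.

    Assumption: every table is identified by one string, for example
    "database.schema.table". How Tonic identifies a table is not stated.
    """
    return len(set(processed_tables))


def monthly_charge(unique_tables):
    """Return base subscription plus overage, in USD.

    The page describes the overage as a separate charge; this sketch
    simply adds the two amounts together.
    """
    if unique_tables < 0:
        raise ValueError("unique_tables must be non-negative")
    extra_tables = max(0, unique_tables - INCLUDED_TABLES)
    return BASE_MONTHLY_PRICE + extra_tables * ON_DEMAND_TABLE_RATE


if __name__ == "__main__":
    # Three jobs touched the same two tables: only two unique tables count.
    jobs = ["app.public.users", "app.public.orders", "app.public.users"]
    print(unique_table_count(jobs))   # 2
    print(monthly_charge(20))         # 199
    print(monthly_charge(25))         # 199 + 5 * 19 = 294
```

Under these assumptions, a month with 25 unique billable tables would total $294: the $199 subscription plus five on-demand tables at $19 each.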

Can I use Tonic Subsetting without using other Tonic Structural platform features?

Yes, we support the option to purchase a limited license for access to Subsetting only, in conjunction with Tonic Ephemeral.

How do you price annual contracts?

Annual pricing for Tonic Structural is determined by the Plan you select and the amount of source data connected to Tonic. Source data is defined as the total size on disk of data sources connected to Tonic Structural, excluding logs and indexes. Our pricing model builds in volume discounts, so the incremental price per GB of source data decreases as you connect more data to Tonic Structural.

What cloud providers do you support?

Tonic Structural is compatible with databases deployed on all major cloud providers, including AWS, Microsoft Azure, and GCP.

Does Tonic sign contractual data handling agreements?

Yes, for Enterprise plan customers, we will sign Business Associate Agreements (BAAs) and Data Processing Agreements (DPAs). Tonic.ai is also SOC 2 certified, GDPR compliant, and HIPAA compliant, and Tonic Cloud is AWS Qualified Software.

Tonic Ephemeral

Fully populated databases for each testing cycle to supercharge efficiency and shrink costs.
Annual commitment / volume discounts available. Contact Us
Save time provisioning and maintaining test databases
Spin up databases on-demand or automatically as part of your CI/CD pipeline
Eliminate long wait times with rapid dataset duplication
Usage-based pricing based on database uptime.
Only pay for what you use.
Features
  • On-demand database creation
  • Rapid dataset duplication
  • Seamless integration with Tonic Structural
  • Easy data import
  • Unlimited users
  • Unlimited databases
  • Comprehensive APIs for seamless automation
  • Intuitive UI for manual database setup
  • Built-in expiration timers
  • Automatic shutdowns of unused databases
  • Storage controls
  • Cloud or self-hosted deployments
Common questions
What are the benefits of using Tonic Ephemeral?

Tonic Ephemeral makes it cheap and easy to create isolated test and development databases to accelerate your engineering velocity and shrink compute costs. We built Ephemeral in response to the challenges we’ve seen many customers confront in getting test data into the hands of their developers.

Rapid, isolated access to data enables developers to:

  • Run a CI/CD process with one click and get de-identified data spun up in an environment for testing and development.
  • Spin up databases as needed, for example to reproduce a bug or for manual testing.
  • Stand up data subsets optimized for development (when paired with Tonic Subsetting).
How do you price Tonic Ephemeral?

Our usage-based pricing model means you only pay for the hours your ephemeral databases are running each month. Tonic Ephemeral provides multiple expiration options to ensure your database is running only when needed. You can deactivate databases after a defined period of inactivity, at a specific date and time, after business hours, manually, or with an API command. In cases where the database is still needed after deactivation, Tonic Ephemeral rapidly reactivates the database with the data intact. With a reactivated database, the data is preserved in its previous state so you can pick up right where you left off, as if you had kept the database running 24/7 but at a fraction of the cost.

Note on Ephemeral database billing units (hours): For Ephemeral Cloud, a unit is equal to one eighth of a vCPU running for an hour. For self-hosted instances of Ephemeral, where the databases run on the customer’s own equipment, a database billing unit is equivalent to an hour of database activity (vCPU is not taken into account for self-hosted pricing).
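
The billing-unit definition in the note above can be turned into a small calculation. The Python sketch below is illustrative only: the conversion of one unit to one eighth of a vCPU-hour and the self-hosted rule of one unit per hour come from the note, while the function names are hypothetical and no price per unit is assumed, since this page does not state one.

```python
UNITS_PER_VCPU_HOUR = 8  # one unit = 1/8 of a vCPU running for one hour


def cloud_billing_units(vcpus, hours):
    """Ephemeral Cloud: units scale with both vCPU count and uptime."""
    if vcpus < 0 or hours < 0:
        raise ValueError("vcpus and hours must be non-negative")
    return vcpus * UNITS_PER_VCPU_HOUR * hours


def self_hosted_billing_units(hours):
    """Self-hosted: one unit per hour of database activity; vCPU is ignored."""
    if hours < 0:
        raise ValueError("hours must be non-negative")
    return hours


if __name__ == "__main__":
    print(cloud_billing_units(2, 10))      # 2 vCPUs for 10 hours -> 160 units
    print(self_hosted_billing_units(10))   # 10 hours -> 10 units
```

Read this way, a 2-vCPU cloud database that runs for 10 hours consumes 160 units, while the same 10 hours on a self-hosted instance consume 10 units regardless of vCPU count.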

Can I use Tonic Ephemeral with Tonic Structural subsetting?

Yes, we support the option to purchase a limited license for access to subsetting in Tonic Structural in conjunction with Tonic Ephemeral. Subsetting enables you to create targeted snapshots of your entire production databases, using percentages or custom WHERE clauses. By using this in conjunction with Tonic Ephemeral, developers can access the specific data they need for testing and your organization can avoid the mounting costs of data bloat that results from traditional data provisioning approaches.

Tonic Textual

All-in-one data platform designed to eliminate integration and privacy challenges ahead of RAG ingestion or LLM training.
Annual commitment / volume discounts available. Contact Us
Extract, govern, enrich, and deploy your unstructured data for Gen AI
Elevate RAG performance and accuracy with entity metadata tags
Protect sensitive and proprietary data used for generative AI
Usage-based pricing based on words processed. Only pay for what you use.
Features
  • Standardize, enrich, and secure your unstructured data for AI development
  • Configure enterprise-grade unstructured data pre-processing pipelines in minutes
  • Enrich vector stores with NER-powered entity metadata tags to improve RAG performance
  • Automatic redaction and reversible tokenization of sensitive entities ensure privacy is maintained
  • Optional synthetic data replacement helps maintain semantic realism of your data
  • Use Textual pipelines directly in your existing workflows with Python SDK and REST API
  • Unlimited datasets
  • Unlimited custom models
  • Unlimited users
  • Per-word pricing that scales sublinearly
  • Google SSO or Tonic Auth
  • Cloud or Self-Hosted deployments
  • Native support for TXT, DOCX, PDF, CSV, XLS, TIFF, PNG, JPEG, PPTX, PARQUET, HTML, XML, and JSON
Have sensitive text data in Snowflake?
Use our Snowflake Native App to leverage Textual models directly in your Snowflake environments.

Try Textual on Snowflake
Common questions
How do you price Tonic Textual?

Tonic Textual pricing is volume based. You will be charged based on what you use, defined as the number of words processed. Our pricing model scales sublinearly such that larger amounts of text are significantly discounted per unit. Contact us to receive a custom quote.

Are annual contracts available?

Yes, for annual engagements, we will work with you to scope your estimated needs for the year and allocate a bank of words that you are free to use throughout the year.

What are your deployment options?

Our hosted cloud version of Tonic Textual is optimized for speed and performance and is the option we recommend. Textual is also available for self-hosted deployments, as an AMI on the AWS Marketplace, or as a Snowflake Native App.

Tonic Validate

Rigorous metrics, benchmarking, and continuous monitoring for production RAG systems.
Automated, LLM-powered RAG evaluation and experimentation
Log, measure, and assess RAG response quality against a set of benchmark responses
Track how well your RAG system is responding to user queries in production
Open-source and always free.
Use our free cloud UI for easy visualizations and configurations.
Features
  • 25+ native metrics for each component of your RAG stack
  • GitHub Actions templates for easy CI/CD integration
  • Integrate directly into your RAG workflows
  • Supports Azure OpenAI, GPT-3.5, GPT-4, Gemini, Anthropic, and more
  • Visualize granular changes in system performance over time
  • Review retrieved context and LLM responses
  • Build custom performance benchmark datasets
  • Score RAG system performance on real production user queries
Personal
Usage
Price
Free
Runs
20 free runs per month
Projects
2
Data Retention
6 months
SDK Integration
Yes
Collaboration
Collaborative Teams
1
Team Members
-
Security
SOC2 Certified
HIPAA
Single Sign-On
GitHub
Deployment
Tonic Cloud (US)
RBAC
Support
Community Slack
Email Support
Dedicated CSM
Private Slack Channel
Procurement
Billing
Monthly, self-serve
Credit Card
Custom Terms, DPA, BAA
InfoSec Review
Teams
Usage
Price
$19
Runs
250 free runs per month
$0.10/run thereafter
Projects
5
Data Retention
Unlimited
SDK Integration
Yes
Collaboration
Collaborative Teams
Unlimited
Team Members
10
Security
SOC2 Certified
HIPAA
Single Sign-On
GitHub
Deployment
Tonic Cloud (US)
RBAC
Support
Community Slack
Email Support
Dedicated CSM
Private Slack Channel
Procurement
Billing
Monthly, self-serve
Credit Card
Custom Terms, DPA, BAA
InfoSec Review
Enterprise
Usage
Price
Custom
Runs
Custom
Projects
Custom
Data Retention
Unlimited
SDK Integration
Yes
Collaboration
Collaborative Teams
Unlimited
Team Members
Unlimited
Security
SOC2 Certified
HIPAA
Single Sign-On
Custom SSO
Deployment
Add-on for self-hosted deployment in customer's VPC
RBAC
Support
Community Slack
Email Support
Dedicated CSM
Private Slack Channel
Procurement
Billing
Annual Invoice
ACH
Custom Terms, DPA, BAA
InfoSec Review
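
For the Teams plan listed above, the run pricing reduces to simple arithmetic. The Python sketch below is a minimal illustration: the $19 price, the 250 free runs per month, and the $0.10 per run thereafter come from the plan listing, while treating $19 as a monthly price (suggested by the "Monthly, self-serve" billing line) and the function names are assumptions. Amounts are kept in whole cents to avoid floating-point rounding.

```python
TEAMS_BASE_PRICE_CENTS = 1900      # "$19"; assumed here to be a monthly price
TEAMS_FREE_RUNS = 250              # "250 free runs per month"
TEAMS_OVERAGE_CENTS_PER_RUN = 10   # "$0.10/run thereafter"


def teams_monthly_cost_cents(runs):
    """Return the assumed monthly Teams cost in cents for a number of runs."""
    if runs < 0:
        raise ValueError("runs must be non-negative")
    billable_runs = max(0, runs - TEAMS_FREE_RUNS)
    return TEAMS_BASE_PRICE_CENTS + billable_runs * TEAMS_OVERAGE_CENTS_PER_RUN


def format_usd(cents):
    """Format a whole number of cents as a dollar string."""
    return f"${cents // 100}.{cents % 100:02d}"


if __name__ == "__main__":
    print(format_usd(teams_monthly_cost_cents(250)))  # $19.00
    print(format_usd(teams_monthly_cost_cents(300)))  # $24.00
```

Under these assumptions, 300 runs in a month would cost $24.00: the $19 base plus 50 additional runs at $0.10 each.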
Common questions
What is the difference between the open-source SDK and the Validate UI?

Tonic Validate includes an intuitive UI to help manage projects, easily visualize changes in your RAG system components over time, build your benchmark Q&A set, and leverage additional metrics including toxicity, bias, hate speech, PII leakage, legal and ethical compliance checks, and more.

tonic_validate is our open-source logging and metrics SDK. It allows you to programmatically iterate through your benchmarks, log LLM responses and metadata, and calculate the metrics you want. It is and will remain free to use forever.

What is a run?

A run represents an assessment of the RAG responses to a set of questions based on the RAG system configuration at a given point in time.

For each response, the run includes:

  • The question and, optionally, the corresponding ideal answer. A benchmark is one option for providing the questions.
  • The LLM's response and the context that the RAG system retrieved.
  • Metadata in the form of key-value pairs that you specify. For example, "Model": "GPT-4".
  • Scores for the responses that use your chosen metrics.

The run also includes overall scores for the given metrics.
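
To make the shape of a run concrete, the Python sketch below models the pieces listed above with plain dataclasses. It is an illustration only and does not use the tonic_validate SDK or reproduce its classes: the names `ResponseRecord`, `Run`, and `metric_a` are hypothetical, and computing each overall score as the mean of the per-response scores is an assumption, since this page does not say how overall scores are derived.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Dict, List, Optional


@dataclass
class ResponseRecord:
    """One evaluated response inside a run (hypothetical structure)."""
    question: str
    llm_response: str
    retrieved_context: List[str]
    ideal_answer: Optional[str] = None                       # optional, per the text
    metadata: Dict[str, str] = field(default_factory=dict)   # e.g. {"Model": "GPT-4"}
    scores: Dict[str, float] = field(default_factory=dict)   # metric name -> score


@dataclass
class Run:
    """A set of evaluated responses for one RAG configuration."""
    records: List[ResponseRecord]

    def overall_scores(self) -> Dict[str, float]:
        # Assumption: the overall score for a metric is the mean of the
        # per-response scores for that metric.
        by_metric: Dict[str, List[float]] = {}
        for record in self.records:
            for name, score in record.scores.items():
                by_metric.setdefault(name, []).append(score)
        return {name: mean(values) for name, values in by_metric.items()}


if __name__ == "__main__":
    run = Run(records=[
        ResponseRecord("Q1", "A1", ["ctx 1"], ideal_answer="A1",
                       metadata={"Model": "GPT-4"}, scores={"metric_a": 4.0}),
        ResponseRecord("Q2", "A2", ["ctx 2"], scores={"metric_a": 2.0}),
    ])
    print(run.overall_scores())  # {'metric_a': 3.0}
```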

What is a project?

Projects are collections of runs that allow you to see how your run performance changes over time. Before you can upload runs to the Validate application, you must first create a project.

What is a benchmark?

In Validate, a benchmark is a collection of questions with or without responses. The responses represent the ideal answers to the given questions. A benchmark is one way to provide the questions for Validate to use to evaluate your RAG system.

Join the thousands of developers empowered by Tonic

Bundles

Tonic.ai’s solutions, bundled together to offer you savings and comprehensive use case coverage.
Ephemeral + Structural
Modern test data automation
Get secure, realistic fake data to every developer, on-demand and cost-efficiently.
Ephemeral + Subsetting
Targeted data minimization
Get developers the data they need, when and where they need it, with targeted subsets sent directly to on-demand databases.
Textual + Structural
Full coverage data protection
Protect sensitive data in structured and unstructured formats with AI-powered data de-identification and synthesis.
Textual + Validate
Secure LLM Development
Protect sensitive text data for secure LLM training, and monitor internal generative AI tools.
The Tonic Portfolio
The Tonic.ai portfolio is your solution for building data-driven software and AI/ML models. Get all the value of production data without copying sensitive data around your organization, and unlock strategic data assets for use across functions, from engineering to business operations to data science/ML teams. With high-quality data, teams author fewer defects and ship faster, all while maintaining a strong security posture.

Start a free trial