Log in
Fixed an issue with searching for a user to grant permission set access to.
For the Date of Birth entity type, when you choose to synthesize values, you can now select whether to use the Date Shift Generator, which was previously the only option, or the Date Truncation Generator. For the Date Truncation Generator, Textual always sets the month and day to January 1. If the original year is less than 90 years ago, then Textual keeps the original year. If the original year is 90 or more years ago, then Textual sets the year to the current year minus 89.
For the Age entity type, when you choose to synthesize values, you can now select whether to use the Age shift generator, which was previously the only option, or the Passthrough or group age generator. For Passthrough or group age, if the original age is less than 90, then Textual keeps the original age. If the original age is 90 or older, then Textual sets the age to 90+. This is to meet HIPAA Safe Harbor requirements.
For CSV files in a dataset, from the file preview, you can now configure entity type handling for entire columns. A column might be structured and sensitive, structured but not sensitive, or unstructured. Structured columns contain a single consistent type of value, such as a name, date, or number. For unstructured columns, such as a description or notes, the values can contain multiple entities of different types, and must be scanned individually. Note that this only applies to CSV files that are added to a dataset after this release. Existing CSV files are not affected.
Fixed HTML replacements so that inline breaks such as `
` do not pull adjacent text into phone and email replacement values.
Fixed HTML report mappings so that spanning HTML replacements keep their surrounding markup in report values.
Added an optional phone metadata flag to preserve US prefixes such as `(855)` or `1 (234)` during synthesis.
Added a new endpoint to retrieve dataset entity mappings grouped by file. From the Python SDK, to retrieve the entity mappings, use Dataset.get_entity_mappings(). The results include original, redacted, synthetic, and final output values. When available, the results also include tabular row, column, and score metadata. The results respect dataset entity type handling configuration. When none of the entity types are active, it can return empty per-file entity lists.