Clean And Extract Text

These transformation types are useful when the value needs text cleanup rather than numeric or date parsing.

Use this when the text is correct but the letter casing is not.

Field used:

  • Text Case

Choices:

  • Uppercase
  • Lowercase
  • Title Case

Examples:

| Before | Rule | After |
| --- | --- | --- |
| invoice total | Uppercase | INVOICE TOTAL |
| CUSTOMER CODE | Lowercase | customer code |
| main office west | Title Case | Main Office West |
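
The three choices can be approximated outside the tool with a short Python sketch. The function name `apply_text_case` is hypothetical, and the Title Case rule shown here (capitalize the first letter of each space-separated word) is an assumption; the tool may treat punctuation or abbreviations differently.

```python
def apply_text_case(value: str, choice: str) -> str:
    """Hypothetical helper mirroring the three Text Case choices."""
    if choice == "Uppercase":
        return value.upper()
    if choice == "Lowercase":
        return value.lower()
    if choice == "Title Case":
        # Assumption: capitalize the first letter of each space-separated word.
        words = value.split(" ")
        return " ".join(w[:1].upper() + w[1:].lower() for w in words)
    raise ValueError(f"unknown Text Case choice: {choice}")


print(apply_text_case("invoice total", "Uppercase"))
print(apply_text_case("CUSTOMER CODE", "Lowercase"))
print(apply_text_case("main office west", "Title Case"))
```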

Use this when you want a simple find-and-replace.

Fields used:

  • Find Text
  • Replace With

What it does:

  • Replaces every matching occurrence.
  • Matches text exactly.
  • Is case-sensitive.
  • If Replace With is blank, the found text is removed.

Examples:

| Before | Rule | After |
| --- | --- | --- |
| ORD-10452 | Find ORD-, Replace with blank | 10452 |
| Total: $1,500.00 | Find $, Replace with blank | Total: 1,500.00 |
| Ref_2024_01 | Find _, Replace with - | Ref-2024-01 |
| abc ABC abc | Find abc, Replace with X | X ABC X |
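
The behavior described above (exact, case-sensitive, every occurrence, blank replacement removes the text) matches Python's built-in `str.replace`, so a rough sketch looks like this. The name `find_and_replace` is hypothetical, and leaving the value unchanged when Find Text is empty is an assumption rather than documented behavior.

```python
def find_and_replace(value: str, find_text: str, replace_with: str = "") -> str:
    """Hypothetical helper: literal, case-sensitive replace of every occurrence."""
    if not find_text:
        # Assumption: an empty Find Text leaves the value unchanged.
        return value
    # str.replace matches literally (no regex) and is case-sensitive.
    return value.replace(find_text, replace_with)


print(find_and_replace("ORD-10452", "ORD-"))
print(find_and_replace("Ref_2024_01", "_", "-"))
print(find_and_replace("abc ABC abc", "abc", "X"))
```

In the last call, the uppercase ABC is left alone because the match is case-sensitive.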

Use this when the value contains extra text and you only want the part that matches a pattern.

Fields used:

  • Regex Pattern
  • Regex Group

What it does:

  • Searches the value using the pattern you enter.
  • Returns the first text match it finds.
  • If nothing matches, the original value stays unchanged.

Think of this as: “pull out the first piece of text that matches my rule.”

Examples:

| Before | Rule | After |
| --- | --- | --- |
| Reference: ORD-10452 | Pattern ORD-\d+ | ORD-10452 |
| Invoice 8841 dated 2024-12-25 | Pattern \d{4}-\d{2}-\d{2} | 2024-12-25 |
| Amount due: 1,234.56 USD | Pattern [\d,.]+ | 1,234.56 |
| Customer code C-10025 / batch 77 | Pattern C-\d+ | C-10025 |
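
To try a pattern before entering it, the extraction rule can be sketched with Python's `re` module. This is an approximation: the name `regex_extract` is hypothetical, and the tool's regex engine may differ from Python's in details, although the simple patterns on this page behave the same way.

```python
import re


def regex_extract(value: str, pattern: str) -> str:
    """Hypothetical helper: return the first overall match, else the original value."""
    match = re.search(pattern, value)
    if match is None:
        # Nothing matched, so the original value stays unchanged.
        return value
    # group(0) is the whole matched text, regardless of any capture groups.
    return match.group(0)


print(regex_extract("Reference: ORD-10452", r"ORD-\d+"))
print(regex_extract("Invoice 8841 dated 2024-12-25", r"\d{4}-\d{2}-\d{2}"))
print(regex_extract("No code here", r"ORD-\d+"))
```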

Pattern ideas:

| Goal | Pattern | What it matches |
| --- | --- | --- |
| Pull an order code | ORD-\d+ | ORD-10452 |
| Pull an invoice date | \d{4}-\d{2}-\d{2} | 2024-12-25 |
| Pull the first number with separators | [\d,.]+ | 1,234.56 |
| Pull a code with a letter prefix | [A-Z]-\d+ | C-10025 |

Good fit:

  • The value contains the part you want plus extra words.
  • You need the first matching code, date, or amount.

Poor fit:

  • You need several different clean-up steps in a row.
  • You need capture-group style extraction behavior.

The transformation card shows a Regex Group field, but the regex transformation returns the first overall match. The group number is not currently used when the value is transformed.
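
The difference between the overall match and a capture group can be seen in a small Python illustration. It uses Python's `re` module as a stand-in for the tool's regex engine, which is an assumption; the variable names are illustrative only.

```python
import re

value = "Reference: ORD-10452"
match = re.search(r"ORD-(\d+)", value)

# The overall match: this is what the transformation returns.
overall_match = match.group(0)

# Capture group 1: what group-style extraction would return.
# The Regex Group field is not currently used, so this is not what you get.
group_one = match.group(1)

print(overall_match)  # ORD-10452
print(group_one)      # 10452
```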