Integromat
  • Back to website
Articles in this section
  • Formstack
  • Magento 2
  • Google Cloud Vision
  • ManyChat
  • Airtable
  • Bubble
  • Email (legacy)
  • Facebook Groups
  • Google Sheets (legacy)
  • HTTP
See more
  1. Integromat Support
  2. Apps
  3. Docs

Text Parser

Integromat Support Team Integromat Support Team Updated March 31, 2021 18:13

Getting Started with Text Parser

Transformers

The module's dialog field names that are displayed in bold (in the Integromat scenario, not in this documentation article) are mandatory!

Get Elements from HTML

Retrieves the desired elements from an HTML code.

Continue the execution of the route even if the module returns no results If enabled, the scenario will not be stopped by this module.
Element type Select the type of element you want to retrieve from the HTML code. Image, link, or iframe element(s).
HTML Enter the HTML code you want to retrieve the specified element types from.

HTML to Text

HTML Enter the HTML code you want to convert to plain text.
Line break Select the type newline (line break). Detailed information about the line break can be found on Wikipedia.
Uppercase headings
The text enclosed in the heading tags (e.g., <h2> </h2>) will be converted into uppercase text.

Match Pattern

The Match pattern module enables you to find and extract string elements matching a search pattern from a given text. The search pattern is a regular expression (aka regex or regexp), which is a sequence of characters in which each character is either a metacharacter, having a special meaning, or a regular character that has a literal meaning.

  • The complete list of metacharacters can be found on the MDN web docs website.
  • For a tutorial on how to create regular expressions, we recommend the RegexOne website.
  • For an easy, quick regex generator, try the Regular Expressions generator.
  • For experimenting with regular expressions, we recommend the regular expressions 101 website. Just make sure to tick the ECMAScript (JavaScript) FLAVOR in the left panel:

2019-01-25_11_27_13-Window.png

Pattern

Enter the regular expression pattern. For example, [+-]?(\d+(\.\d+)?|\.\d+)([eE][+-]?\d+)? extracts all numerals in the provided text.

The pattern will contain at least one capture group in parenthesis () for the output bundle to contain some items. If the pattern does not contain any capture groups, the output bundle will be empty:
mceclip0.png 
Global match If enabled, then the module retrieves all matches in the text. If disabled, then the module retrieves only the first entry.
Case sensitive You can disable the case sensitivity by disabling this option (default=case sensitive).
Multiline If checked, beginning and end metacharacters (^ and $) will match the beginning or end of each line, not just the very beginning or end of the whole input string.
Continue the execution of the route even if the module returns no results If enabled, the scenario will not be stopped by this module.
Text Enter the text you want to match the pattern.

Replace

Searches the entered text for a specified value or regular expression, and replaces the result with the new value.

Pattern Enter the search term. You can also use a regular expression. For more details about the regular expression, refer to the Match Pattern module.
New value Enter a value that will replace the search term.
Global Match If this option is enabled, the module will find all matches rather than stopping after the first match. Each match will be output in a separate bundle.
Case sensitive If this option is enabled, the search will be case sensitive.
Multiline If checked, the beginning and end metacharacters (^ and $) will match the beginning or end of each line, not just the very beginning or end of the whole input string.
Text Enter the text to be searched.

Data Scraping

Data scraping, sometimes called web scraping, data extraction, or web harvesting is simply the process of collecting data from websites and storing it in your local database or spreadsheets. If you wish to scrape data from a website and you are not familiar with regular expressions, you may use a data scraping tool:

  • Apify is an excellent tool, and we already have it integrated
  • Best Data Scraping Tools for 2019
  • Web Data Extractors 2019
  • Email/Text Parser visual tool (request)

If the data scraping tool provides a REST API, you can connect to it via our universal HTTP and Webhooks modules. You can also implement an app on your own using the Integromat App SDK.

Was this article helpful?
17 out of 90 found this helpful
Menu
  • Features
  • Apps & Services
  • Templates
  • Pricing
  • What's new
  • Use cases
  • Feature requests
Information
  • Privacy
  • Terms
  • FAQ
  • Help & Tutorials
  • Community
  • Academy
  • Find a partner
  • Blog
Cooperation
  • Developer platform
  • Affiliate Program
  • Become a partner
  • Schools and Universities
Company
  • About the company
  • We are hiring
  • Contact
Follow us
© 2021 Integromat by Celonis. All rights reserved. Integromat® is a registered trademark.
  • English