Skip to main content

Captcha Recognition

🔗 Original page — Source of this material


Captcha Recognition

Description

This action is used for automatic captcha solving via ❗→ services or ❗→ manually.

A captcha (from CAPTCHA — Completely Automated Public Turing test to tell Computers and Humans Apart) is a computer test used to determine whether the user is human or a bot.

Some types of captchas
Some types of captchas

How to add the action to your project?

There are several ways to add this action to your project.

Via the BROWSER context menu

To add the captcha recognition action using the ❗→ browserProjectMaker context menu, right-click on the image on the site and choose This is a captcha!

image-20200819-180955

Once the action is added, the ❗→ manual captcha recognition window will immediately pop up, which you can close for now and move on to configuring the action.

Via the PROJECT context menu

Add actionTabsRecognize captcha

image-20200819-175851

The downside to this method is that you first need to download the image to your computer and then specify the file path in the action.

Or you can use the ❗→ smart search.

Captcha recognition action settings

Main

image-20210531-140331

Recognition module

Choose the module (captcha service) through which the captcha will be recognized.

From the dropdown list, select the desired captcha recognition service (you’ll need to specify its ❗→ API key in the settings first). By default, MonkeyEnter.dll is used — ❗→ manual input.

Keep in mind

You can use project variables in this field.

Settings

Clicking the Settings button opens the program settings, in the ❗→ captcha service tab.

Before you can interact with an element on a page, you have to find it. In the actions ❗→ Get value , ❗→ Set value , ❗→ Fire event , ❗→ Touch event , ❗→ Swipe event, there are two ways of finding elements - classic and XPath.

Classic - Search by HTML element parameters: tag, attribute, and its value.

image-20200805-202115

XPath - search using ❗→ XPath expressions. With this, you can implement a more robust way of finding data that withstands layout changes compared to the classic search or regex.

image-20200805-202209

Which tab

Select the tab in which the element will be searched. Possible values:

  • Active tab
  • First
  • By name - when selected, an input field will appear to enter the tab name.
  • By number - the input field here should contain the tab’s number (count starts from zero!)

Document

It’s recommended to set this value to -1 (search in all documents on the page).

Form

Also better to set as -1 (search all forms on the page). This makes your template more universal.

Why set "-1"?

Example: there are 3 forms on a page — search, registration, and order. You need to click a button in the order form, so you choose “Form” value 2 (index starts from zero). After some time, a new login form is added before the order form, so number 2 will now be the login form — your template will either throw an error (button not found), or worse, click the wrong button.

Keep in mind

In the program’s settings, you can tick two checkboxes — Search in all forms on the page and Search in all documents on the page. This means whenever an element is added to Action Builder, -1 is used for document and form numbers by default.

Tag (classic search only)

image-20210525-095347

The HTML tag you want to get the value from.

Tip

You can specify several tags, separated by ; (semicolon)

Conditions (classic search only)

image-20210525-100053

  1. Group - priority of this condition. The higher the number, the lower the priority. If the element can’t be found using the condition with the highest priority, it moves to the next, and so on, until the element is found or the conditions run out. You can add multiple conditions with the same priority – in that case, search is done across all of them simultaneously.

  2. Attribute — the HTML tag attribute to search by.

  3. Search type:

  4. text - search for full or partial text match;

  5. notext - search for elements not containing the specified text;

  6. regexp - using ❗→ regular expressions. By default, the search is case-insensitive. To make regex case-sensitive, add (?-i) to the start of your pattern (turns off case insensitivity).

  7. Value - value of the HTML tag attribute.

  8. # of match - sequence number of the found element (count starts from zero!). You can ❗→ use ranges and ❗→ variable macros here.

Keep in mind

To delete a search condition, left-click the field to its left (blue highlighted in the screenshot), then press Delete on your keyboard.

Keep in mind

You can use several conditions to search for an element.

It’s always important to try and choose your search conditions so only one element remains, i.e., match number 0 (count starts from zero).

Save the result to a variable

The recognition result will be saved into the specified ❗→ project variable.

Additional

image-20200819-184415

Waiting

image-20200819-184609

Wait before execution If you specify positive numbers in the *FROM and *TO fields, the action will pause for a random amount of time within that range before starting.

Wait for element up to If after this time the element is not found, the action will exit via the red branch (with an error).

Module parameters

image-20200819-185149

Here you can specify additional parameters (options) for solving the captcha — case sensitivity, only Cyrillic letters, math captcha, multiple words, etc.

Format: parameter_name=parameter_value — separate multiple parameters with & (ampersand)

Example (based on the RuCaptcha API): phrase=1&numeric=2&regsense=1

  • captcha consists of two or more words, only letters, case sensitive

image-20200819-193201

Sample parameters from different services
Warning

Additional parameters and their possible values are specific to each service.

Here are a few examples based on two popular captcha solving services.

RuCaptcha - go to the API documentation page https://rucaptcha.com/api-rucaptcha#solving_normal_captcha and scroll down to find a table with parameters you can specify

here are just some of the possible parameters
here are just some of the possible parameters

Anti-Captcha - on the ❗→ documentation page for solving simple text captchas, there’s also a table of allowed parameters

image-20200819-191315

Even just between these two services and just a small fraction of their parameters, you’ll see that

  • some parameters mean the same thing but are named differently (case sensitivity — *case and *regsense)
  • others have the same name and meaning, but accept different types of values (*phrase)
  • some parameters fully match by name, purpose, and accepted values, but one service allows more options than another (*numeric)

Be extremely careful when using additional parameters if you are creating a project for multiple captcha services.

Captcha parameters

image-20200820-181121

Scale This option lets you reduce or increase the size of the captcha image you send.

Combine captchas Sometimes a captcha consists of several images, so you can combine them to avoid paying for each piece separately. To combine captchas (if you didn’t do it when recording your template), set the “Combine captchas” flag in the properties window for the first captcha element. Then right-click on the next element, and a new context menu item will appear: Attach to captcha.

image-20200820-193749

Each click creates a new action; for the last one, the Last captcha checkbox will be set (for previous ones, this checkbox is unchecked).

Async recognition

image-20200820-181817

This setting allows you not to wait for a service response and continue executing your template.

Enabling this option creates a new action: Wait for captcha recognition. It does not have any settings, just a To start of recognition button, which will take you back to the main action (very handy if your template’s actions are on opposite sides of the workspace). The main action has a back button — To end of recognition.

image-20200820-182040

Once the template reaches the main Captcha Recognition action, it will send the captcha to the service and keep working until it hits the Waiting... action, where it will pause until the service’s response is received. Afterward, you can use the variables you specified in the main action.

Complaint URL

image-20200820-182859

Captchas are solved by humans on the service, and people, as we know, can make mistakes. Sometimes workers are inattentive or misread the task and, instead of writing the answer to an expression 3+88=? , just write down the expression itself, even though the settings asked them to solve the math task.

For such cases, you can use this setting — if a captcha was solved incorrectly, send a request to this URL to complain about this specific recognition and the service will refund you.

Warning

Please don’t abuse this feature and only use it when a worker actually made a mistake and really solved the captcha incorrectly. If you complain and refund for correctly solved captchas, you will get banned very quickly.

Saving

image-20200820-185127

These options let you save the captcha image and answer to the specified folder.

  • Folder - the directory where images will be saved (you can use ❗→ variables)

  • Answers - where to save the answers to captchas:

  • In the filename - handy, but may not always work since captchas might contain characters that cannot be used in Windows filenames.

image-20200820-190103

  • To a file - in this case the captcha image will be saved as *captcha(X).png in the chosen directory, where X is the captcha number. A file captcha(X).txt will also be created containing the answer. Here, filename restrictions won’t be a problem.
  • Ignore “sorry” answer - for some errors, the Recognize captcha action returns sorry instead of a captcha answer. If this option is enabled, the program will not save captchas with such a response.
Where this can be useful:
  • if you want to create your own module for the CapMonster Cloud service.
  • when using ❗→ CapMonster 2 (a program for automatic captcha recognition) — this software supports many captchas out of the box, but there are some for which you need to ❗→ create modules yourself. To create a module, you need a base of correctly solved captchas — that’s where these action options come in: solve captchas (manually or via services), save the captchas and answers, and then train CapMonster 2 on them.

More information

Text captchas

Quite often, especially on poorly protected resources, you’ll encounter text captchas. Unlike an ordinary (graphic) captcha, these aren’t drawn on an image — they’re just text. Such captchas don’t need to be sent anywhere, you can just extract (parse) them directly from the page text. To do this, get the page text via the ❗→ Data action, select the page text, check “parse result”, and enter a ❗→ regular expression in the parameters for parsing the page.

Math captcha

You might also encounter math text captchas. These are just like text captchas, except they usually show a math expression like 58+63. You could turn this text into an image and send it for recognition, or you can use JavaScript. To solve such a captcha, you can use the ❗→ JavaScript action from the “Your code” category. In the code field, insert a link to the variable that contains the parsed expression (like 58+63), and after execution, the action will return the result (121).

Flash captcha or captcha from any other element

If you come across a flash captcha, you can render it as a regular image and also send it for recognition. Find this element in the ❗→ elements tree, right-click to open the action menu, and choose “This is a captcha” — that’s it!

How to handle CAPTCHA recognition errors

Warning

The video was recorded for an older version of ZennoPoster, but the error handling algorithm remains the same and is not dependent on the program version.

How to take a browser screenshot using the Recognize Captcha action?

Sometimes there’s a need to take a screenshot of a specific HTML element or the whole site (even parts outside the visible area).

Warning

If you only need a screenshot of the browser window (the visible part of the site), it’s better to use the Image Processing action.

To do this:

  • add the Recognize Captcha action to your project (make sure to do it via the browser context menu Warning, i.e., right-click any image on the site)
  • for the recognition module, select CaptchaSaver.dll
  • specify the search criteria for the element you want to screenshot
  • on the Additional tab, in the Module Parameters field, specify the full path for saving the image (you can use ❗→ variable macros)
Example action settings to screenshot the entire site

image-20210531-144132

image-20210531-144213


Example usage

Typical case

  • Right-click (RMB) on the captcha image and choose This is a captcha! from the context menu

  • right after adding this action, the ❗→ manual recognition window will pop up — you can close it

  • select the desired recognition module (by default, MonkeyEnter.dll - manual input)

  • make sure you’ve entered the API key in the ❗→ settings and funded your service account

  • Right-click in the input field where the captcha answer should go and select Field for captcha recognition result to add another ❗→ action for inputting the captcha answer (recording must be enabled in the project).

  • Or, you can find the field manually using the ❗→ Action Builder and input the answer using a ❗→ Set value action.

Combining

For this example, we’ll use a page with the following content:

Source code of the test page.
<!DOCTYPE html>
<html>
<head>
<title>CAPTCHA Test</title>
</head>
<body>
<img src="" id="1">
<img src="" id="2">
<img src="" id="3">
<img src="" id="4">
</body>
</html>

Each symbol is a separate HTML element. Right-click the first image — This is a captcha!, select Combine captchas in the settings, then right-click the other images and select Attach to captcha from their context menus. In the end, you should have four actions:

image-20200820-200754

When you run the project, the first three actions will just collect the images and merge them, and only the last one will add the final part and send it to the service for full captcha recognition.

Additional parameters when sending

Let’s say you get a captcha like this:

image-20200820-201350

It consists of separate pieces and you need to write the result of the expression (in this case — addition).

Source code of the page with the captcha
<!DOCTYPE html>
<html>
<head>
<title>CAPTCHA Test</title>
</head>
<body>
<img src="" id="2">
<img src="" id="1">
<img src="" id="3">
<img src="" id="4">
</body>
</html>

First, merge all the images into one. Then, for the final action, select the required service (in this example, RuCaptcha) and in Parameters under Additional, specify that a math operation must be performed (for RuCaptcha: ```calc=1`)

image-20200820-202611