Metrics and groups

From Wiki Kaminari Click
Other languages:

User agent

User agent is a header that a browser, program, or application sends to a server when accessing a website. The user agent typically contains the browser name, its version, the engine on which the browser is based, the operating system and its version. The user agent can also contain the name and model of the mobile device on which it is running and some other information. In other words, it is a browser's business card.

The problem is that this business card is very easy to fake - using extensions or even just in DevTools. Almost any programming language has the ability to pass any information to the user agent when accessing a server. Therefore, user agents should be treated with caution.

Normally, the browser does not lie about itself. There are, of course, exceptions, such as extensions that help change the user agent, or mobile browsers or applications in which an unchanged user agent is embedded because developers are too lazy to change it. But such cases can be ignored - they are either isolated or can be added to the exceptions.

It's another matter when the transmitted information does not match what we learned about the browser as a result of checking, and this pattern prevails on a slice - through the user agent, we see a variety of browsers, but the analysis shows that these are all accesses from the same browser.

Therefore, the user agent is an "unreliable storyteller" and the information we obtain from it should be treated with a healthy dose of skepticism.

Metrics

Metrics are parameters that allow you to answer the question 'how much?' How many visits did the website have? How many of them were made by bots, and how many by real users?

General metrics

Name Description
Hits Total number of checks.
Unique users Number of unique users counted based on IP, user agent and client hints.
Good Number of checks resulting in non-bot, non-suspicious, and non-technical loss traffic.
Tech. losses Number of clicks that we couldn't verify. Reasons include browsers with disabled JS, outdated browsers, etc.

Invalid traffic (IVT)

Invalid traffic consists of bot traffic and scammers traffic.

A bot is a program or automated script that requests web content (including digital advertising) without user involvement. It could be, for example, a browser on an infected computer that is part of a botnet, or a script written by a programmer that scrapes websites for some purpose.

Scammers traffic is traffic generated by real people who are tricked into requesting web content (including digital advertising) without knowing about it.

Name Description
IVT total Number of hits made by bots or caused by scammers.
Crawler Legitimate bots that honestly declare their botness through the user agent. These could be search engine bots indexing the web, social media bots, etc. Despite not having malicious intent, they cannot convert and are not the target audience, so they should be ignored.
Spoofing Bots whose user agent does not match the actual browser version and/or operating system.
Automated Bots using headless browsers or automation tools like Selenium. The original purpose of such solutions is to automate software, website, application testing, etc. Then bot operators realized they could also use them for click fraud, view fraud, etc., and added them to their inventory.
Incorrect requests During the analysis, we found that the impression came from an invisible element. For example, a user clicked on a link, but this click ended up in an invisible iframe that was located on top of the link. So the impression will be counted, but the real user did not see the advertisement and did not know where he was clicking.

Or we found that the impression was not initiated by the user, but by a script without the user’s knowledge..

Bad reputation IP IPs from which only technical losses come. For example, someone set up a server in a data center, and using CURL, or another tool for http requests (and they are found in all programming languages), parses pages, and then clicks on advertising links using the same tool.
Blocked hits Hits that were blocked according to integration settings.

Suspicious visits (SV)

We see something suspicious in the analysis results, but it is not enough to confidently state that it is a bot. Possible reasons include the user having a harmless extension that somehow affects our checks, or the user accessing through a VPN.

Suspicious clicks require an individual approach, for example, to determine where to attribute suspicious traffic based on conversions - to fraudulent or good traffic.

Name Description
SV total Number of suspicious clicks.
Proxy Users accessing through VPN, proxy, or Tor. These are live clicks, but they come from non-targeted geo. If you are buying traffic from a specific geo, and there are a lot of proxies on it, then the traffic is bad.
Suspicion of spoofing We suspect that user agent spoofing techniques are being used in the browser. If this metric is low, there is nothing to worry about. But if its percentage is high and you are not buying any specific mobile traffic, then there is a reason to think about it.
Suspicion of automated We suspect that automation tools are being used in the browser.
Suspicion of fraud Other types of suspicious activity, such as a too-wide screen, can be the result of poor layout or an attempt to hide ads outside the screen.

Feature

Some browser features that we detect.

Name Description
AdBlock Whether AdBlock is installed in the user's browser or not. Note that we only check for the presence of AdBlock; whether it is turned on or off is not checked.
Is mobile Checking by characteristic features (not by user agent) whether the device is mobile (smartphones, tablets) or not (desktops, TVs).
WebView Number of visits with Chrome WebView. Chrome WebView is a greatly reduced functionality version of mobile Chrome that is used in apps to view web pages, as well as many mobile browsers for Android, which are customized WebView. Most WebViews cannot display pushes. Therefore, if there is a high percentage of WebView on push traffic, the traffic may be bad.
IFrame Number of visits made via iframe. <iframe> tag HTML element represents a nested browsing context, embedding another HTML page into the current one.
Push notifications Number of visits from browsers that support push notifications.

Behavioral analysis

Metrics based on the analysis of how the user behaved on the page.

Name Description
Visible page The number of visits when the page received focus, meaning it was open in the foreground rather than in the background tab of the browser.
Active page The number of visits when there was at least minimal activity on the page. Activity is defined as clicks, touches, scrolls, mouse movements.
Accepted page The number of visits when the time from page opening to its closure exceeded 15 seconds.

Groups

Groups are parameters that allow you to answer the question "which?". From which countries were the clicks? Which browsers did users use?

Time

Name Description
Hour Groups statistics by hours.
Day Groups statistics by days.
Week Groups statistics by weeks.
Month Groups statistics by months. The maximum period that can be selected is two months.

Tags

Name Description
Integration Kaminari integration ID.
Bot type Grouping by bot types.
Referer URL of the page from which the user landed on the page.
Sub1..Sub7 Tags set by the client.

Geo

Name Description
Language User's preferred language, i.e. UI language set in the browser settings.
Timezone User's timezone data obtained from IP address.
Country Country data obtained from IP address.
City City data obtained from IP address.
Provider Internet service provider data obtained from IP address.
IP IP without the last octet, i.e. there will always be zeros at the end, for example: 192.168.1.00.

Device

Name Description
Real browser engine All modern browsers are based on three open-source engines:
  1. Blink (Chrome, Chromium, Yandex Browser, Samsung Browser, modern Opera and MS Edge, most Android browsers);
  2. WebKit ((Safari; versions of Chrome, Yandex Browser, Firefox, etc. for iOS; browsers in PlayStation);
  3. Gecko (Firefox).

There are also three old unsupported engines that are still present on the internet in the form of Internet Explorer, in MS Edge up to version 18 and in Opera up to version 12. We can determine all 6 engines by their characteristic features, as well as determine the specific version of the engine. It is the browser engine that we write this data to.

Real OS The current OS without version.
Browser from UA Browser name and version taken from user agent.
OS from UA Operating system and its version taken from user agent.
Device from UA Mobile device manufacturer obtained from user agent.
Device type from UA Device type (desktop, smartphone, smart TV, etc.) obtained from user agent.
Connection type The type of connection that the device uses to connect to the network (Bluetooth, Wi-Fi, mobile internet).

Screen

Name Description
Screen width Screen width in pixels.
Screen height Screen height in pixels.
Screen orientation Screen orientation is relevant for mobile devices - portrait, upside-down, turned left, turned right.
Pixel density How many screen pixels will be used to render one CSS pixel, i.e. this is actually the screen resolution.