Document Auto Capture

Document auto capture is a component to capture the photo of an identity document in the required quality without the need of the user pressing the shutter button.

In order to successfully detect and classify an identity document from a photo and to achieve high accuracy in optical character recognition (OCR), it is important to capture image with the highest quality possible. The Document Auto Capture functionality is a part of the DOT mobile libraries and web components.

The Document Auto Capture supports taking pictures of ID document cards and single passport pages that have visible corners, have the correct size ratio and are of light color tones.

Document Auto Capture Mobile UI Component

Document Auto Capture UI Component

The Document Auto Capture UI component is provided for easy integration into apps. It provides a camera preview with a rectangular placeholder and text instructions in the middle of the screen. The component continuously looks for an identity document in the preview frames and analyses the image parameters. A text instruction is shown which instructs the user to achieve the conditions in which the position and quality of the image are of sufficient quality. The component in case of success returns a high-quality document image suitable for further data extraction.

Document Auto Capture Web Component

The Document Auto Capture Web component is provided for integration into web frontend. It provides the same functionality as the mobile UI components.

Video Stream Preview Scale Type

There are two available ways how to scale the camera preview with mobile libraries:

  • Fit Center - Scale the preview, maintaining the source aspect ratio, so it is entirely contained within the UI component, and center it inside the view. This may show black rectangles on the sides of the auto capture component area. Recommended scale type for Optical Character Recognition, because it produces the maximum possible output image size.
  • Fill Center - Scale the preview, maintaining the source aspect ratio, so it fills the entire UI component, and center it in the view. This crops Keep in mind, that the output image might be smaller in size than the Fit Center type and therefore might not be optimal for Optical Character Recognition.

The Web component has to be wrapped inside the parent node with defined height and width. The media stream will be resized to fit inside that node maintaining the source aspect ratio.


StateInstruction (default)
Document is not detected in the imageScan document
Document is too smallMove closer
Document does not fit the placeholderFit document into rectangle
Sharpness is too low in the placeholder areaMore light needed
Brightness is too low in the placeholder areaMore light needed
Brightness is too high in the placeholder areaLess light needed
Hotspots are present in the placeholder areaAvoid reflections
Image is good enoughHold still…
The instructions can be localized as follows:

Controlling the process

It is also possible to build your own UI on top of the Document Auto Capture process. The Document Auto Capture Controller non-UI component is designed for this purpose. This component also controls the process in Document Auto Capture UI Component.

The Document Auto Capture Controller continuously accepts image frames (from camera preview), processes them and returns the result for each frame as a callback. The component is configured with an ordered list of validators. A frame should pass all of them to be considered as a valid frame. If the quality is not sufficient, the component returns a Hint according to the first not passed validator. The UI handles the result (e.g. shows a text instruction).

The workflow

  1. The component continuously accepts image frames until there is a defined count of valid frames in a row.
  2. The component enters the Stay still phase which means that the arrangement is good enough and the user should be instructed not to move. This phase lasts for a defined amount of time.
  3. The component selects the best image from all valid images and returns it as the result of the auto capture process.

Customize the frame validation

DOT Mobile Kit libraries contain a predefined list of validators which are used in the default configuration. You can define your own list of validators or you can implement a custom validator and evaluate available image data such as sharpness, brightness or document corner coordinates etc.

In case an own Document Auto Capture solution is preferred, one can leverage the DOT’s document detection and image analysis technology.

Detect document in an image

Document Detector is a state-less non-UI component for identity document detection in an image.

Document Detection

Image requirements

  • The document card edges must be clearly visible and be placed at least 10px inside the image area.
  • Images should not contain other objects or backgrounds with visible edges.

Analyze image parameters

Image Parameters Analyzer is a state-less non-UI component for calculating sharpness, brightness and hotspot presence in an image.