Optical Character Recognition Document

AI and ML

Every day, a vast amount of text is written or printed on paper: study-related notes, invoices, periodicals, books, ads, and so on. Paper waste is a major issue in the corporate world and has obvious environmental consequences. Beyond that, relying on physical paper makes it difficult to store large volumes of information or to search them quickly. These issues affect both Saigon Technology and its clients.

Introduction

Recent advances in science and technology, particularly in artificial intelligence, have inspired new ways to address paper pollution, such as an automated system that transfers textual information currently stored on paper to a digital format.

The AI team at Saigon Technology has extensive expertise in Computer Vision and Natural Language Processing. The team set out to create an OCR model, build an end-to-end automated system that transforms an input image into digital text data, and ultimately launch it for use by us and our clients. In the process of going paperless, it can save a lot of time and energy.

Our Approaches

Our goal is to convert text images to text and then process the output text to extract important information. To do that, we apply Deep Learning models from Computer Vision to locate text in a natural image and then recognize the specific words. We split the system into multiple stages, from pre-processing the input image to obtaining the final meaning of the text.
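The staged design described above can be sketched as a chain of callables. This is a minimal illustration, not Saigon Technology's implementation: the names `OcrPipeline`, `TextRegion`, and every stage function are hypothetical, and the placeholder stages stand in for the trained Deep Learning and NLP models so that the sketch runs without any model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

Image = List[List[float]]  # grayscale image as rows of pixel intensities


@dataclass
class TextRegion:
    """A detected text box: top-left corner plus width and height, in pixels."""
    x: int
    y: int
    width: int
    height: int


@dataclass
class OcrPipeline:
    """Chains the four stages; each stage is a plain callable (hypothetical)."""
    preprocess: Callable[[Image], Image]
    detect: Callable[[Image], List[TextRegion]]
    recognize: Callable[[Image, TextRegion], str]
    extract: Callable[[Sequence[str]], Dict[str, str]]

    def run(self, image: Image) -> Dict[str, str]:
        cleaned = self.preprocess(image)            # clean the input image
        regions = self.detect(cleaned)              # locate text regions
        words = [self.recognize(cleaned, r) for r in regions]  # read each region
        return self.extract(words)                  # post-process the text


# Placeholder stages (assumptions) so the sketch runs without a trained model.
def identity_preprocess(image: Image) -> Image:
    return image


def whole_image_detector(image: Image) -> List[TextRegion]:
    height = len(image)
    width = len(image[0]) if image else 0
    return [TextRegion(0, 0, width, height)]


def dummy_recognizer(image: Image, region: TextRegion) -> str:
    return f"text@{region.x},{region.y}"


def join_words(words: Sequence[str]) -> Dict[str, str]:
    return {"text": " ".join(words)}


pipeline = OcrPipeline(identity_preprocess, whole_image_detector,
                       dummy_recognizer, join_words)
result = pipeline.run([[0.0, 1.0], [1.0, 0.0]])
print(result)  # {'text': 'text@0,0'}
```

Keeping each stage behind a plain function signature means a real detector or recognizer could replace a placeholder without changing the rest of the chain.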

First, the system receives a text image or printed image as input. This input is cleaned, or pre-processed, with methods such as image quality enhancement, blur and noise removal, and normalization. Next, Deep Learning models detect the text regions in the cleaned image and recognize each region as specific words, which yields the output text data. Finally, an NLP model cleans the text data again to make it meaningful and extracts the necessary information from it.
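The first and last stages of this flow can be illustrated without a trained model. The sketch below is only an assumption about what such steps might look like: min-max normalization and a 3x3 median filter stand in for the pre-processing methods, and simple regular expressions stand in for the NLP model, which they do not replace. All function names, the date format, and the "Total" label are hypothetical.

```python
import re
from statistics import median
from typing import Dict, List

Image = List[List[float]]  # grayscale image as rows of pixel intensities


def normalize(image: Image) -> Image:
    """Min-max scale pixel intensities to the range [0, 1]."""
    flat = [p for row in image for p in row]
    low, high = min(flat), max(flat)
    if high == low:  # flat image: avoid dividing by zero
        return [[0.0 for _ in row] for row in image]
    span = high - low
    return [[(p - low) / span for p in row] for row in image]


def median_filter_3x3(image: Image) -> Image:
    """Replace each pixel with the median of its 3x3 neighbourhood.

    Coordinates outside the image are clamped to the nearest edge pixel.
    """
    height, width = len(image), len(image[0])
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            window = [
                image[min(max(y + dy, 0), height - 1)][min(max(x + dx, 0), width - 1)]
                for dy in (-1, 0, 1)
                for dx in (-1, 0, 1)
            ]
            row.append(median(window))
        out.append(row)
    return out


def clean_text(raw: str) -> str:
    """Collapse runs of whitespace left over from recognition."""
    return re.sub(r"\s+", " ", raw).strip()


def extract_fields(text: str) -> Dict[str, str]:
    """Pull a date and a total from invoice-like text (rule-based stand-in)."""
    fields: Dict[str, str] = {}
    date = re.search(r"\b(\d{4}-\d{2}-\d{2})\b", text)
    total = re.search(r"\btotal\b\s*:?\s*(\d+(?:\.\d{2})?)", text, re.IGNORECASE)
    if date:
        fields["date"] = date.group(1)
    if total:
        fields["total"] = total.group(1)
    return fields


# A single bright "salt" pixel is removed by the median filter.
noisy = [[10, 10, 10], [10, 255, 10], [10, 10, 10]]
denoised = median_filter_3x3(normalize(noisy))

# Invented sample text, used only to exercise the extraction step.
fields = extract_fields(clean_text("Invoice  date: 2024-01-15\nTotal:   120.50"))
```

Here `denoised` is an all-zero 3x3 image and `fields` holds the date `2024-01-15` and the total `120.50`. A production system would use image libraries and learned models in place of these hand-written steps.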

Usage

Step 01

Open the Optical Character Recognition site at https://experiment.saigontechnology.vn/invoice/ or https://experiment.saigontechnology.vn/cvparser. Alternatively, open the main Saigon Technology AI Research Lab page at https://experiment.saigontechnology.vn/, select the Optical Character Recognition section, and click the Try our demo button.

Step 02

On the Optical Character Recognition page, click the Browse files button to start.

Step 03

Choose the image file you want to process (.png, .jpg, or another image format).

Step 04

After the chosen image is uploaded, click the Run button to run the OCR model.

Step 05

The output of the OCR model is drawn directly on the image.

Step 06

Scroll down to see the output text of the OCR model.
