AI-Based Image Analysis Solution

Custom development of an AI-powered image processing tool with object detection, captioning, and tagging functionality.

ABOUT the project

Client:: Leobit's Internal Project
Location:: USA
Company Size:: 100+ Employees
Industry:: Information Technology
Solution:: Custom Software Development

Services:

AI/LLM Development

Custom Software Development

Software Architecture Design

Technologies:

.NET

Angular

Azure AI Vision Image Analysis

Azure Blob Storage

Fabric.js

TailwindCSS

An AI-powered image analysis solution for intelligent image processing. It can automatically extract objects from images, generate image captions, tag objects, and process large volumes of unstructured visual data.

The solution uses the functionality of Azure Computer Vision enhanced with custom code, as well as convenient and interactive user interface built with Angular and TailwindCSS. It holds a significant potential for integration and implementation across different industries, ranging from healthcare to financial services.

The solution holds a significant potential and can be applied for processing unstructured visual data across industries. It facilitates data categorization, helps detect objects or persons in images, and provides features for digitizing documents. The tool and its functionality can be enhnaced to achieve greater precision and make it suitable for various domains.

Yurii Shunkin

R&D Director at Leobit

Customer

It was our internal project aimed at testing the capabilities of Azure Computer Vision and empowering our actual and potential customers.

Business Challenge

Processing large volumes of unstructured visual data from images is a common challenge across various industries, ranging from healthcare to education. The Leobit team decided to build a solution to this problem while exploring the capabilities of Azure Computer Vision.In particular, searching for different types of materials across datasets may be a challenging task due to issues with image categorization. The Leobit team decided to build a solution that responds to the challenge.

Project
in detail

While the project was completed in the short term, it still involved several major stages, namely planning, configuration of Azure AI Vision Image Analysis, and the development of the app powered with AI functionality.

Project planning

Our R&D Department specialists had an idea of building a smart image processing solution based on a custom Azure AI Vision Image Analysis configuration. We came up with a vision of a simple yet convenient UI providing the users with access to AI-powered image processing functionality.

Azure Computer Vision configuration

Our specialists configured Azure AI Vision Image Analysis to detect objects and mark their coordinates on images. The service also leverages OCR algorithms to recognize and extract text patterns, while tagging algorithms generate descriptive image captions that identify both object categories and their particular properties.

UI and back-end development

To ensure users can easily apply the solution’s functionality, we developed an Angular-based user interface. Our team used TailwindCSS for styling and Fabric.js for canvas rendering, enabling users to view bounding boxes of identified objects on the image.

Results

An early version of an AI-powered solution for image analysis developed in around a week.
Smart image processing with content detection and captioning capabilities.
Extensive opportunities for continuous optimization of the solution and its integration with software systems across various industries.

AI-Based Image Analysis

Our solution applies custom Azure AI Vision Image Analysis configurations to process images with high accuracy. It efficiently detects objects, providing coordinates for bounding boxes that are automatically generated with a custom algorithm based on Fabric.js. Azure AI vision Image Analysis also provides capabilities for AI-powered image classification and processing that support efficient and precise image tagging.

The tool also uses Azure AI Vision’s optical character recognition (OCR) capabilities to enable fast and accurate document digitization. Additionally, its algorithms can be fine-tuned to improve precision, efficiently process larger volumes of unstructured visual data, and ensure more accurate object detection using bounding boxes aligned with object contours.

Processing of Large Volumes of Unstructured Data

The solution provides advanced functionality for searching through unstructured data, which allows users to locate particular objects, persons, or animals. These capabilities can enhance various tasks, such as creating presentations, enhancing media content, or streamlining content discovery. Our tool also supports image classification and grouping, such as organizing photos by the individuals depicted.

With such functionality, teams can collect datasets for powering machine learning tasks, building recommendation engines, and developing systems that find similar content. It is worth noting that we at Leobit use the solution internally, evolving from basic data store searches to a more intelligent, data-driven approach.

Simple and Convenient UI

To ensure usability of the solution, we created a custom UI powered with Angular and TailwindCSS for styling. Our specialists used Fabric.js to ensure interactive image rendering with highlighted boundaries of objects identified and marked by Azure AI Vision Image Analysis. To connect the given service with the app’s UI, we used .NET as a back-end technology. All the processed images are temporarily stored in the Azure Blob Storage.

Automated Image Tagging and Caption Generation

Automated Image Tagging for Detected Objects

The solution automatically generates tags with concise descriptions of detected objects, including their classification and specific properties. In addition, it can generate simple yet accurate descriptive captions. The tool also assigns confidence scores to the objects it manages to identify across images. Therefore, even if a solution fails to identify an object with a 100% confidence, it leaves notes that simplify human human review of an image.

Explore
The solution prototype

Explore a PoC image analysis platform that can be integrated into custom applications across industries, providing visual data processing support for daily operations, compliance, and service quality.

Explore demo

Technology Solutions

Custom Azure AI Vision Image Analysis configurations for an AI-powered image processing.
A user-friendly interface that seamlessly integrates with Azure AI Vision, providing bounding boxes and descriptive tags for objects identified on the image, as well as OCR capabilities for capturing text on images.
Cloud-based architecture based on .NET, Azure AI Vision, and Azure Blob Storage.

Value Delivered

An efficient tool for processing large volumes of unstructured visual data.
The tool that can be applied across various industries, ranging from healthcare to education.
Rich possibilities for continuous optimization and training for an image processing model.
Optical Character Recognition (OCR) features for efficient digitization of documents and other text on images.