AI PoC: Video Transcription and Summarization Tool Built with Azure Services
Custom development of an AI-powered video transcription and summarization solution with a flexible architecture
ABOUT
the project
Client:
Location:

Ukraine
Company Size:
Industry:
Solution:
Services:
PoC Development
The Leobit team developed a .NET-based proof of concept (PoC) of a software solution that provides video transcription, storage, and AI-powered summarization. Our software development specialists leveraged API endpoints to integrate the app with Azure services that provide the above-mentioned functionality.
We experimented with an architectural approach to deliver an efficient AI-powered solution for video transcription and summarization. Its multi-layered architecture based on API integrations with Azurer’s service allowed us to build a solution in the short term and ensure its flexibility, which is essential for its forthcoming expansion.

Customer
We at Leobit decided to build this solution as our internal tool. It allowed us to experiment with an innovative architectural approach and fully utilize the vast potential of various Azure cloud services. In addition, this solution can be used in our internal workflows and can be especially useful to our marketing team.
Business Challenge
We had to come up with the most time- and resource-efficient approach in order to build a flexible and ready for future updates PoC in the shortest terms possible.
Project
in detail
While our primary goal was developing a PoC, we still needed to break the software development cycle in several critical stages.



Efficient and Flexible Architecture
We created an efficient layered architecture that utilizes APIs to connect the app to Azure services that power its core functionality. A custom API controller connects the solution to the service layer involving such services as Azure Blob storage, used for efficient video upload, Azure Speech Service, responsible for speech recognition, and Azrue OpenAI Service which covers the generation of video summaries, etc.
This approach to the concept's development ensures great flexibility and efficiency. The solution fully leverages the capacities of Azure's cloud services and can be expanded further with new functionality.

Comprehensive Functionality for Processing Videos
The solution provides a rich set of features for storing, editing, and managing videos. It involves functionality for:
– Uploading video files.
– Playing the video back with standard player controls.
– Transcribing the speech automatically with speaker recognition in videos.
– Processing transcription on background with progress displayed in real time.
– Generating summaries of transcriptions.
– Finding particular moments in videos by clicking on transcription segments.
– Automatically generating thumbnails for a video.
– Storing videos, transcriptions, and summaries.
– Sharing them via unique links.
As has already been mentioned, the PoC has a rich potential for continuous expansion of its functionality.

A Functional App PoC Built with .NET and Angular
We leveraged our .NET expertise to create an efficient app back end that seamlessly connects with our API Controller. We also created a functional interface allowing users to test the PoC's core features. Our specialists used Angular to build an app's front end, which provides features for uploading, viewing, transcribing, and summarizing video content.
Explore
The solution prototype
See a PoC that showcases how automated case intake, role-based dashboards, and workflow management work in practice for civil and criminal cases.
Technology Solutions
- Flexible layered architecture built according to the principles of the Onion architectural pattern.
- Custom API controller connecting the app with a variety of Azure’s services, including AI tools.
- Access to the secure and well-organized Azure Cosmos DB database.
- Azure Blob Storage for fast video uploads and convenient storage.
- Integration with the Azure Speech Service for transcribing videos and speaker recognition.
- Integration with Azure OpenAI for AI-powered video summarization.
Value Delivered
- An innovative and highly efficient architectural approach to software architecture based on the integrations with Azure’s services.
- Great flexibility and potential for continuous improvement of the PoC into a full-fledged product.
- A fully-usable PoC that verifies our innovative architectural concept; built within less than two weeks.