Automatic Visual Captioning

"Description of visual data requires manual human intervention"

Automatically extract tacit data from a video or an image to make it structured and available as language.

We use machine learning models to generate text describing the content of a picture or a video stream, which can be further be analyzed by language models to highlight points of interest. Unwanted content can be set to automatically trigger actions.

Possible Applications:

Review and suggest image captions

Flag behavior or situations

Crime detection

Digital visual content WCAG compliance

Are you curious to know more?

Register your interest in the form below, and we'll get in touch!

contact@nor.business

Privacy Policy

What we do

Who we are

In the news

Contact us

contact@norlab.ai

Privacy Policy