Skip to content
image 12

Automatic Visual Captioning


Speechmark-1

"Description of visual data requires manual human intervention"

 

Automatically extract tacit data from a video or an image to make it structured and available as language.

We use machine learning models to generate text describing the content of a picture or a video stream, which can be further be analyzed by language models to highlight points of interest. Unwanted content can be set to automatically trigger actions.

Skjermbilde 2024-06-25 kl. 09.59.38

 

Possible Applications:

Review and suggest image captions
Flag behavior or situations
Crime detection
Digital visual content WCAG compliance

Are you curious to know more?

Register your interest in the form below, and we'll get in touch!

Copyright © 2023 Norlab AI AS, Norway. All rights reserved. Org #932 305 232