Vision Language Models (VLMs)

What are vision language models (VLMs)?

Vision language models (VLMs) are AI models that combine visual input, such as images or video, with language processing. They are accelerating a "new era of robotics" by enabling robots to understand natural language instructions in the visual context of their surroundings.
