ViewTube

ViewTube
Sign inSign upSubscriptions
Filters

Upload date

Type

Duration

Sort by

Features

Reset

5,019,731 results

Related queries

clip model explained

llm fine tuning

large language model

IBM Technology
What Are Vision Language Models? How AI Sees & Understands Images

Ready to become a certified watsonx AI Assistant Engineer? Register now and use code IBMTechYT20 for 20% off of your exam ...

9:48
What Are Vision Language Models? How AI Sees & Understands Images

83,997 views

7 months ago

Ilia
LLMs Meet Robotics: What Are Vision-Language-Action Models? (VLA Series Ep.1)

The first video in the series about Visual Language Action policies for robotics! If you've seen recent videos of robots folding ...

35:07
LLMs Meet Robotics: What Are Vision-Language-Action Models? (VLA Series Ep.1)

17,031 views

3 months ago

3Blue1Brown
Large Language Models explained briefly

A light intro to LLMs, chatbots, pretraining, and transformers. Dig deeper here: ...

7:58
Large Language Models explained briefly

4,782,491 views

1 year ago

Vizuara
Introduction to Vision Language Models (VLM)

In this lecture from the Transformers for Vision series, we take a clear and practical first step into multi-modal AI, where models ...

37:00
Introduction to Vision Language Models (VLM)

6,179 views

1 month ago

Uygar Kurt
Fine-Tune Visual Language Models (VLMs) - HuggingFace, PyTorch, LoRA, Quantization, TRL

We will fine-tune VLMs to chat with images using Python! Specifically, we'll fine-tune the Qwen2-VL-7B-Instruct model using LoRA ...

45:48
Fine-Tune Visual Language Models (VLMs) - HuggingFace, PyTorch, LoRA, Quantization, TRL

15,648 views

11 months ago

3Blue1Brown and Welch Labs
But how do AI images and videos actually work? | Guest video by Welch Labs

Diffusion models, CLIP, and the math of turning text into images Welch Labs Book: ...

37:20
But how do AI images and videos actually work? | Guest video by Welch Labs

1,396,480 views

5 months ago

Computerphile
How AI 'Understands' Images (CLIP) - Computerphile

With the explosion of AI image generators, AI images are everywhere, but how do they 'know' how to turn text strings into ...

18:05
How AI 'Understands' Images (CLIP) - Computerphile

319,190 views

1 year ago

EEML Community
[EEML'24] Jovana Mitrović - Vision Language Models

... training and evaluation V of visual representations using language it was simple uh the model the objective is also rather simple ...

1:16:34
[EEML'24] Jovana Mitrović - Vision Language Models

8,162 views

1 year ago

Umar Jamil
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Full coding of a Multimodal (Vision) Language Model from scratch using only Python and PyTorch. We will be coding the ...

5:46:05
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

118,343 views

1 year ago

NVIDIA
Build Visual AI Agents with Vision Language Models

Empower your operations team with visual AI agents that provide richer insights and natural interactions for faster ...

0:50
Build Visual AI Agents with Vision Language Models

17,665 views

1 year ago

Code Mechanics: My PhD Life in AI & Robotics
Advancing Robotics with Vision Language Action (VLA) Models | Prelim Exam Talk

What's it like to give a preliminary exam (aka Area Exam) talk as a PhD student in robotics? In this video, I share my prelim exam ...

37:13
Advancing Robotics with Vision Language Action (VLA) Models | Prelim Exam Talk

906 views

1 month ago

Uygar Kurt
Implement and Train VLMs (Vision Language Models) From Scratch - PyTorch

In this video, we will build a Vision Language Model (VLM) from scratch, showing how a multimodal model combines computer ...

1:00:25
Implement and Train VLMs (Vision Language Models) From Scratch - PyTorch

4,458 views

4 months ago

3Blue1Brown
Transformers, the tech behind LLMs | Deep Learning Chapter 5

Breaking down how Large Language Models work, visualizing how data flows through. Instead of sponsored ad reads, these ...

27:14
Transformers, the tech behind LLMs | Deep Learning Chapter 5

8,755,348 views

1 year ago

AI Papers Academy
Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM

In this video, we dive into Perception Language Models (PLMs), introduced in a recent paper from Meta titled PerceptionLM: ...

8:35
Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM

7,541 views

7 months ago

Yannic Kilcher
OpenAI CLIP: ConnectingText and Images (Paper Explained)

ai #openai #technology Paper Title: Learning Transferable Visual Models From Natural Language Supervision CLIP trains on 400 ...

48:07
OpenAI CLIP: ConnectingText and Images (Paper Explained)

167,345 views

4 years ago

AI Study Hub
Vision Language Models Explained | How AI Understands Images and Text

What are Vision Language Models (VLMs) and why are they so important in modern AI? In this video, we explore ...

3:49
Vision Language Models Explained | How AI Understands Images and Text

151 views

6 months ago

Y Combinator
Chelsea Finn: Building Robots That Can Do Anything

... Physical Intelligence: A New Approach 01:47 - Learning from Language Models 02:08 - Data Sources for Training Robots 03:32 ...

44:53
Chelsea Finn: Building Robots That Can Do Anything

82,139 views

5 months ago

Trelis Research
Top Vision Models 2025: Qwen 2.5 VL, Moondream, & SmolVLM (Fine-Tuning & Benchmarks)

... One-click-llms: https://github.com/TrelisResearch/one-click-llms/ TIMESTAMPS: 00:00 Introduction to Vision Language Models ...

1:11:20
Top Vision Models 2025: Qwen 2.5 VL, Moondream, & SmolVLM (Fine-Tuning & Benchmarks)

15,409 views

10 months ago

Trelis Research
Fine-tune Multi-modal LLaVA Vision and Language Models

ADVANCED Vision Fine-tuning Repo: https://trelis.com/advanced-vision/ 🗝️ Get Trelis All Access (Trelis.com/All-Access) 1.

51:06
Fine-tune Multi-modal LLaVA Vision and Language Models

42,530 views

1 year ago

IBM Technology
How Large Language Models Work

Learn in-demand Machine Learning skills now → https://ibm.biz/BdK65D Learn about watsonx → https://ibm.biz/BdvxRj Large ...

5:34
How Large Language Models Work

1,292,227 views

2 years ago

The TWIML AI Podcast with Sam Charrington
π0: A Foundation Model for Robotics with Sergey Levine - 719

We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the ...

52:01
π0: A Foundation Model for Robotics with Sergey Levine - 719

20,103 views

10 months ago

Ultralytics
Vision Language Models | Multi Modality, Image Captioning, Text-to-Image | Advantages of VLM's

Join us in this episode as we explore the world of Vision Language Models (VLMs) and their diverse applications. We'll dive into ...

6:35
Vision Language Models | Multi Modality, Image Captioning, Text-to-Image | Advantages of VLM's

14,476 views

1 year ago