15,392 results
Ready to become a certified watsonx AI Assistant Engineer? Register now and use code IBMTechYT20 for 20% off of your exam ...
17,911 views
6 months ago
Red Hat's Mark Kurtz and Megan Flynn examine speculative decoding, a technique that uses a smaller, faster model—the ...
317 views
1 month ago
About the seminar: https://faster-llms.vercel.app Speaker: Hongyang Zhang (Waterloo & Vector Institute) Title: EAGLE and ...
2,788 views
10 months ago
Speculative decoding is one of the most important performance optimizations in modern LLM serving—and most people still don't ...
47 views
1 day ago
There is a lot of possibility with Speculative Decoding allowing us normal folks to run larger and larger AI models at home. I hope ...
18,531 views
9 months ago
Speed up your Large Language Model by 2 or 3 times with OpenVINO's speculative decoding. Much faster inference without ...
196,877 views
5 months ago
This is a single lecture from a course. If you like the material and want more context (e.g., the lectures that came before), check ...
109 views
3 weeks ago
Today, we're joined by Chris Lott, senior director of engineering at Qualcomm AI Research to discuss accelerating large language ...
1,655 views
This video introduces the playlist, explains the Speculative Decoding Algorithm from the paper https://arxiv.org/pdf/2211.17192 in ...
18 views
5 days ago
Session covering an overview of speculative decoding and several seminal papers in the space, including Medusa, Eagle 1/2/3, ...
458 views
2 days ago
Speculative decoding is a technique used to speed up LLM inference by using a small, fast model to quickly generate text. A big ...
188 views
7 months ago
In this video, we're diving deep into Speculative Decoding, an advanced technique that is revolutionizing how AI language ...
329 views
8 months ago
AI Gold Nugget #2.1 – Speaker #1: Raphael Vienne Can we go faster…
33 views
Speculative decoding is usually discussed as a way to make real time LLM APIs feel faster. But what happens when you apply it to ...
28 views
Speculative decoding is one of the most powerful - and misunderstood - techniques for speeding up LLM inference.
54 views
arxiv - https://arxiv.org/pdf/2510.19779 Become AI Researcher & Train LLM From Scratch ...
435 views
2 months ago
6 views
Title: Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies Authors: ...
50 views
What is speculative decoding—at a high level? Why is it used (what problem does it solve)? How does it differ from standard ...
37 views
4 months ago
Full Title: Yikai Zhu, Lukec Wang: SpecForge: Open Source Framework for Training Speculative Decoding Models Speculative ...
3 views
2 weeks ago