ViewTube

ViewTube
Sign inSign upSubscriptions
Filters

Upload date

Type

Duration

Sort by

Features

Reset

9,737 results

Tim Lohner
Speculative Decoding in a Nutshell

What is speculative decoding—at a high level? Why is it used (what problem does it solve)? How does it differ from standard ...

3:14
Speculative Decoding in a Nutshell

37 views

4 months ago

Fahd Mirza
LLM Inference - Self Speculative Decoding

This video shares a research paper which introduces a novel inference scheme, self-speculative decoding, for accelerating Large ...

2:45
LLM Inference - Self Speculative Decoding

680 views

2 years ago

MrMacaroonable
episode 1: Speculative Decoding

In this episode, we discuss how does speculative decoding speed up Large Language Models.

2:16
episode 1: Speculative Decoding

42 views

1 year ago

preminstrel
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
0:31
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

120 views

1 year ago

工gin師
What is Speculative Sampling? How does Speculative Sampling Accelerate LLM Inference

What is speculative sampling in deep mines paper accelerating large language model decoding with speculative assembly a ...

2:49
What is Speculative Sampling? How does Speculative Sampling Accelerate LLM Inference

787 views

2 years ago

Investing Basics
Jim Simons: How I made Billions

More Videos like this Charline Munger: Why I HATE Tesla? https://youtu.be/SzAVnkwo8I0 Charlie Munger: Why China is Better ...

0:33
Jim Simons: How I made Billions

681,742 views

4 years ago

IBM Research
IBM Granite tops the SQL charts, faster chatbots with speculative decoding, & IBM AI at Wimbledon

Welcome to The Short, the biweekly recap of IBM's latest innovations and research. This week we dive into IBM's Granite LLM ...

2:19
IBM Granite tops the SQL charts, faster chatbots with speculative decoding, & IBM AI at Wimbledon

717 views

1 year ago

Arxiv Papers
[short] Online Speculative Decoding

Online speculative decoding is introduced as a technique to improve the efficacy of speculative decoding in large language ...

2:18
[short] Online Speculative Decoding

63 views

2 years ago

Prophet Perry
DECODING THE GLITCH: The Secret Code Behind the "Kubala Kingdom" Viral Video

Is the viral "Kubala Kingdom" incident just random chaos, or is it a carefully engineered social experiment? We received an ...

1:45
DECODING THE GLITCH: The Secret Code Behind the "Kubala Kingdom" Viral Video

216 views

3 months ago

FranksWorld of AI
AI Inference: Cutting Latency 4X with Speculative Decoding #shorts

Model speculation dramatically slashes inference times! A smaller, faster model predicts tokens, while the larger model verifies.

0:38
AI Inference: Cutting Latency 4X with Speculative Decoding #shorts

216 views

4 months ago

AI Research Roundup
AdaSPEC: Selective KD for Faster LLM Spec Decoding

... Selective Knowledge Distillation for Efficient Speculative Decoders' This work tackles inefficiencies in speculative decoding, ...

3:42
AdaSPEC: Selective KD for Faster LLM Spec Decoding

0 views

2 weeks ago

Anthony Lee - The A.I. Marketer Guy
GPT4 structure leaked! Speculative decoding may be reason for declined performance

GPT4 structure leaked! Speculative decoding may be reason for declined performance. #gpt4 #openai.

2:12
GPT4 structure leaked! Speculative decoding may be reason for declined performance

204 views

2 years ago

History – Dark Archive
Buga Sphere's SECRET Makeup Revealed! 🛸🧪

FULL VIDEO: https://youtu.be/5l6IC-BaLmE The mysterious Buga Sphere's secret composition is finally REVEALED! Scientists ...

0:52
Buga Sphere's SECRET Makeup Revealed! 🛸🧪

687,272 views

7 months ago

Devansh: Chocolate Milk Cult Leader
Speculative Decoding: The inference technique that will change LLMs

Speculative decoding is a technique designed to accelerate the inference speed of large language models (LLMs). It leverages a ...

0:44
Speculative Decoding: The inference technique that will change LLMs

648 views

10 months ago

Cooking Astrology
Two Planets - Career in Stock Market (Full Time Trader)

Stock Market in Astrology. In this video you will learn about the yoga for becoming successful in Stock Market. What are the two ...

0:30
Two Planets - Career in Stock Market (Full Time Trader)

177,396 views

2 years ago

sami malik
NVIDIA TensorRT + Speculative Decoding: The AI Speed Upgrade You Need

Discover how speculative decoding can significantly boost the inference speed of the Llama 3.3 70B model, increasing ...

1:53
NVIDIA TensorRT + Speculative Decoding: The AI Speed Upgrade You Need

63 views

11 months ago