reinforcement learning tutorial

Reinforcement learning research with Joseph Suarez

Watch science advance live! I am an MIT PhD and stream my research on reinforcement learning. You can also find me here: ...

4:52:42

Reinforcement learning research with Joseph Suarez

127 views

Streamed 35 minutes ago

FlexSim Geek

Reinforcement Learning in FlexSim | Complete Setup & Python Training Tutorial

Welcome back to FlexSim Geek! In this tutorial, you'll learn how to set up and train a Reinforcement Learning (RL) agent using ...

8:37

Reinforcement Learning in FlexSim | Complete Setup & Python Training Tutorial

45 views

5 days ago

DEEPTECH AI LABS

FUGU Proves Intelligence Isn't What You Think

... (three-role multi-model orchestration — Thinker, Worker, Verifier) and Conductor (GRPO reinforcement learning that trained the ...

5:48

FUGU Proves Intelligence Isn't What You Think

4,720 views

1 day ago

Wes Roth

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the ...

19:34

Cursor JUST beat EVERYONE...

50,569 views

1 day ago

NeuraForge

Deep Reinforcement Learning — what do you get when you combine an agent that learns by trial and error with the raw ...

6:50

Deep Reinforcement Learning

18 views

4 days ago

Gyeongsik Moon

[Reinforcement Learning] 13. Behavior Cloning

Reinforcement Learning (2026-1) Korea University Prof. Gyeongsik Moon Lecture slides: ...

50:13

[Reinforcement Learning] 13. Behavior Cloning

27 views

2 days ago

Field AI & Robotics Laboratory

Vision-Based Autonomous Drone Landing on Moving Platforms via Deep Reinforcement Learning

This video presents our work, “Vision-Based Autonomous Drone Landing on Moving Platforms With Uncertain Motion via Deep ...

3:34

Vision-Based Autonomous Drone Landing on Moving Platforms via Deep Reinforcement Learning

323 views

3 days ago

Learn AI Visually

MiniMax-M2 — Forge RL prefix-tree merging: 40× faster RL training

It is how MiniMax-M2 trains its 230-billion-parameter open mixture-of-experts model with reinforcement learning up to 40× faster.

0:49

MiniMax-M2 — Forge RL prefix-tree merging: 40× faster RL training

0 views

15 hours ago

Anas M Elgezawy

GoalNet-RL: Multi-Agent Football Reinforcement Learning System | Q-learning, DQN, Double, & Dueling

Techniques: Python, PyTorch, Reinforcement Learning, Deep Q-Networks, Multi-Agent Systems – Designed and developed a ...

1:37

GoalNet-RL: Multi-Agent Football Reinforcement Learning System | Q-learning, DQN, Double, & Dueling

1 view

4 days ago

AI Focus

Unsupervised Reinforcement and Auxiliary Learning (UNREAL)

UNREAL agent combines the primary policy 𝜋 trained with A3C and auxiliary task polices 𝜋_c trained on data in experience ...

8:55

Unsupervised Reinforcement and Auxiliary Learning (UNREAL)

33 views

6 days ago

Simplilearn

Enroll Now To The Best AI and Machine Learning Courses ...

8:32:55

Deep Learning Full Course 2026 [FREE] | Deep Learning Tutorial | Deep Learning Course | Simplilearn

1,549 views

Streamed 1 day ago

Weight and See

Sim to Real: The Physical AI Breakthrough

03:05 Solution 1: Sim-to-Real Transfer Explained 05:10 Solution 2: Generalized Reinforcement Learning for Physics 07:00 What ...

2:32

Sim to Real: The Physical AI Breakthrough

1 view

4 days ago

Gyeongsik Moon

[Reinforcement Learning] 2. 3D Representations

Reinforcement Learning (2026-1) Korea University Prof. Gyeongsik Moon Lecture slides: ...

1:00:52

[Reinforcement Learning] 2. 3D Representations

406 views

2 days ago

CR Labs

How AI Models Are Aligned: Fine-Tuning and RLHF Explained

03:33 Reinforcement learning from verifiable rewards 03:36 When you can check the answer, RL wins 03:59 The arc in one line ...

5:29

How AI Models Are Aligned: Fine-Tuning and RLHF Explained

8 views

6 days ago

Creative Py

🚗 How Self-Driving Cars Learn? | Reinforcement Learning Algorithms Explained #ai

Not all AI learns from data the same way. Some AI learns from labels. Some AI discovers hidden patterns. But the most advanced ...

8:38

🚗 How Self-Driving Cars Learn? | Reinforcement Learning Algorithms Explained #ai

6 views

6 days ago

Standarity

What is Gepa Prompt Optimizer? GEPA (ICLR 2026) reflective prompt-evolution optimizer beats GRPO by up to 20% u What if an ...

4:24

What is Gepa Prompt Optimizer?

2 views

6 days ago

Unreal_Patrick

10- IsaacLab External Project: Customizing parameters of the Reward Function and RL algorithm

This is a beginner-friendly tutorial on how to add a new task to your external project and how to customize its parameters for the ...

18:29

10- IsaacLab External Project: Customizing parameters of the Reward Function and RL algorithm

22 views

1 day ago

Bookworm

TRACER: Turn-level Regret Matching with Inner Reinforcement Credit for Cooperative Multi-LLM Reasoning Paper: ...

7:50

EP258 TRACER AI Collaboration

4 views

3 days ago

Simplilearn

Enroll Now To The Best AI and Machine Learning Courses ...

8:32:51

Deep Learning Full Course 2026 [FREE] | Deep Learning Tutorial | Deep Learning Course | Simplilearn

940 views

Streamed 1 day ago

FelixRomo

Metis-Core v0.2.0: Multi-Agent Reinforcement Learning with Shared-Trunk Multi-Head Architecture

Technical demonstration of Metis-Core v0.2.0-alpha, my custom AI framework developed in C++ and LibTorch, specifically ...

4:23

Metis-Core v0.2.0: Multi-Agent Reinforcement Learning with Shared-Trunk Multi-Head Architecture

5 views

1 day ago

ViewTube