Open-source LLM tracing, evals and prompt optimization with Evidently (00:15:55)
8. Tutorial: Adversarial testing for LLM applications (00:13:24)
7. Tutorial: Building and evaluating an AI agent (00:17:35)
6.2. Tutorial: Building and evaluating a RAG system (00:16:13)
6.1. How to evaluate a RAG system: methods and metrics (00:07:08)
5. Tutorial: Evaluating LLMs on content generation tasks. Tracing and experiments (00:26:01)
4. Tutorial: Evaluating LLMs on classification tasks (00:19:01)
3. Tutorial: How to create an LLM judge and align with human labels (00:24:24)
2.3. Tutorial on LLM evaluation methods: Reference-free evals (00:14:07)
2.2. Tutorial on LLM evaluation methods: Reference-based evals (00:10:26)
2.1. Tutorial on LLM evaluation methods: Overview and Basic API (00:10:13)
1. Introduction to LLM evaluations in 10 key ideas (00:11:43)
LLM evaluation for builders - Course announcement (00:01:43)
How to run LLM evals with no code | PRACTICE (00:12:27)
How to continuously improve LLM products? (00:02:34)
A business case for LLM evaluations (00:04:06)
AI agent and RAG evaluation (00:06:05)
LLM observability in production: tracing and online evals (00:05:09)
LLM safety and red-teaming (00:05:11)
LLM-as-a-judge: evaluating LLMs with LLMs (00:05:41)
LLM evaluation methods and metrics (00:05:10)
LLM evaluation datasets: test cases and synthetic data (00:06:06)
How to evaluate an LLM application (00:06:00)
LLM evaluation benchmarks (00:03:07)
What is an LLM-powered product? (00:03:08)
Welcome to the LLM evaluation course (00:01:32)
Open-source LLM Evaluation with Evidently - Intro (00:02:30)
LLM Evaluation Tutorial with Evidently (00:35:45)
6.5. Connecting the dots: full-stack ML observability (00:05:21)
6.4. ML monitoring with Evidently and Grafana [OPTIONAL CODE PRACTICE] (00:25:03)