ViewTube

ViewTube
Sign inSign upSubscriptions
Filters

Upload date

Type

Duration

Sort by

Features

Reset

46 results

Notre Dame - IBM Technology Ethics Lab
Evaluations, Metrics, and Benchmarks with Diego Gomez-Zara, Werner Geyer, and Zahra Ashktorab

We are developing interactive tools that make it easier for researchers and practitioners to assess large-language-model (LLM) ...

6:21
Evaluations, Metrics, and Benchmarks with Diego Gomez-Zara, Werner Geyer, and Zahra Ashktorab

10 views

6 days ago

Neural Intel Media
Beyond the Exam Room: Stress-Testing Clinical AI with Medmarks v0.1

In this deep-dive episode, Neural Intel goes behind the data of the Medmarks v0.1 benchmark suite, led by Sophont and the ...

27:13
Beyond the Exam Room: Stress-Testing Clinical AI with Medmarks v0.1

0 views

6 days ago