ViewTube

ViewTube
Sign inSign upSubscriptions
Filters

Upload date

Type

Duration

Sort by

Features

Reset

508 results

I had no idea
RDEP: Replicated Dense / Expert Parallel

In this video, we go over the RDEP or Replicated Dense / Expert Parallel technique used to train Mixture of Experts large ...

6:11
RDEP: Replicated Dense / Expert Parallel

135 views

6 days ago

NeuralNine
Linear Regression From Scratch in Python (Mathematical, Closed-Form)

Today we implement Linear Regression from scratch in Python using the closed-form solution. We first cover the mathematical ...

27:26
Linear Regression From Scratch in Python (Mathematical, Closed-Form)

4,085 views

6 days ago

Discover AI
Forget LLM: MIT's New RLM (Phase Shift in AI)

We've been misled by the promise of "infinite" context windows: new AI research proves that "Context Rot" is destroying reasoning ...

32:48
Forget LLM: MIT's New RLM (Phase Shift in AI)

22,375 views

4 days ago

InstaLILY AI
How Presentation Impacts LLM Performance on NP-Hard Problems | Insta After Hours

In this lecture, Alex Duchnowski, a software engineer at Instalily, presents research examining whether large language models ...

1:12:06
How Presentation Impacts LLM Performance on NP-Hard Problems | Insta After Hours

253 views

6 days ago

Magicalbat
coding a machine learning library in c from scratch

corrections: 23:23 - Forgot to change a cols to a rows in for loop 1:35:10 - You should also check if cur does not require gradient ...

2:26:17
coding a machine learning library in c from scratch

149,142 views

7 days ago

AI Engineer
DSPy: The End of Prompt Engineering - Kevin Madura, AlixPartners

Applications developed for the enterprise need to be rigorous, testable, and robust. The same is true for applications that use AI, ...

1:13:13
DSPy: The End of Prompt Engineering - Kevin Madura, AlixPartners

8,813 views

16 hours ago

Matthew Farrugia-Roberts
Linear Regression with JAX Autodiff // Hi, JAX! Act I // Lecture 01

Welcome to the first lecture for Act I of "Hi, JAX!", an introduction to vanilla JAX for deep learning research! In this video, we ...

21:14
Linear Regression with JAX Autodiff // Hi, JAX! Act I // Lecture 01

75 views

4 days ago

AI Paper Review
SEAL: Self-Adapting Language Models via Reinforcement Learning

This paper proposes a self-adaptive language model (SEAL) framework that learns by adjusting the weight by self-adjusting to ...

6:45
SEAL: Self-Adapting Language Models via Reinforcement Learning

46 views

7 days ago

CS50
CS50x 2026 - Artificial Intelligence

This is CS50, Harvard University's introduction to the intellectual enterprises of computer science and the art of programming.

47:49
CS50x 2026 - Artificial Intelligence

88,569 views

7 days ago

AI Research Roundup
Scaling Hyperparameters across Width, Depth, and Batch

In this AI Research Roundup episode, Alex discusses the paper: 'Completed Hyperparameter Transfer across Modules, Width, ...

4:55
Scaling Hyperparameters across Width, Depth, and Batch

119 views

6 days ago

Tech With Mala
#51. LangChain MathChain Explained | Solve Math Problems with LLMs

In this video, you'll learn how MathChain works in LangChain, with a clear hands-on demo. MathChain is designed to help LLMs ...

7:05
#51. LangChain MathChain Explained | Solve Math Problems with LLMs

0 views

7 days ago

Analytical Tips
Large Language Models for Dummies

This video explains that large language models function as advanced mathematical tools designed to predict the most probable ...

7:13
Large Language Models for Dummies

26 views

3 days ago

Philip Zucker
Satisfiability Modulo Theories (SMT)

A practice run for a tutorial on some topics in Satisfiability Modulo Theories for an egraphs workshop ...

36:20
Satisfiability Modulo Theories (SMT)

150 views

6 days ago

Ai Verdict
DeepSeek’s MHC Architecture: The End of the Residual Stream

For ten years, the residual connection has been the immutable standard of deep learning. It saved the industry from the vanishing ...

7:09
DeepSeek’s MHC Architecture: The End of the Residual Stream

432 views

6 days ago

LuxaK
AlphaEvolve: A coding agent for scientific and algorithmic discovery

AlphaEvolve is presented as an evolutionary coding agent designed to significantly enhance the capabilities of state-of-the-art ...

7:20
AlphaEvolve: A coding agent for scientific and algorithmic discovery

34 views

6 days ago

Göran bäcklund
CSharpNumerics Machine Learning

CSharpNumerics includes a lightweight, fully numerical machine learning framework designed for research, experimentation, and ...

9:38
CSharpNumerics Machine Learning

21 views

4 days ago

CN ACADEMY
Data Structures and algorithms - Expression trees

Data Structures and algorithms - Expression trees.

13:22
Data Structures and algorithms - Expression trees

13 views

5 days ago

kmit vista
SESSION - 11_PyTorch Implementation

Speaker : Prof. NEIL GOGTE.

1:52:56
SESSION - 11_PyTorch Implementation

54 views

4 days ago

CanConTech
10-minute paper (episode 37): Cramming 1568 Tokens into a Single Vector and Back Again

This research explores the extreme limits of information density within Large Language Model (LLM) input embeddings by ...

8:53
10-minute paper (episode 37): Cramming 1568 Tokens into a Single Vector and Back Again

141 views

7 days ago

The Savvy Scholar
How to Scale an LLM: The Engineer's Guide to Massive AI Models | The LLM Scaling Cookbook

Ever wonder how AI models like GPT-4 or Llama 3 actually run without crashing a computer? In this video, we crack open the ...

7:51
How to Scale an LLM: The Engineer's Guide to Massive AI Models | The LLM Scaling Cookbook

35 views

7 days ago