ViewTube

58 results

MLWorks
🔥Stabilize LLM Training #llms #attentionmechanism #largelanguagemodels #modeltrains #llmexplained

How to stabilize LLM training using Layer Normalization and GELU activation function.

1:17

98 views

7 days ago

ByteQuest
CNN Explained Visually: Padding, Stride, Pooling, Receptive Fields, Dilation & Layer Architecture

In this video, we understood the core building blocks of a convolutional neural network. We started with convolution and saw how ...

23:55

201 views

6 days ago

Uplatz
Normalization Techniques in Deep Learning: Theory, Intuition & Training Stability | Uplatz

... covariate shift and gradient instability Batch Normalization: intuition, formulation, and training impact Layer Normalization and ...

7:22

41 views

5 days ago

Tales Of Tensors
I Followed One Token Through a Transformer (Every Step)

... encoding sentencepiece tokenizer embedding layer positional embeddings rotary positional embeddings layer normalization ...

8:17

172 views

1 day ago

ML Guy
The Core Building Block Behind GPT (Explained Visually)

Residual Connections and Layer Normalization: why deep Transformers are stable and trainable. Rather than treating the ...

6:18

157 views

7 days ago

Skill Advancement
Why Batch Normalization Fails in Transformers: The Padding Problem Explained

The LayerNorm Solution: Why Layer Normalization is superior for Transformers because it operates "horizontally" across features, ...

7:37

19 views

1 day ago

Code With Aarohi
L-6 | Transformer Encoder Explained | Self-Attention, Q K V

... Self-Attention Feed Forward Neural Network (FFN) Residual Connections Layer Normalization Self-Attention Explained (Q, ...

28:00

329 views

1 day ago

AI Academy
Batch Normalization Explained | Simple Visual Intuition for Beginners

Batch Normalization (BatchNorm) is a game-changing concept in deep learning. It makes neural networks fast, stable, and ...

2:28

0 views

5 days ago

QA_AI_WIZARDS
🎥 Building a Large Language Model (LLM) from Scratch: A Ground-Up Technical Guide Using PyTorch

Core building blocks • Multi-Head Self-Attention • Layer Normalization • Feed-Forward Networks • Residual Connections Each ...

5:24

96 views

7 days ago

ByteQuest
LeNet-5 CNN Architecture Explained | The Network That Started Deep Learning

In this video, we break down the LeNet-5 convolutional neural network architecture layer by layer. We cover convolutions, pooling, ...

5:39

121 views

2 days ago

Däniel ebrz
3 code Neural Network Implementation and Weight Visualization Pipeline

This code demonstrates the complete pipeline for building, training, and evaluating a neural network for image classification using ...

5:50

0 views

7 days ago

Wonder Elven GouroB
STAR Traveler Institute Plan (Sense Growth Institute Plan)

Working On New STAR Traveler Light Reality Maker : META Energy, Core ...

0:10

0 views

5 days ago

Vikram Lingam
Mathematics of Neural Networks: Foundations and Advanced Applications

By viewing self-attention, residual connections, and layer normalization as outcomes of operator-splitting methods, we gain a ...

6:42

0 views

6 days ago

Big Data Landscape
Master Convolutional Neural Networks in One Video

In this comprehensive tutorial, we dive deep into Convolutional Neural Networks (CNNs) - the revolutionary deep learning ...

9:52

0 views

3 days ago

AI Podcast Series. Byte Goose AI.
Modern LLM Architectures: A Deep Dive into Efficiency and Design

For years, the race for Artificial Intelligence was a simple game of "more"—more data, more parameters, and more compute.

40:50

313 views

7 days ago

Gareth Pronovost | Build Without Code
Stop Using Airtable Like a Spreadsheet: The Right Way to Think About Airtable

If you are using Airtable like a spreadsheet, you are likely creating more work for yourself than you need to. This breakdown ...

14:32

363 views

2 days ago

BVICAM, New Delhi
203rd National Webinar on Lakehouse Architecture: Unifying Data Lakes, Warehouses, Modern Analytics

203rd National Webinar on "Lakehouse Architecture: Unifying Data Lakes and Warehouses for Modern Analytics"

1:20:43

10 views

Streamed 8 days ago

Pendar Hadinezhad
AI for Everyone | Building a Large Language Model (LLM) from Start to Finish | Session 19 | Layer Normalization

In the nineteenth session of the comprehensive course AI for Everyone, the complete path to building a large language model (LLM) from start to finish, we turn to one of the most important concepts ...

20:17

38 views

1 day ago

Pallence AI
Gradient Descent Explained Simply: How AI Models Actually Learn

Deep Learning Foundations Playlist (start here): https://youtu.be/YQj9fgqkSTA?si=odzNpNUAy7Gs1Pj9 In this video, I break down ...

18:28
Gradient Descent Explained Simply: How AI Models Actually Learn

21 views

7 days ago

Learn With A1exandre
AI Engineering - Ep 16. (Data Training II)
41:07

0 views

Streamed 5 days ago