ViewTube

ViewTube
Sign inSign upSubscriptions
Filters

Upload date

Type

Duration

Sort by

Features

Reset

35 results

3cycle
Intro to GPU programming with CUDA

A 'Math Club' talk, by 2swap!

1:09:17
Intro to GPU programming with CUDA

2,500 views

5 days ago

Sonsie Face
Nvidia CUDA vs Apple Metal for AI Work

I take a look at the different architectures for creating and using AI, and Apple's Metal is surprisingly good in some scenarios.

5:22
Nvidia CUDA vs Apple Metal for AI Work

442 views

5 days ago

The Silicon Skeptic
First C++ CUDA  Program

In this video, I break down two simple CUDA C++ programs that do the same core operation, increment array values, but with ...

5:40
First C++ CUDA Program

31 views

1 day ago

Cihangir Tezcan
CUDA Optimization of NSA's Cipher: SPECK

Full Course: https://www.youtube.com/playlist?list=PLUoixF7agmIujuNg-OLK4GyoHYulgOCFi CUDA source codes and the paper: ...

8:21
CUDA Optimization of NSA's Cipher: SPECK

27 views

2 days ago

Tinge Zhang
20260303 StitchCUDA Automated GPU Programming

StitchCUDA is an automated multi-agent framework designed to generate and optimize end-to-end GPU programs for complex ...

9:27
20260303 StitchCUDA Automated GPU Programming

0 views

7 days ago

AI with Arun
CUDA Agent Beats claude gemini at GPU Optimization #ai #llms #reinforcementlearning  #researchpaper

AI models are getting smarter — but GPUs are struggling to keep up. The biggest bottleneck? CUDA kernel optimization. Only a ...

10:40
CUDA Agent Beats claude gemini at GPU Optimization #ai #llms #reinforcementlearning #researchpaper

8 views

6 days ago

Priyam Mazumdar
Triton Grouped Matrix Multiplication (Almost CUDA Performance!) | A MyTorch Sidequest

Code: https://github.com/priyammaz/TritonKernels/tree/main We implement Grouped Matrix Multiplication that simply reorganizes ...

36:19
Triton Grouped Matrix Multiplication (Almost CUDA Performance!) | A MyTorch Sidequest

103 views

6 days ago

Tinge Zhang
20260227 CUDA Agent: Large-Scale Agentic RLfor High-Performance CUDA Kernel Generation

To address the extreme technical difficulty of CUDA programming, the researchers developed a scalable data synthesis pipeline ...

8:42
20260227 CUDA Agent: Large-Scale Agentic RLfor High-Performance CUDA Kernel Generation

0 views

7 days ago

Cuda Education
GPU Programming | Compute Particles PART 3.1 | Shader link for attach to cursor feature | Vulkan API

In this episode, I investigate the link between the shader on the GPU side and the mouse position data on the CPU/Operating ...

18:21
GPU Programming | Compute Particles PART 3.1 | Shader link for attach to cursor feature | Vulkan API

0 views

1 day ago

SciPulse
CUDA Agent: Large-Scale Agentic RL for High-Performance GPU Kernel Generation

*Key Insights & Contributions:* • *The Three-Stage Data Pipeline* — To overcome the scarcity of expert CUDA code, the ...

7:11
CUDA Agent: Large-Scale Agentic RL for High-Performance GPU Kernel Generation

115 views

2 days ago

Vinh Nguyen
Tile the Tensors

... learning curve for these abstractions is steep, they offer significant benefits for writing portable, high-performance CUDA code.

6:43
Tile the Tensors

6 views

5 days ago

Cihangir Tezcan
CUDA Optimization of KASUMI and A5/3

Full Course: https://www.youtube.com/playlist?list=PLUoixF7agmIujuNg-OLK4GyoHYulgOCFi CUDA source codes and the paper: ...

13:00
CUDA Optimization of KASUMI and A5/3

43 views

5 days ago

Afrokit Media
Rope Space Worm Face Swap Installation

Rope Space Worm Face Swap Installation. This video guide will teach you how to install the Rope Space Worm Version.

16:40
Rope Space Worm Face Swap Installation

152 views

5 days ago

Cuda Education
Apple Metal | Volumetric Scattering (Fog) PART 2 | Schlick Phase Function | Cuda Education

In this episode I investigate the Schlick phase function which is used to give us the fog effect in our scene. The Schlick phase ...

42:26
Apple Metal | Volumetric Scattering (Fog) PART 2 | Schlick Phase Function | Cuda Education

10 views

1 day ago

LLVM
WiCT Meetup — Saturday, March 14, 2026: Compiler Optimizations for CPU-GPU

WiCT Meetup — Saturday, March 14, 2026 Title: Compiler Optimizations for CPU-GPU Heterogeneous Systems ----- Speaker: ...

1:47:33
WiCT Meetup — Saturday, March 14, 2026: Compiler Optimizations for CPU-GPU

116 views

14 hours ago

Ray Fernando
OpenClaw's Creator Says Use This Plugin

The founder of OpenClaw just recommended a third-party plugin over his own built-in memory system. Pete Steinberger tweeted ...

1:47:03
OpenClaw's Creator Says Use This Plugin

5,137 views

Streamed 2 days ago

Eran Feit
Make YOLOv8 10x Faster with Nvidia TensorRT

Stop settling for slow inference speeds. If you want to Make YOLOv8 10x Faster with Nvidia TensorRT, this is the only tutorial you ...

17:29
Make YOLOv8 10x Faster with Nvidia TensorRT

39 views

1 day ago

Cuda Education
Apple Metal | Volumetric Scattering (Fog) PART 1 | Tile Size, Depth Slice, Voxel, Froxel

In this episode I start the volumetric scattering series that deals with simulating fog in our scene. We go through some basic ...

34:40
Apple Metal | Volumetric Scattering (Fog) PART 1 | Tile Size, Depth Slice, Voxel, Froxel

43 views

6 days ago

Solo Swift Crafter
Nvidia Has a Problem. It's Called Apple.

Come be part of the crew: https://www.crafterslab.dev Your MacBook might already be outperforming a $2000 Nvidia GPU for the ...

12:27
Nvidia Has a Problem. It's Called Apple.

5,014 views

6 days ago

GitHub Trending Digest
GitHub Trending Today - Vibe Coding CN, PyMuPDF, Hardseed, Fluxer & More | #2

Discover the hottest GitHub repositories trending today, from AI coding assistants to powerful PDF libraries and more!

12:47
GitHub Trending Today - Vibe Coding CN, PyMuPDF, Hardseed, Fluxer & More | #2

126 views

3 days ago