Upload date
All time
Last hour
Today
This week
This month
This year
Type
All
Video
Channel
Playlist
Movie
Duration
Short (< 4 minutes)
Medium (4-20 minutes)
Long (> 20 minutes)
Sort by
Relevance
Rating
View count
Features
HD
Subtitles/CC
Creative Commons
3D
Live
4K
360°
VR180
HDR
3,685 results
In this video we define the basics of quantization and look at how its benefits and how it affects large language models.
28,922 views
2 years ago
In this video, we discuss the fundamentals of model quantization, the technique that allows us to run inference on massive LLMs ...
46,460 views
9 months ago
Run massive AI models on your laptop! Learn the secrets of LLM quantization and how q2, q4, and q8 settings in Ollama can save ...
415,192 views
1 year ago
Quantizing models for maximum efficiency gains! Resources: Model Quantized: ...
22,959 views
VIDEO TITLE What is LLM Quantization? ✍️VIDEO DESCRIPTION ✍️ Large Language Models (LLMs) are built using ...
3,100 views
11 months ago
This video explores DeepSeek R1, how distilled versions and quantization make it more accessible, and the trade-offs between ...
23,456 views
Text:* https://github.com/The-Pocket/PocketFlow-Tutorial-Video-Generator/blob/main/docs/llm/quantization.md 0:00:00 ...
2,543 views
3 months ago
Welcome back to the Ollama course! In this lesson, we dive into the fascinating world of AI model quantization. Using variations of ...
29,634 views
Large Language Models (LLMs) are measured by the number of parameters they contain – the number of weights and biases ...
44,477 views
Are 1-bit LLMs the future of efficient AI? Or just a catchy Microsoft metaphor? In this video, we break down BitNet, the so-called ...
88,670 views
The first comprehensive explainer for the GGUF quantization ecosystem. GGUF quantization is currently the most popular tool for ...
50,160 views
8 months ago
Quantization is a common technique used to reduce the model size, though it can sometimes result in reduced accuracy.
162,610 views
Try Voice Writer - speak your thoughts and let AI handle the grammar: https://voicewriter.io Four techniques to optimize the speed ...
61,338 views
A NEW benchmark and guide which quantization models to use locally on your PC or laptop. Either in Ollama or in LM Studio, ...
3,988 views
6 months ago
In this video I will introduce and explain quantization: we will first start with a little introduction on numerical representation of ...
51,617 views
Learn how model quantization and distillation—two key techniques for large model compression—help reduce costs and improve ...
949 views
Download Tanka today https://www.tanka.ai and enjoy 3 months of free Premium! You can also get $20 / team for each referrals ...
367,904 views
I Made ChatGPT-2 Run on a Potato (63MB AI Model!) - Extreme Quantization Experiment What happens when you compress a ...
486,185 views
Papers / Resources ▭▭▭ LoRA Paper: https://arxiv.org/abs/2106.09685 QLoRA Paper: https://arxiv.org/abs/2305.14314 ...
121,540 views