Oscar Savolainen

Alma: Free Tool to Speed Up your PyTorch Model | Auto-Benchmark 50+ Conversion Options

20:57

Oscar Savolainen

GPTQ Quantization EXPLAINED

34:13

Oscar Savolainen

Speedrun deploying LLM Embedding models into Production

7:19

Oscar Savolainen

Cross Layer Equalization: Everything You Need to Know

12:52

Oscar Savolainen

How to see inside Neural Networks: New Tensor Histogram and Jacobian Sensitivity Analysis Tool!

12:18

Oscar Savolainen

Quantization Aware Training (QAT) With a Custom DataLoader: Beginner's Tutorial to Training Loops

26:13

Oscar Savolainen

Advanced PyTorch Graph Manipulation: FX Graph Mode Quantization Coding tutorial - Part 3/3

20:51

Oscar Savolainen

How does Graph Mode Affect Quantization? FX Graph Mode Quantization Coding tutorial - Part 2/3

14:06

Oscar Savolainen

How to do FX Graph Mode Quantization: FX Graph Mode Quantization Coding tutorial - Part 1/3

22:01

Oscar Savolainen

How to Quantize a ResNet from Scratch! Full Coding Tutorial (Eager Mode)

1:05:42

Oscar Savolainen

How to statically quantize a PyTorch model (Eager mode)

23:55

Oscar Savolainen

Understanding int8 neural network quantization

22:53

Oscar Savolainen

The benefits of quantizing your neural network to int8

4:50