Wow, the picture with the RSS contours and the intuition for why the lasso sets coefficients to exactly zero is beautiful. I haven't seen an illustration like this before. Thank you!
My supervisor/collaborator in ML introduced me to Tibshirani's work during my MSc. and I feel like I was once blind and now can see. Thank you so much for these videos...
I like Daniele; she brings a youthful vibrancy to the presentation.
DAYUMN! You came up with this? That's epic B)
Amazing content! Thanks!
One question. Why don't we manually set a threshold epsilon and drop the coefficients that fall below it in Ridge, since Ridge should run faster thanks to its differentiability?
On Lasso being more likely to produce a coefficient of exactly ZERO, unlike Ridge: is it fair to say that for coefficients less than 1, Ridge will impose a smaller penalty than Lasso because of the squared term (the square of a number less than 1 is smaller than the number itself)? And on the other hand, if we anticipate coefficients greater than 1, might we be better off with Ridge because of the larger penalty and greater shrinkage?
You normalize the data before passing it to either method, so coefficient sizes are not that meaningful, and I don't think that's a good/correct intuition. Also, it doesn't explain why LASSO makes coefficients exactly zero rather than just shrinking them more. I think a better intuition is to look at the orthogonal case, where all the x_i are uncorrelated. There, what LASSO does is "soft-thresholding" (everything within a band around zero becomes EXACTLY ZERO), while Ridge just shrinks ALL COEFFICIENTS by the same factor, 1/(1+lambda) -- see the quick sketch after this comment.
It is actually well known that LASSO *overshrinks* the coefficients (for inference) -- so for purely predictive purposes, where you don't care about sparsity, you're probably better off with Ridge. Take a look at Elastic Net, and especially the original paper by Zou and Hastie, where they discuss LASSO's limitations (see elastic net vs naive elastic net for some corrections).
Also, generally speaking, LASSO makes sense when you have a lot of features and want to remove some of them -- and in those cases it's very hard to predict which coefficients are going to be > 1 ...
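To make the orthogonal-case intuition concrete, here's a minimal Python sketch of the two closed-form updates (the coefficient values and lambda are made up for illustration; this is not the video's code):

```python
import numpy as np

def soft_threshold(b, lam):
    # Lasso estimate under an orthonormal design: shrink each OLS
    # coefficient toward zero, and anything inside [-lam, lam]
    # becomes exactly zero.
    return np.sign(b) * np.maximum(np.abs(b) - lam, 0.0)

def ridge_shrink(b, lam):
    # Ridge estimate under the same design: scale every coefficient
    # by the same factor 1/(1 + lam); nothing becomes exactly zero.
    return b / (1.0 + lam)

ols = np.array([-2.0, -0.3, 0.05, 0.4, 1.5])  # hypothetical OLS coefficients
lam = 0.5
print(soft_threshold(ols, lam))  # -> [-1.5, 0, 0, 0, 1.0]  (three exact zeros)
print(ridge_shrink(ols, lam))    # -> [-1.33, -0.2, 0.033, 0.27, 1.0]  (no zeros)
```

The same OLS coefficients go in, but only the soft-thresholded (Lasso-style) version returns exact zeros.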
Why is it more likely for lasso to touch the corners of the diamond than for ridge to touch the points on the circle on each axis that would make one of the predictors equal to zero?
Consider two parameters b1 and b2, say b1 = 0.1 and b2 = 0.1. Their squares are 0.01 each, already tiny. Ridge won't bother to set them exactly to zero; instead it just makes them even smaller, say 0.001, whose square is 0.000001, a negligible number, so the squared penalty gives almost no incentive to push all the way to zero. Lasso's penalty |b| stays linear near zero, so it keeps pushing until the coefficient is exactly zero.
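Putting rough numbers on that comparison (looking only at the penalty terms themselves, ignoring the RSS part; the values are arbitrary):

```python
# Penalty paid for a single small coefficient b:
# Lasso pays |b|, which stays linear near zero,
# while Ridge pays b**2, which vanishes much faster.
for b in [0.1, 0.01, 0.001]:
    print(f"b = {b}: lasso |b| = {abs(b)}, ridge b^2 = {b**2:.6g}")
```

By b = 0.001 the ridge penalty is already down to 1e-06, which is why shrinking a bit further is "good enough" for Ridge but Lasso still gains by going to exactly zero.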
6:24 minimization equations