Proof of the Cauchy-Schwarz inequality | Vectors and spaces | Linear Algebra | Khan Academy

  • Published 15 Oct 2024
  • Courses on Khan Academy are always 100% free. Start practicing, and saving your progress, now: www.khanacadem...
    Proof of the Cauchy-Schwarz Inequality
    Watch the next lesson: www.khanacadem...
    Missed the previous lesson?
    www.khanacadem...
    Linear Algebra on Khan Academy: Have you ever wondered what the difference is between speed and velocity? Ever try to visualize in four dimensions or six or seven? Linear algebra describes things in two dimensions, but many of the concepts can be extended into three, four or more. Linear algebra implies two dimensional reasoning, however, the concepts covered in linear algebra provide the basis for multi-dimensional representations of mathematical reasoning. Matrices, vectors, vector spaces, transformations, eigenvectors/values all help us to visualize and understand multi dimensional concepts. This is an advanced course normally taken by science or engineering majors after taking at least two semesters of calculus (although calculus really isn't a prereq) so don't confuse this with regular high school algebra.
    About Khan Academy: Khan Academy offers practice exercises, instructional videos, and a personalized learning dashboard that empower learners to study at their own pace in and outside of the classroom. We tackle math, science, computer programming, history, art history, economics, and more. Our math missions guide learners from kindergarten to calculus using state-of-the-art, adaptive technology that identifies strengths and learning gaps. We've also partnered with institutions like NASA, The Museum of Modern Art, The California Academy of Sciences, and MIT to offer specialized content.
    For free. For everyone. Forever. #YouCanLearnAnything
    Subscribe to KhanAcademy’s Linear Algebra channel:: / channel
    Subscribe to KhanAcademy: www.youtube.co...

COMMENTS • 132

  • @fanrco766
    @fanrco766 5 years ago +35

    I highly recommend trying to prove this theorem in R2, R3, and R4 just to get a better intuition for how the variables cancel out. My professor gave us a take-home test where we had to prove Cauchy-Schwarz in R4, and it was honestly a very beautiful proof. The day after, in class, he was able to use the structure of the proof in R4 to generalize to Rn.

  • @boipelojoe
    @boipelojoe 6 years ago +4

    b/2a is chosen to minimize the function: it's a quadratic function, and it achieves its minimum at t = b/2a. "Let me pick any number, b/2a": we know that's not just any value :) You can't prove it without b/2a.
    You can also differentiate p(t).
    From p(t) = (y.y)t^2 - 2(x.y)t + x.x
    p'(t) = 2t||y||^2 - 2(x.y) *** x.x is a constant, so its derivative is 0
    p(t) achieves its minimum where p'(t) = 0
    => 2t||y||^2 - 2(x.y) = 0
    => t||y||^2 = (x.y)
    => t = (x.y)/||y||^2
    **Substitute t; the proof is complete.
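The derivative argument above can be checked numerically. This is only a minimal sketch in plain Python: the vectors `x` and `y` are arbitrary example values, and the helper names `dot` and `p` are made up for illustration rather than taken from the video.

```python
def dot(u, v):
    """Standard dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def p(t, x, y):
    """p(t) = ||t*y - x||^2, the squared distance from x to the point t*y."""
    d = [t * yi - xi for xi, yi in zip(x, y)]
    return dot(d, d)

# Arbitrary example vectors, chosen only for illustration.
x = [1.0, 2.0, 3.0]
y = [4.0, -1.0, 2.0]

# Critical point from p'(t) = 2 t ||y||^2 - 2 (x.y) = 0.
t_star = dot(x, y) / dot(y, y)

# p is no smaller anywhere on a grid of other t values around t_star.
grid = [t_star + k / 100.0 for k in range(-500, 501)]
is_minimum = all(p(t, x, y) >= p(t_star, x, y) - 1e-9 for t in grid)

# p(t_star) >= 0 rearranges to (x.y)^2 <= ||x||^2 ||y||^2.
cauchy_schwarz_holds = dot(x, y) ** 2 <= dot(x, x) * dot(y, y)

print(t_star, p(t_star, x, y), is_minimum, cauchy_schwarz_holds)
```

A grid check like this is no proof, but it makes it easy to see that t = (x.y)/||y||^2 is the same value as b/2a with a = y.y and b = 2(x.y).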

  • @revooshnoj4078
    @revooshnoj4078 6 years ago +101

    I think Sal failed to explain a key insight here. First, let's break down P(t) = ||ty-x||^2. Notice that ty is a parameterization of a line, so ||ty-x||^2 is the squared distance between a point on that line and the vector x. What we want to do is find the minimum distance between the vector and the line. Since the function P(t) = at^2-bt+c attains its minimum at t = b/2a (notice the negative sign in the function), we substitute that in to get the minimum squared distance, and we get the inequality from the fact that P(t) >= 0 (squared lengths are never negative).
    Note: If x lies on the line ty, the minimum distance is zero, which explains the equality case and why one vector must be a multiple of the other.

    • @장용연-f6v
      @장용연-f6v 5 years ago +2

      Thanks, that cleared up a lot.

    • @anjalip7949
      @anjalip7949 5 years ago +2

      Hello, can you please elaborate on the minimum distance part?

    • @lraoux
      @lraoux 3 years ago +1

      THANK YOU

    • @lraoux
      @lraoux 3 years ago +1

      @@anjalip7949 If x and y are linearly dependent, x lies on the line ty and you get the minimum distance 0. If they are linearly independent, the minimum distance is strictly positive. Using ||ty+x||^2 instead of ||ty-x||^2 also works: it is still never negative, and the minimum simply moves to t = -b/2a.

    • @dineshganyarpawar5441
      @dineshganyarpawar5441 3 years ago

      how to calculate the minimum of at^2-bt+c?

  • @possumsam2189
    @possumsam2189 8 years ago +17

    At 8:17, the intuition for using b/2a is that t = b/2a is the minimum point of the curve. By completing the square, we get a(t - b/2a)^2 - (b^2)/(4a) + c. Since a is positive, t = b/2a is the minimum point of the curve. So the reason for choosing b/2a is to evaluate the inequality at the point closest to 0. Can anyone confirm or deny?
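Written out, the completing-the-square step looks like the sketch below. It takes a = y·y, b = 2(x·y), c = x·x, matching the video's P(t) = at^2 - bt + c, and assumes y is not the zero vector so that a > 0.

```latex
\begin{aligned}
P(t) &= a t^2 - b t + c
      = a\left(t - \frac{b}{2a}\right)^2 + c - \frac{b^2}{4a},\\[4pt]
0 \le P\!\left(\frac{b}{2a}\right) &= c - \frac{b^2}{4a}
\;\Longrightarrow\; b^2 \le 4ac
\;\Longrightarrow\; (x \cdot y)^2 \le \|x\|^2\,\|y\|^2 .
\end{aligned}
```

Because the squared term is never negative, t = b/2a gives the smallest value of P, so it is the choice of t that yields the tightest inequality.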

  • @Mariomation3275
    @Mariomation3275 4 years ago +3

    You are my motivation. I think your videos are more complements to classes than actual lessons. They are so diverse and inclusive that even if you have studied a topic completely, you're still guaranteed to learn something new.

  • @Augustine_354
    @Augustine_354 10 years ago +51

    Very interesting, but I've got one problem. From where did you get this p(t) artificial function?

    • @bartalor
      @bartalor 7 years ago +1

      He started with an expression that has t in it, and he explained at the beginning why this expression is always greater than or equal to zero regardless of the value of t;
      later he just started referring to that expression as a function of t.

    • @revooshnoj4078
      @revooshnoj4078 6 years ago +12

      The function P(t) describes the squared distance between a vector and a point on the parameterized line ty. Essentially, what we're trying to do is find the minimum distance between that line and that vector. Since P(t) = at^2-bt+c, its minimum is attained at t = b/2a; plugging that in, we get the desired inequality. Also notice that if the vector lies on the line, the minimum distance is zero, hence equality, and that is why one vector must be a scaled version of the other.

  • @jpfry
    @jpfry 14 years ago +10

    Awesome proof!
    I think an argument with the discriminant would be a little more natural, rather than plugging in b/2a which takes a bit of foresight and might seem arbitrary.
    For those asking, the Cauchy-Schwarz inequality is extremely useful in Linear Algebra, Analysis, and probability. A good amount of proofs in mathematics use this inequality (even in physics too, where CS is used to arrive at the Heisenberg uncertainty principle).

  • @thetruereality2
    @thetruereality2 7 years ago +4

    Is there some specific reason why you defined P(t) = ||ty-x||^2? Could you define it with some other vector instead?
    Also, later in the proof you evaluated p(t) at b/2a, but you did not mention the reason for that.
    It was a nice explanation. I would be grateful if you explained the above questions.
    Thank you

  • @sdfsgdsfgsdfg
    @sdfsgdsfgsdfg 9 years ago +24

    8:06 could use some motivation for t=b/2a

  • @rinkaghosh7961
    @rinkaghosh7961 4 years ago +2

    Why choosing the function p(t) ? And then t=b/2a ? It might be better if some purposes were told for choosing the function and then that particular value of t

  • @JaySomething
    @JaySomething 12 years ago +2

    I think it's more convenient to just use the identity that the dot product of any two vectors is the product of their magnitudes times the cosine of the smallest angle between them. That proves the inequality, and the equality when the vectors are parallel, since theta (the smallest angle) is 0 and cos 0 is 1. For any other, non-parallel pair of vectors, since cos theta has a range between -1 and 1, we divide both sides by the magnitudes to get 1 >= cos theta, which is true.

  • @energysage9774
    @energysage9774 11 years ago +2

    Why not just say A dot B = |A||B| cos(theta), and the largest value cosine can contribute is 1, giving A dot B <= |A||B|. |cos(theta)| = 1 occurs at 0 and pi radians (or 0 and 180 degrees), meaning A and B must be parallel or antiparallel. Either way, this means they are scalar multiples of one another.

  • @rikard92
    @rikard92 11 years ago +6

    How do I know what constant to put into P(t) to prove the theorem? It seemed like you knew the answer and used the "answer" (the constant t, which was b/2a) to "prove" the theorem.

  • @Furkan-yv5ew
    @Furkan-yv5ew 3 months ago

    Since I don't really know the definitions, I can't prove them by myself, and I can't find a starting point for this. Is it necessary to know some linear algebra to understand topology?

  • @borissimovic441
    @borissimovic441 10 months ago

    Where does the P(t) function come from, what is the intuition for using it?

  • @jakubnage
    @jakubnage 15 years ago

    Hello Salman Khan. I love your videos and have used them for calculus. You are truly an asset and valuable resource to any student.
    The main purpose of this comment (if you see it) is to ask what you recommend I use with my Wacom Bamboo tablet whilst taking notes in class. In terms of functionality, my only preference is that I could have typewritten text together with my tablet writing, which would be diagrams.
    Anyone who feels like answering this, comment here or message me

  • @amyluo3314
    @amyluo3314 12 years ago +2

    Thank you soo much for giving us free lessons!
    Education is the most valuable thing ever.

  • @jingyiwang5113
    @jingyiwang5113 1 year ago

    I am really grateful for this video! You have provided a really clear explanation about this inequality. Thanks! My mathematics exam is approaching. I am so nervous about it.

  • @pjgcommunity3557
    @pjgcommunity3557 7 years ago +1

    The equation at^2 - bt + c in the video is very similar to the equation for distance after t seconds when the acceleration is an arbitrary constant ((1/2)at^2 + bt + c).

  • @Ashkepu
    @Ashkepu 12 years ago +1

    Where is the equation p(t) = ||ty-x||^2 from? How did you choose it?

  • @matejvidovic9026
    @matejvidovic9026 2 years ago

    8:03 why did we choose b/2a as our value? how did we get this? so confused!

  • @samuelbassey6806
    @samuelbassey6806 10 months ago

    Well explained sir, thanks for sharing

  • @ericshindola3308
    @ericshindola3308 7 months ago

    Awesome delivery there; at least I have understood.

  • @llewsub
    @llewsub 1 year ago +1

    Why does the video description say "Linear algebra describes things in two dimensions" which is just not true lol?

  • @PoojaSharma-rz2bg
    @PoojaSharma-rz2bg 9 months ago

    Why have we put t = b/2a?
    Sir, please tell 🙏

  • @林某-p3x
    @林某-p3x 8 years ago

    can you say something on ||>=?

  • @nadzTube
    @nadzTube 15 years ago +1

    Sal -- are you dedicated full time to your academy?

  • @mucklesandwich
    @mucklesandwich 11 years ago

    It could be any vector squared to begin with. He just picks the vector 'ty-x' because without breaking any rules of mathematics it can give him the result he wants. He uses that specific vector because the thing he's trying to get has x's and y's in it. And because any vector can be written as 'ty-x' (ie some scalar multiple of a vector minus some other vector) then what he proves for this example must hold true for all examples.

  • @looploop6612
    @looploop6612 7 years ago +5

    why t=b/2a ?

  • @laag4
    @laag4 15 years ago

    It's a max/min thing: if you're searching for the vertex of a parabola, it gives you the horizontal coordinate of the vertex. (I think; it has been a while since I've really done that.)

  • @gbriceno
    @gbriceno 7 years ago +1

    Khan, great proof and explanation of the Cauchy-Schwarz inequality. Could you explain the use of P(t), and why the function was chosen as presented, as a difference of two vectors? I'd like to re-run the proof using the sum of two vectors to see if anything is lost in this proof.

  • @blackphoenix1207
    @blackphoenix1207 13 years ago

    I'm confused, shouldn't the equality iff cy = x have another part to it, where we start off with just x & y and prove that they're scalar multiples...

  • @evamari1900
    @evamari1900 2 years ago

    This was very helpful

  • @mercyonize7882
    @mercyonize7882 4 months ago

    Thank you so much sir 🙌🏼

  • @SkunZielonyJakMech
    @SkunZielonyJakMech 7 years ago

    I think it's much easier to prove that (x1y2-x2y1)^2 >= 0, where x1,x2 and y1,y2 are the coordinates of vectors x and y.
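For two-dimensional vectors, the observation above can be spelled out as an identity. This is a sketch for R^2 only; in R^n the right-hand side would become a sum of such squares over all pairs of coordinates.

```latex
\begin{aligned}
\|x\|^2\,\|y\|^2 - (x \cdot y)^2
 &= (x_1^2 + x_2^2)(y_1^2 + y_2^2) - (x_1 y_1 + x_2 y_2)^2 \\
 &= x_1^2 y_2^2 + x_2^2 y_1^2 - 2\,x_1 x_2 y_1 y_2 \\
 &= (x_1 y_2 - x_2 y_1)^2 \;\ge\; 0 .
\end{aligned}
```

Equality holds exactly when x_1 y_2 = x_2 y_1, which is the condition for x and y to be linearly dependent in R^2.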

  • @LAnonHubbard
    @LAnonHubbard 14 years ago

    On third viewing, it's making more sense. My problem was that even though I could understand each step, I wasn't getting any intuition from it. At 16:50 he says in future videos he'll give intuition as to WHY it makes sense. I will rest easy again now!

  • @Shaiifalii
    @Shaiifalii 4 years ago

    Are magnitude and norm the same, or interchangeable?

    • @GianMarcosAguilar
      @GianMarcosAguilar 2 years ago

      magnitude, length and norm are the same (commenting in case others have the same question in the future)

  • @ANKITKUMAR-zb9nw
    @ANKITKUMAR-zb9nw 3 years ago

    Hey, x = cy. Does c have to be positive here, or could it also be negative, for |x.y| = ||x|| ||y|| to be true?

  • @frr
    @frr 13 years ago

    @regingwapo: The only if part is the assumptions. Assumptions are usually given as true and thus don't need to be proven. For example, he/she is human only if he/she is a man or woman. That only if part is given to be someone either male or female and is unnecessary to prove.

  • @mucklesandwich
    @mucklesandwich 11 years ago

    Maybe what you're asking is deeper than that... If you're asking how the first guys who did this knew what function to start with, then... that's a good question.... it possibly came out of exploring and playing around with squaring the magnitudes of vectors and seeing what happened. when this popped out they realised it made a definite statement about all vectors, a statement that could come in very handy in the future.

  • @riggsrevenge
    @riggsrevenge 12 years ago

    @bach1229 It's a teaching strategy to have you remember it better, sir. It's not just for mathematicians either; it can be used in any subject. I use it for language: for example, if I were to teach English and translate to Spanish, I would use red for English and blue for Spanish. There is lots of research behind it.
    It works most of the time.

  • @eddielloyd1947
    @eddielloyd1947 7 years ago +2

    7:50
    Why didn't Khan just take the discriminant of P(t) as less than or equal to zero? Consequently (b/2)^2 is less than or equal to ac and Cauchy-Schwarz is proven.
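The discriminant argument can be spot-checked numerically. This is a minimal sketch in plain Python under stated assumptions: the helper names `dot`, `discriminant` and `random_vector` are made up for illustration, and integer entries are used so the arithmetic is exact.

```python
import random

def dot(u, v):
    """Standard dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def discriminant(x, y):
    """Discriminant b^2 - 4ac of p(t) = a t^2 - b t + c = ||t*y - x||^2."""
    a = dot(y, y)
    b = 2 * dot(x, y)
    c = dot(x, x)
    return b * b - 4 * a * c

random.seed(0)  # fixed seed so the run is repeatable

def random_vector(n):
    # Integer entries keep every product and sum exact.
    return [random.randint(-9, 9) for _ in range(n)]

# Since p(t) >= 0 for all t, the discriminant should never be positive.
largest = max(discriminant(random_vector(4), random_vector(4)) for _ in range(1000))
print(largest <= 0)                  # → True

# For parallel vectors p(t) touches zero, so the discriminant is exactly 0.
print(discriminant([1, 2], [2, 4]))  # → 0
```

Dividing b^2 - 4ac <= 0 by 4 gives (x.y)^2 <= ||x||^2 ||y||^2, which is the inequality without ever choosing a particular value of t.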

  • @xoppa09
    @xoppa09 7 years ago

    15:29 here you have | || y||^2 | not || y||^2. But they are equal.

  • @Dusty1298
    @Dusty1298 12 years ago

    Is this somehow related to the quadratic formula? While proving the CS inequality at one of the steps he got b^2-4ac is less than or equal to 0. Is that just because he started from a quadratic equation or is there another reason?

  • @visaeris
    @visaeris 14 years ago

    Omg... this makes things sooo clear...

  • @diamondglitter205
    @diamondglitter205 6 years ago

    It becomes much easier if I use the other dot product formula, X.Y = ||X||.||Y||.cos(theta). But I know he supposes we don't know this formula yet.

  • @shahadas907
    @shahadas907 1 year ago

    Fantastic!

  • @hitashasharma2178
    @hitashasharma2178 4 months ago

    You make maths fun

  • @freezingbeast
    @freezingbeast 14 years ago

    Isn't this much much much easier to prove if you introduce the dot product and do an inequality equation with -1

  • @Harrykesh630
    @Harrykesh630 4 months ago

    what are Non Real vectors ?

  • @blackphoenix1207
    @blackphoenix1207 13 years ago

    wait, but the CS equality is an if and only if proof, you've only proved one way assuming that x = cy (where x & y are vectors and c is a scalar) and then plugged it in what about the other way around? Maybe I'm not looking at it right..

  • @JDMaxton1999
    @JDMaxton1999 4 years ago

    11:26 Ok I've seen it proven like this other times. I still don't get how b^2 - 4ac is allowed to be less than zero. Isn't this pretty much allowing there to be negative square roots if you were to use the quadratic equation on this theoretical equation???

  • @frr
    @frr 13 years ago

    @regingwapo: The only if part is the assumptions. Assumptions are usually given as true and thus don't need to be proven.

  • @PrUnEJuIcEtHeThIrD
    @PrUnEJuIcEtHeThIrD 12 years ago

    What's the intuition behind plugging in b/2a to p(t)?

  • @anjalip7949
    @anjalip7949 5 years ago

    Thank you, Sir.

  • @MarceloSousaWoody
    @MarceloSousaWoody 10 years ago

    How can I prove that equality in the Cauchy-Schwarz inequality holds only when the vectors are linearly dependent?

  • @frei-math-quantum
    @frei-math-quantum 9 years ago +3

    Why evaluating precisely at that value? (Please more than just "it works".)

    • @inteusproductions
      @inteusproductions 9 years ago +2

      -b/2a is the turning point of a parabola at^2 + bt + c (so it is b/2a for the video's at^2 - bt + c).

    • @Nightmare1066
      @Nightmare1066 9 years ago

      +FreeziiS On Khan Academy's site there's a comment that explains what Sal is doing.

    • @frei-math-quantum
      @frei-math-quantum 9 years ago

      +Darko Bakula (Nightmare1066): I myself know the answer but it would be an improvement for your video. ;)

    • @ElizaberthUndEugen
      @ElizaberthUndEugen 7 years ago

      @FreeziiS If you know the answer, would you care to explain? I can't find the comment on the Khan Academy site that allegedly explains the choice of t=b/2a.

  • @donjisto
    @donjisto 8 years ago

    For all you guys who want to know why b^2 - 4ac needs to be < or = to zero go and read my tip/comment on this video in khanacademy.

  • @yasminho195
    @yasminho195 2 years ago

    Thank youuuuuuu!!!

  • @shenglanliu4197
    @shenglanliu4197 3 years ago

    Unfortunately, you did not prove the if and only if. Basically: in a real inner product space where u and v are two vectors, given |u.v| = ||u|| ||v||, then u and v are linearly dependent.
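The equality case can be illustrated with a small check. This is a minimal sketch in plain Python, not a proof: the helper names `dot` and `cs_gap` and the example vectors are assumptions made up for illustration.

```python
def dot(u, v):
    """Standard dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def cs_gap(x, y):
    """||x||^2 ||y||^2 - (x.y)^2, which is zero exactly when |x.y| = ||x|| ||y||."""
    return dot(x, x) * dot(y, y) - dot(x, y) ** 2

y = [2, -1, 3]

# x = -3y: a scalar multiple (with a negative scalar), so the gap vanishes.
print(cs_gap([-3 * c for c in y], y))  # → 0

# x not on the line through y: the gap is strictly positive.
print(cs_gap([1, 0, 0], y))            # → 10
```

For y not equal to zero, the gap equals ||y||^2 times the minimum of p(t) = ||ty - x||^2, so a zero gap forces ty = x at the minimizing t. That is the "only if" direction, and it also shows that the scalar is allowed to be negative.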

  • @xoppa09
    @xoppa09 7 years ago

    13:48 what do you mean principal square root

  • @teymurahmadov5804
    @teymurahmadov5804 4 years ago +1

    Very nice, I understand everything.
    I am 10,
    but I understand you too.
    This video is very interesting.
    🕳😀😀😀😀😀😀😀

  • @usama57926
    @usama57926 5 years ago

    sir thank u so much

  • @benshapiro9630
    @benshapiro9630 8 years ago

    what if you use complex numbers?

  • @regingwapo
    @regingwapo 13 years ago

    @frr Only if means that it can't be something else, so you have to prove that that something else doesn't produce the same conclusion.

  • @medchs
    @medchs 4 years ago +1

    A second-degree polynomial with positive leading coefficient is non-negative for all real t (has at most one real root) iff its discriminant is less than or equal to zero.

    • @legofanas
      @legofanas 4 years ago

      If D < 0 it has 0 real solutions

  • @markobe08
    @markobe08 6 years ago

    09:37
    how can we multiply b^2/2a by 2 and leave the rest untouched?

    • @the61stbookworm78
      @the61stbookworm78 5 years ago

      I know this reply comes two months late, but I hope it can still be of some use. When b^2/2a is multiplied through by two, it is just being changed to an equivalent fraction. An equivalent fraction has the same value so it does not need to change the rest of the equation. Just like if you add 1/2 or 2/4 to a number you'd get the same result both times.

  • @lauelibre
    @lauelibre 6 years ago +5

    Thanks but how could I ever think of this from scratch on an exam?

    • @oneinabillion654
      @oneinabillion654 5 years ago

      If you learn the scalar product rule, you can remove the cos x: the side that had cos x then becomes greater than or equal to the other side, since cos x only ranges from -1 to 1. This produces the same inequality.
      You can quickly test it on a piece of paper.

    • @zackwyvern2582
      @zackwyvern2582 5 years ago

      @@oneinabillion654 Prove without invoking triangle inequality or dot product geometric definition, that is, from scratch.

    • @oneinabillion654
      @oneinabillion654 3 years ago

      Ahh, I thought she wanted a fast way to know the expression, since from scratch it's long and troublesome. And I don't think any exam tells you to derive this literally from scratch, so...
      Anyways, thanks for the reminder

    • @oneinabillion654
      @oneinabillion654 3 years ago

      @glyn hodges O.O

  • @SomethingSoOriginal
    @SomethingSoOriginal 12 years ago

    Very helpful, thanks.

  • @zakariaislam9891
    @zakariaislam9891 6 years ago

    Can't we just prove it from the definition of the dot product,
    a.b = |a||b|cos(theta)? The maximum value of this is at theta = 0, which gives a.b = |a||b|cos 0 = |a||b|.
    For any other value of theta, a.b < |a||b|.

  • @Waranle
    @Waranle 15 years ago

    Thank you Sal!

  • @kniinortey31
    @kniinortey31 3 years ago

    I keep hearing "in the last video". What’s the last video?

  • @regingwapo
    @regingwapo 14 years ago +1

    you didn't prove the "only if" part

  • @jabranehcini1674
    @jabranehcini1674 3 years ago

    A big salute.
    You exist, mashallah.

  • @LAnonHubbard
    @LAnonHubbard 14 years ago

    Have to say I struggled a bit with this so I'm going to find another source.

  • @sapnaahirwar9760
    @sapnaahirwar9760 4 years ago

    One little problem I have: please don't use this display, with yellow-green highlighter on black! 🙏

  • @demetriusdemarcusbartholom8063
    @demetriusdemarcusbartholom8063 2 years ago

    ECE 485 UofA

  • @anmolagrawal5358
    @anmolagrawal5358 5 years ago +2

    3:11 "It was 2 videos ago"--😂😂

  • @dannysher1010
    @dannysher1010 9 years ago

    good stuff! I got it!

  • @ferdinandflames8172
    @ferdinandflames8172 2 years ago

    Where's the denominator b?? 🤔

  • @cosinusarcus8907
    @cosinusarcus8907 11 years ago

    it is really nice proof!! :) thanks

  • @bopeng9504
    @bopeng9504 8 years ago +6

    complicate an easy thing.

  • @shifterdude
    @shifterdude 15 years ago

    awesome

  • @iRapplexD
    @iRapplexD 7 years ago

    all we need is 3-4 steps... talking about efficiency

  • @IJahan-k1q
    @IJahan-k1q 4 months ago

    14 years ago !

  • @click4nat
    @click4nat 15 years ago

    I guess it is just necessary for the proof.

  • @Reonaru
    @Reonaru 13 years ago

    Idol !!!

  • @FridolinH
    @FridolinH 7 years ago

    It's Schwarz. If I said it in German the way you wrote it, I'd have a lisp :D

  • @tonmandude
    @tonmandude 14 years ago +4

    Man, this stuff is so hard, I don't know how to do my math hw ;_;

    • @r4viity414
      @r4viity414 4 years ago +1

      Did you finish your homework yet?

  • @alicenlucy
    @alicenlucy 8 years ago

    is it me or is there no audio??

  • @jjjeykey
    @jjjeykey 10 years ago +6

    I like your videos, but I would find them even better if you didn't sound so bored.

  • @muhammeteneseris6752
    @muhammeteneseris6752 6 years ago +2

    Nice, but it's in English; who knows what he's saying.

  • @henrychan720
    @henrychan720 4 years ago

    Just... draw a triangle?

  • @ElizaberthUndEugen
    @ElizaberthUndEugen 5 years ago +1

    Why do a 16 min video when you can prove this in ONE line?
    |x . y| = |cos(x, y) |x| |y|| which is bounded above by |x| |y| and below by 0
    DONE
    ...

  • @emanuellandeholm5657
    @emanuellandeholm5657 2 years ago

    Great video!
    Cow she tho... :D You need to work on that pronunciation.

  • @blockobutter
    @blockobutter 4 years ago

    Instead of learning Ruby frameworks or multithreading devices, I'm here learning about this useless (to me) inequality that I will never see again.... College is dogshit sometimes. BRAVO PRACTICALITY!
    Anyway, good video. Just the moron mathematicians that haven't worked a day in the software development field making me watch this shouldn't be in charge of course building...

  • @okshuvro2996
    @okshuvro2996 4 years ago

    too hard -_-

  • @quant-prep2843
    @quant-prep2843 3 years ago

    very bad explanation of proof

  • @frankensteinmoneymac
    @frankensteinmoneymac 14 years ago

    You didnt prove anything...you just wrote a bunch of stupid little squiggles on the screen then said a bunch of big words and stuff. What's a vector anyway? You keep saying vector. Then you say things like POSITIVE number...like its happy or something. Dude numbers cant be happy!!! You're stupid.
    PS Im like REALLY stoned right now....plus I dont know what I am talking about.
    Also I am tripping on some shrooms. I dont know if that makes a difference, but I am. And I think I love you.