You make research papers sound so easy. Thank you for explaining them in simple terms.
Thank you so much, honestly. For this one I spent a lot of time and I was quite skeptical about whether I could explain it well. Thanks for sharing your feedback and support 🙏🏾
You have a real talent for explaining hard scientific papers in an easy way. Thanks from Poland!
Thanks so much for the kind words. I'm glad this one worked out well
@@1littlecoder you got a new subscriber, I binge-watch your videos in the evening. Great job mate
@@ineffige Thanks mate!
Holy shit, we might be able to do mind uploading with techniques like this. If you can recover your architecture by doing inference over all your skills, then write down and encode all your memories into a memory system, then use color-blindness tests and hearing tests to encode your qualia parameters, then run inference tests over all your motor outputs in different eye-tracking VR tasks, you could theoretically create a near-perfect copy of your architecture. Damn, I might start a cult lol
Hehe 😂
Love it
Isn't that what Facebook does :) ?
Great explanation for such a complex paper ❤
Thanks very much. Glad it turned out well
Great job! Thank you so much! These paper videos are your most valuable contributions, at least in my opinion. You're not afraid to go into the details, unlike 2 Minute Papers (his videos have their own type of value, bringing attention from the broader public to AI research).
Wow, thank you! I appreciate your kind words for the efforts
Imagine Elon Musk stealing the source from OpenAI's models and open-sourcing them XD
Elon RobinHood Musk
Giving people's intelligence back to people 😅
First he will do some publicity stunt
@@cig_in_mouth3786 then he will fail to deliver what he promised.
Finally he’ll wait another few weeks and promise something new.
I think it'll probably be the opposite: CloseAI and ShallowMind stealing from Claude or Mistral
Does this research reveal anything about OpenAI's models that was previously unknown?
The hidden-layer dimensions of Ada and Babbage, the undisclosed dimension of GPT-3.5, and the final projection weight matrices. All of these were previously unknown.
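For intuition, here's a rough sketch of the core trick as I understand it (not the authors' code; get_logits is a hypothetical placeholder for an API call that exposes full logits): query the model on many prompts, stack the logit vectors into a matrix, and count its significant singular values, which is roughly the hidden dimension.

import numpy as np

def get_logits(prompt: str) -> np.ndarray:
    # Hypothetical placeholder: return the full logit vector (length = vocab size)
    # for one prompt from the target model's API.
    raise NotImplementedError("replace with a real API call that exposes logits/logprobs")

def estimate_hidden_dim(prompts, tol=1e-4):
    # Each logit vector is W @ h for a hidden state h of dimension d, so stacking
    # n > d of them gives a matrix whose rank is (at most) d.
    Q = np.stack([get_logits(p) for p in prompts])
    s = np.linalg.svd(Q, compute_uv=False)  # singular values, largest first
    return int(np.sum(s > tol * s[0]))      # number of "significant" ones, roughly d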
Nicely covered. Thank you
Damn :-) I would've leaked everything to the dark web. Would love to force the 'open' part of OpenAI...
I wonder if we can use something similar to get the dimension of our simulation (Wolfram's ruliad).
Nice research
Kudos to the researchers!
I am looking for Devin, the AI software engineer
Sad this didn't come from someone a little more gray-hat-ish who would actually release all the details.
I think the definition of a black-box model would be that we do not have access to the model's weights, architectural information, etc., rather than that we cannot understand the model's weights. I think this paper falls into the category of adversarial machine learning. To my limited knowledge, there are 4 categories of attacks in AML, which are:
1) Evasion attack
2) Data poisoning attack
3) Extraction attack
4) Inference attack
And this particular attack falls into the category of an extraction-based attack, which means we are trying to extract information about the model.
you just send an empty prompt “ “
Stealing is all you need.
Probably someone else will work on this paper title 🤣
lmao
Why are there no LLMs based on spiking neural networks?
Given that RNNs are making a comeback, we never know!
There is, it is called SpikeGPT, but it hasn't gotten any traction yet, because spiking neural networks are quite particular to implement and many of our technologies don't fit well with them. I think that if your plan is to just translate normal neural networks trained with backpropagation into spiking neural networks (or just train them directly using surrogate gradients), then you are already losing all the supposed benefits of a different architecture like SNNs, so it is kinda pointless, except for the better cost and possibly the scalability on specialized hardware.
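For anyone curious, "surrogate gradients" just means keeping the hard spike in the forward pass but swapping in a smooth function for the backward pass so backprop still works. A minimal PyTorch-flavoured sketch of that idea (my own illustration, not SpikeGPT's actual code):

import torch

class SpikeSurrogate(torch.autograd.Function):
    # Forward: hard threshold, emit a spike when the membrane potential crosses 0.
    @staticmethod
    def forward(ctx, v):
        ctx.save_for_backward(v)
        return (v >= 0).float()

    # Backward: pretend the spike was a steep sigmoid so gradients can flow.
    @staticmethod
    def backward(ctx, grad_out):
        (v,) = ctx.saved_tensors
        sig = torch.sigmoid(4.0 * v)             # 4.0 is an arbitrary steepness choice
        return grad_out * 4.0 * sig * (1.0 - sig)

spike = SpikeSurrogate.apply  # use like any activation: spikes = spike(membrane_potential)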
What is a 1-bit LLM?
ua-cam.com/video/Gtf3CxIRiPk/v-deo.html
That doesn't have anything to do with the AI black-box effect. Black box means that we don't understand what happens INSIDE the AI model, how it thinks or how it makes its decisions. The thing that would make it not a black box is interpretability research. I'm sorry, but that's clickbait.
I quoted that from the paper. And I feel it's a great achievement in terms of learning the inner workings of a DNN just from API calls. I'm sorry if you feel it's clickbait!
@@1littlecoder yeah, but that's not what black box usually means; a much better term would be closed-source. The reason it's called a black box is that NO ONE knows what's inside, not that only some people know what's inside. For instance, Windows is a closed-source system, but people know what goes on inside of it, while an AI model can be open-source and people would still have no idea what exactly happens inside between the input and the output. That's why the AI interpretability field exists.
@@TheManinBlack9054 tbh, even before LLMs were popular, black box was a common tech term. We used to call Random Forest and XGBoost black-box models because they were not easily interpretable, unlike linear models.
Do you understand that the same approach also works for interpretability? An AI model understood and decoded how another AI model works; with some effort you could create one that translates that for us to understand.