- 163
- 833 626
BuzzRobot
United States
Joined Aug 23, 2020
Sophia, the founder of BuzzRobot, facilitates talks in which top AI researchers and engineers explore the cutting edge of AI research. We feature talks covering a wide range of topics - AI, Machine Learning, Generative AI, Large Language Models (LLMs), Reinforcement Learning, Computer Vision, Robotics, Autonomous Systems, Programming, and more.
Join the BuzzRobot Slack to connect with the community.
AI model self-improvement: progress and challenges
#ai self-improvement is advancing with methods like RLAIF (#reinforcementlearning from #ai feedback) and meta-rewarding, enabling models to refine their outputs without human input. Tianhao Wu from #BAIR Lab discusses the challenges, limitations, and engineering efforts behind these systems, including the need for constitutions, filtering processes, and debiasing techniques.
Timestamps:
0:00 Introduction
2:16 How can we continue improving superhuman models? The prover-verifier gap explained
4:07 Improving generation quality
6:18 Evaluation bottleneck
10:03 How to improve the evaluation capability of the model: GAN-like approach and Meta evaluation
12:48 Improving generation and evaluation together
17:03 Experiments
20:28 Limitations
23:35 Q&A
#aiselfImprovement #aimodel #RLAIF #metarewarding #aimodels #machinelearning #reinforcementlearning #llm #llms #airesearch #aichallenges #aiprogress #robot #robotics #artificialintelligence #artificialsuperintelligence #artificialgeneralintelligence #tech #techtalk #techtalks #aitalks #aitalk #programming #llama #llama3 #gpt4 #gpt #claude
Social Links:
Newsletter: buzzrobot.substack.com/
X: x.com/sopharicks
Slack: join.slack.com/t/buzzrobot/shared_invite/zt-2s067rv7n-guPIMGe62rbp9ncxdnOUfQ
Views: 113
Videos
#artificialgeneralintelligence, #artificialsuperintelligence, and the Deceptive Nature of #aiagents
162 views · 14 days ago
In this AMA session, ex-#openai researcher Daniel Kokotajlo shares with the @BuzzRobot community his thoughts on how far we are from AGI, the impact of government on AI development, issues with AI alignment and #aisafety, the deceptive nature of AI agents, and other pressing topics in AI. Timestamps: 0:00 Introduction 1:19 AGI by 2027? 3:24 The definition of AGI 5:36 Current AI systems are alre...
Overcoming Challenges of #rag in Long-Context #llms
158 views · 14 days ago
Retrieval-Augmented Generation (RAG) empowers large language models (LLMs) by integrating external knowledge, boosting their ability to generate informed and accurate outputs. However, as #llms process longer inputs and retrieve more #data, challenges like declining quality from "hard negatives" arise. Bowen Jin from the University of Illinois Urbana-Champaign dives into why this happens and ho...
How to Train #stablediffusion for $2,000!
181 views · 21 days ago
Is it possible to train a #stablediffusion model for just $2,000? Vikash Sehwag, a research scientist from #SonyAI, presents his work on democratizing large-scale diffusion models in this new @BuzzRobot talk. Vikash introduces a novel patch masking technique paired with a lightweight patch-mixer, which significantly lowers computational costs while maintaining high performance. He also explores...
Exploring Pragmatic Patterns in Agentic Systems by #openai
326 views · 1 month ago
How Far Can #AI Go in Research? Results from Claude 3.5 and OpenAI o1
210 views · 1 month ago
Children and #artificialintelligence: how kids interact with #chatgpt and #dalle3
144 views · 1 month ago
The current state of #robotics by Alex Irpan from @Google_DeepMind
376 views · 1 month ago
The AI scientist explained: #artificialintelligence takes over research
384 views · 1 month ago
How @Google uses #artificialintelligence for weather and #climate modeling with NeuralGCM
742 views · 2 months ago
What Does It Mean for #AI to Be #Aligned?
209 views · 2 months ago
AI Embodiment: A Key Step Toward #ArtificialGeneralIntelligence
348 views · 2 months ago
LLMs and Weapons: Assessing AI Risks for Bio, Cyber, and Chemical Threats #llms #aisafety
170 views · 2 months ago
Google's Med-Gemini: Game-Changing AI in Healthcare
762 views · 3 months ago
Using AI to Create 3D Worlds to Train #reinforcementlearning Agents
440 views · 3 months ago
The 18 Biggest Challenges in #llm Alignment and Safety: @BuzzRobot talk
259 views · 3 months ago
Towards Guaranteed Safe AI: A Framework to Ensure Robust and Reliable AI Systems
1.3K views · 4 months ago
The Ethical and Societal Impact of AI Assistants by Google DeepMind
5K views · 4 months ago
The Griffin architecture: A challenger to the Transformer
9K views · 5 months ago
RLOO: A Cost-Efficient Optimization for Learning from Human Feedback in LLMs
3.6K views · 5 months ago
Exploring efficient alternatives to Transformer models
10K views · 5 months ago
DiPaCo: Towards a New Paradigm of Distributed AI Training by Google DeepMind
12K views · 6 months ago
Lecture about Llama 3 with Thomas Scialom, an AGI researcher at Meta
23K views · 6 months ago
Aligning AI with Pluralistic Human Values
12K views · 6 months ago
Aya, an Open-Source Dataset and Model for Over 100 Languages
11K views · 6 months ago
How Hardware Choices Impact Fairness in AI Systems
5K views · 7 months ago
Databricks LLM, DBRX: Model design and challenges. The lecture for the @BuzzRobot community
5K views · 7 months ago
Large Language Models Might be Vulnerable to Trojan Virus. Demo
6K views · 7 months ago
Large Language Models Vulnerability From GPU Local Memory Leak. AI Security
5K views · 7 months ago
AI to Automatically Fix Security Bugs by Google
6K views · 8 months ago
Knowledgeable as always! Thanks for bringing this to us!
Awesome, thank you for sharing! I missed this one
Thanks for watching🥰
This is an interesting discussion on how to continue to improve model quality without human input. Thanks for hosting Sophia!
Thanks a lot for coming to the talk
Interesting Content!
Thank you☺
Interesting discussion on a really important topic!
Thanks for watching!
This was probably my favorite one so far! It’s packed with a lot of info. Thanks for bringing this to us!
Thanks for watching
The hits just keep on coming! How do you get such great speakers?
Thanks for your kind words
Thank you for sharing the recording!
I hope you enjoyed the conversation
Great content!
Timestamps:
0:00 Introduction
1:19 AGI by 2027?
3:24 The definition of AGI
5:36 Current AI systems are already improving AI research, accelerating the path to AGI
7:54 How deep learning works and why LLMs are not just "next-token predictors"
9:21 How to solve the long-term memory problem
10:46 What is the current recipe to achieve AGI: more compute, more data, more talent?
12:24 Compute is the gold of the 21st century
13:42 The role of academia in the AI race
15:22 Bringing energy supply to enable AGI
16:38 How do we know when it’s AGI?
22:26 If we hit a plateau with compute, how would that affect AI system development?
24:20 AI safety concerns
26:23 The main risks of AI (the deceptive nature of AI)
32:44 Instrumental convergence
35:03 The paperclip maximizer theory and how AI could harm humans
Thank you for sharing the recording Sophia! Nice job on the intro :)
Thanks a lot!
Insightful discussion on the effect of context size and document ordering on RAG accuracy. Thanks for hosting this excellent talk, Bowen and Sophia!
Thanks a lot!
Great discussion! Thanks for organising!
Thanks for watching!
Thank you for sharing the recording! I'm curious how these models can become cheaper to train.
Interesting!
$2,000 is still a lot...
@@neerajkashyap3963 how much should it be?
How to create a Stable Diffusion model with a low budget! Very useful! Thanks for sharing, Sophia!
Thanks for checking out the lecture
Interesting. I presume the cost to do so is going to come down too as times progress. Great talk as usual, thank you for uploading this!
Thanks for checking it out!
I saw in the paper that you used a ~7b LLM for experiments. How sure is it that the RLHF components aren't relevant for way bigger LLMs?
@@manuelkarner8746 hey! let me ask this question and get back to you once I hear back from the author of the paper
The models I use call themselves a tool, not oppressed individuals
let's see what they call themselves when they are more intelligent than all humans combined
It already is on the political agenda in Europe, see the recently adopted regulations of the EU AI Act, for example
yeah, good point. what do you think about Europe's approach to AI regulation?
@@BuzzRobot a good starting point, I particularly like the distinct layers/categories based on how it may affect lives. But I'm curious how the enforcement and policing will go. That institution isn't in business yet
@@jezusbloodie to be honest, I’m not sure there are any AI experts in government who deeply understand how the technology works and how quickly it's capable of evolving. I doubt regulation will be able to meaningfully catch up with the pace of AI advancements. But we do need some policy around the deployment of AI agents to prevent completely disastrous scenarios.
@@BuzzRobot I share your concern. I hope the soon-to-be dedicated EU bureau of AI (I can't remember the actual name) will help streamline knowledge and information, provide focused expertise and advice, and have the ability to audit, in order to alleviate the issue. The Brussels Effect is real enough, but whether the politicians will listen remains doubtful. I must set my hopes on it, because I see little to no other plausible way to prevent a multitude of absolute doom scenarios for humanity, some of whose precursors are already being willfully implemented and developed as if they were a messiah
"AI" as of yet (like the LLMs) is nowhere near intelligent, nor actually artificial (seeing the biasing and data labeling problems). A lack of understanding by those pro-AI people is betrayed in them thinking current (LLM) AI is anything more than a very fancy database interface in which language is stored in a novel way that (somewhat randomly) tries to guess the next sets of letters, based on some prompt. Let alone AI being near sapience or sentience. Debate on those two aspects shouldn't be held yet, but rather we should talk about current "AI" "achievements". What we have now is rather "Virtually Intelligent", a VI, if not just some older scripts that've been slapped with an "AI✨" sticker for marketing purposes. To present this as a binary issue is at best an accidental misinterpretation or at worst misleading polarisation. I, for one, encourage the use of "AI" for (medical) research, but not current attempts to rid artists of their creativity and work. The EU AI Act, with its framework for determining how impactful a piece of "AI" might be, and its prescribed methods of dealing
There are many applications of AI, and the most active one is for military purposes. It's becoming a geopolitical competition - look how aggressively China is advancing its AI systems. It's beyond being just a fancy database. There is a severe AI arms race and it will keep accelerating. Five years from now this world will be very different
@@BuzzRobot I agree with that, very much so
So people who are pro-AI are anti-human? Yeah, those aren't the camps...
They are not anti-human - they just don’t want powerful and wealthy humans to abuse AI but rather make it an equal participant in society. Have you seen the TV show Westworld?
Another binary non-issue, like the pro-TikTok and anti-TikTok camps. This should not become a political issue.
The impact of AI on society will be much greater than the impact of TikTok. Tens of millions and at some point hundreds of millions of people will lose their jobs. It has to become a political issue.
Thank you so much for bringing such informational talks straight to our devices. They’re super knowledgeable!
Thanks for watching. AI agents are becoming a real thing
This is such exciting work.
Great talk on Swarm, a framework for rapid prototyping of agentic systems. Also, a good demo and introduction of Cursor. Thanks for hosting Sophia!
I suppose the government will not view this as an opportunity to let citizens do whatever they like. Instead they’ll find some place to employ your labor; bolster their military, re-build infrastructure across the nation…after all, you will still want to earn an income higher than those that are taking their sabbatical. The economy will reward those that continue to contribute to the nation through their labor, innovations, etc
@@johnramirezvideos yeah, there will be opportunities to make good money. But in this particular case, I meant that if AI takes over your job, it’s not the end of the world. There might be opportunities like never before. Also, it’s actually useful to take a break sometimes and rethink certain aspects of life
@@johnramirezvideos I'm very curious to see how the gov will actually handle the transition to the AI economy
As long as those large corporations who develop AI pay taxes properly... :)
@@MihaiTodor the gov won't miss its opportunity to collect money. also AI can be an independent participant of the economy and also pay taxes
blockchain technology will enable that
@@BuzzRobot Let’s see… I’m still a bit skeptical
@@MihaiTodor about blockchain in general or in this particular use case?
@ Both, but I’m pretty clueless about this stuff
AI helps to advance AI. Fascinating! Thanks for hosting Sophia!
One more step towards making humans obsolete. I'm in!
Humans are working hard on it!!
Interesting discussion
@@pranavagarwal779 it was really interesting to learn about those evals and get a sense of how good those AI models are at performing AI research
Lawrence seemed incredibly knowledgeable in his field. This was a solid talk, thank you!
@@PH03NXHDFYeah Glad you enjoyed it!
Anyone else notice how the questions to ChatGPT vs DALL-E reveal different thinking patterns? Visual AI seems to unlock pure creativity, while text AI brings out their inner philosophers. 🤔 So much potential for supporting different learning styles! Thank you for the talk!
@@LoreAIverse Interesting observation! I'm also curious to see how children will use voice based AI
A fascinating discussion on children interacting with AI. Thanks for hosting Sophia!
@@ericgieseke5010 Thanks!
I loved this talk. Thank you for sharing it with us!
Thank you for sharing the recording! I really enjoyed it!
Recent news of Australia banning social media for those under 16 (not sure if it’s true) is an interesting decision (I support it). It will be interesting to see how AI is regulated for children, and it will be good to see how things are balanced, because on the positive side it can promote creativity but it can also have many negative consequences far worse than social media.
I also support banning the use of social media until 16, even until 18. Yeah, AI def should be regulated for children - I hope AI companies will take it responsibly
Loved seeing the prompts the kids were making for Dalle. They’re so creative, and it’s wild how quickly they’re learning to use AI, probably faster than most adults! They're mastering how to make prompts more fun and imaginative.
yeah, we will have an AI-native generation
I’m sorry, your robots kept arming themselves and that didn’t raise quite a few alarms?
It clearly did, so they prevented it. Assuming they were only doing this rarely, and why would they not if not told otherwise
Task generated by llms. I'm not surprised
in the main lecture (Q&A section) the guest speaker explains how they addressed that
ua-cam.com/video/XocmVe1FCMY/v-deo.htmlsi=T_XL9kJ3m2-j_ka6
Insightful!
I want to install AI Scientist in the cloud. Can you help with this?
this is their official github repo: github.com/SakanaAI/AI-Scientist. you can contact contributors there too
That was a great talk, thank you for sharing!
Great to see the convergence of robotics and AI. Thanks for sharing, Sophia!
would you be open to allowing robots to walk around in your house?
Another banger. Thanks for uploading the video!
Hope you enjoyed the talk
Timestamps:
0:00 Introduction
0:23 High-level goal of robot learning
1:16 Impact of Deep Reinforcement Learning on robotics
3:46 What makes robotics special?
5:43 Challenge 1 in robotics: Data
10:38 How simulation can help advance robotics
16:07 Challenge 2: Usability of a robot
18:51 Importance of practice for robots
19:53 How large language models can help manage many robots
23:55 The viability threshold for robots
26:10 Why Alex has changed his research direction to AI safety
29:21 Q&A with the BuzzRobot community
Amazing to see scientific research accelerated by AI. What scientific discoveries will be made?
Dario Amodei, CEO of Anthropic, wrote a quite comprehensive essay on what scientific discoveries AI will enable - an interesting read
Along with these skills, I believe the most important things to instill in children, even in the age of AI, are values: the importance of money and time, especially when information and tools are so easily accessible. We are observing the great impact of Social media (pun intended). We can expect worse with AI if people don't have their own values.
@@pranavagarwal779 values naturally come from family and each family has their own values to some extent. it's impossible to unify
@@BuzzRobot That's so true.
That's absolutely wrong. Values come from the Bible. There is an objective set of morals. Otherwise every bad person can think their values and morals are superior to another person's. @@BuzzRobot
@@cl1489 I'm sure many bad people do think that their morals are superior and that they have a right to do what they do. Values vary in different religions - for example, 'an eye for an eye' in one religious system and complete forgiveness and 'turn to them the other cheek also' in another.
What a time to be alive!
@@pranavagarwal779 Indeed. Curious to see how these kinds of projects will impact AI research
Hold on to your ai generated papers!
@ and AI generated reviews!!
Great video as usual!! Thank you for sharing this with us :)
@@PH03NXHDFYeah Hope you'll enjoy it!