63
53 394

Andrew Ng at Snowflake: AI Agent Battle Royale

7:32

Aida Farahani: From 2D to 3D in Seconds

21:25

Sequoia Capital: Move 37 is Here!

16:54

Nikhil Patel: Inside Sally Beauty’s Data Strategy

32:14

How an 8B Model Beat an Industry Giant

11:38

Victor Perrine: From Bananas to $Billions

24:50

Tapan Khopkar: A ‘MasterClass’ in Marketing Mix

Tapan is the Executive Director of Marketing Sciences at OMD. This episode is a very thorough compendium of marketing science principles and practices, as applied to decisions about paid media. Topics include:
• Marketing Mix Modeling (MMM) vs Multi-Touch Attribution (MTA) vs experiments, including how to use each and for what purpose
• KPIs: Incremental sales vs top-of-funnel metrics like awareness and consideration
• How to design incremental changes in the direction that a model is pointing
• Testing a new marketing mix in various geo markets
• Judicious use of model hyperparameters, such as AdStocks
• The best cadence for measurement
• Third-party cookies (Implications of the shift towards user choice)
• Recent tools: Google Meridian (a Bayesian approach) vs Meta Robyn (a frequentist approach)
These ideas are brought to life with an example from a financial services brand, planning a big launch, with a need to answer questions like: How should the paid media budget be allocated? Should the focus be branding first and performance later? How should ad spend be flighted (Always-On vs Burst and Trickle)?
In effect, this show is a Master Class regarding marketing science, as described by one of the world’s top authorities on the topic, who's currently an Executive Director at OMD, which is the world’s largest media agency network.

Відео

Andrew Ng at Snowflake: AI Agent Battle Royale

7:32

Andrew Ng at Snowflake: AI Agent Battle Royale

Переглядів 1,3 тис.9 годин тому

Andrew Ng was the keynote speaker last week on Day Two of the Snowflake BUILD conference, and in that talk, he shared results from testing different kinds of agentic workflows on the Human Eval benchmark. This video is a deep dive into those test results, paying particular attention to the top two best-performing agentic tools in the evaluation panel done by DeepLearning, which were Reflexion a...

21:25

Aida Farahani: From 2D to 3D in Seconds

Переглядів 50321 годину тому

With a specialization in 3D deep learning, Aida is doing ground-breaking work related to 3D simulations. In this episode, she first describes why meshes or point clouds are computationally-expensive for simulating changes to shapes. She then describes why “implicit fields” are much more efficient for this, as described in the widely-cited “DeepSDF paper.” Next she describes a special-case probl...

16:54

Sequoia Capital: Move 37 is Here!

Переглядів 1,4 тис.День тому

This is a special edition of the ‘AI World’ video series covering the release of OpenAI-o1 (alias Q* and Strawberry). By whatever name, this is a very powerful new kind of model that has demonstrated remarkable reasoning abilities. The video starts with a look back in time at “Move 37” - an iconic moment in AI history during the 2016 match between AlphaGo and Lee Sedol. That was a moment when t...

Nikhil Patel: Inside Sally Beauty’s Data Strategy

32:14

Nikhil Patel: Inside Sally Beauty’s Data Strategy

Переглядів 13514 днів тому

Nikhil is the Data Science Director at Sally Beauty Holdings, which is a $3.7 billion specialty retailer, with more than 10,000 products sold through over 4,000 stores, as well as online. Prior to his current role, Nikhil held a senior leadership role at Harman International, working with panel data from top CPG brands like P&G, Unilever and Kraft. In this episode, Nikhil describes his 19-year ...

11:38

How an 8B Model Beat an Industry Giant

Переглядів 14714 днів тому

This video describes how a system called ‘AgentStore’ was able to gain the top spot on a benchmark for AI agents - beating out a gigantic model with a small one. AgentStore is a platform and method for aggregating specialized agents that perform real-world tasks on digital devices on macOS, Windows and Ubuntu. In that system, a meta agent selects the best resource (or combination of resources) ...

Victor Perrine: From Bananas to $Billions

24:50

Victor Perrine: From Bananas to $Billions

Переглядів 5121 день тому

Victor (Viko) Perrine is the Global Director of New Growth Initiatives at Circle K. Prior to that, he’s held senior leadership roles at Delek US Holdings and at UGP Inc. In this episode, Viko describes a major new initiative called Lift that's unique to Circle K, which just launched in Europe, plus future development plans for that. Building on this, he describes the day-to-day for a global inn...

24:09

Ray Pettit: New Models for AI Literacy?

Переглядів 2828 днів тому

Ray is the Chief Data and Analytics Officer at Valhalla AI Solutions. Prior to that, he held senior leadership roles at the Advertising Research Foundation, at the Institute for Experiential AI, and at comScore. In this episode, Ray discusses challenges associates with promoting meaningful AI literacy in business, starting with fundamental questions, like what is “AI literacy” exactly? (Is it m...

Ivan Pinto: A Year of AI Testing in Software Dev

24:02

Ivan Pinto: A Year of AI Testing in Software Dev

Переглядів 15928 днів тому

Ivan is an Associate VP of Delivery at Robosoft Technologies. His team does application development, engineering and QA for US clients, including web (HTML, CSS), mobile (Android, iOS, Samsung, LG) and streaming, including Roku TV. In this episode, Ivan shares what he learned in his experiments over the past year using Gen AI to improve the efficiency and quality of code produced by his 300-mem...

20:55

Sam Marks: Big Data, Big Bad Bruins

Переглядів 99Місяць тому

Sam is the Director of Business Strategy, Solutions & Analytics at the Boston Bruins & TD Garden. Prior to that, he directed strategy and analytics for the Arizona Coyotes, and at VaynerMedia. In this episode, Sam sheds light on the world of analytics and business strategy for sports teams, including the differences in strategy and focus for teams vs at the league level. He also talks about the...

Celia Wanderley: AI Innovator of the Year

30:55

Celia Wanderley: AI Innovator of the Year

Переглядів 903Місяць тому

Celia is the Chief Innovation Officer at Bits In Glass (BIG), a top Canadian IT consulting firm. Prior to that, she held senior leadership roles at AltaML and at Deloitte Canada. Last year. Celia was recognized as the AI Innovator of the Year, by Women in AI. In this episode, Celia shares insights from trends she’s seen in her recent work involving intelligent automation of business processes a...

24:13

Dave Stern: Hackproof Your Startup

Переглядів 154Місяць тому

Dave is a fractional CTO and DevOps engineer with over 25 years of experience in systems and software engineering. He's the President and Senior Solutions Architect of Stern DevOps Group, which is a consultancy focused on early stage companies. He's also the author of a new book: Hackproof Your Startup, and that book is a key topic of the show. In this episode, Dave discusses IT and AI security...

Andrei Lopatenko: Scaling AI to Billions

38:01

Andrei Lopatenko: Scaling AI to Billions

Переглядів 148Місяць тому

Andrei holds a PhD in Computer Science, and is the Director of Search and the AI Lab at Neuron7. Prior to that, he was the VP Engineering and AI at Zillow. He’s also held key leadership roles at Google, Apple, Walmart and eBay. In this episode, Andrei shares insights and advice, based on his experience deploying large-scale, high-load NLP and search applications to billions of customers (5 bill...

46:22

Shawn Goodin: Agent-Driven Marketing?

Переглядів 158Місяць тому

Shawn is the Global VP of Solutions at FirstHive, which is a customer data platform. Prior to that, he held senior leadership roles at Capgemini, Silicon Valley Bank, JPMorgan Chase, Clorox, Northwestern Mutual and SC Johnson. He is also an advisory board member of the Customer Data Platform (CDP) Institute. In this conversation, Shawn describes various roadblocks to transformation in large org...

30:57

Jodi Blomberg: Strategic Bets on AI

Переглядів 26Місяць тому

Jodi is the VP of Data Science at Cox Automotive, a company that has a diverse portfolio of 17 brands that encompass digital products like Kelley Blue Book and Autotrader, as well as various kinds of physical services - all of which are supported by about 70 in-house data scientists and ML engineers. In this conversation, Jodi describes her AI initiatives as investments, managed in a way that’s...

Ramsu Sundararajan: Segment of One at Scale

58:14

Ramsu Sundararajan: Segment of One at Scale

Переглядів 69Місяць тому

Ramsu Sundararajan: Segment of One at Scale

2:53

Announcing the AI Master Group Podcast

Переглядів 135Місяць тому

Announcing the AI Master Group Podcast

Mesh Anything (except a Pink Hippo Ballerina)

8:04

Mesh Anything (except a Pink Hippo Ballerina)

Переглядів 3,2 тис.2 місяці тому

Mesh Anything (except a Pink Hippo Ballerina)

Can Robots Win at Table Tennis? Take a Look!

8:37

Can Robots Win at Table Tennis? Take a Look!

Переглядів 2,2 тис.2 місяці тому

Can Robots Win at Table Tennis? Take a Look!

10:37

Shark Alert! YOLO AI-Vision in Action

Переглядів 1,4 тис.3 місяці тому

Shark Alert! YOLO AI-Vision in Action

AI Can do That?? Silver Medal in Pure Math

11:04

AI Can do That?? Silver Medal in Pure Math

Переглядів 1,7 тис.3 місяці тому

AI Can do That?? Silver Medal in Pure Math

8:12

Will Open-Source Llama Beat GPT-4o?

Переглядів 6283 місяці тому

Will Open-Source Llama Beat GPT-4o?

Call a Doctor! --Blue Screen Lessons Learned

10:21

Call a Doctor! --Blue Screen Lessons Learned

Переглядів 9954 місяці тому

Call a Doctor! Blue Screen Lessons Learned

Amazing Milestone! Million Experts Model

7:07

Amazing Milestone! Million Experts Model

Переглядів 1,1 тис.4 місяці тому

Amazing Milestone! Million Experts Model

10:16

Behind the Curtain of Figma AI

Переглядів 5964 місяці тому

Behind the Curtain of Figma AI

How a Language Model Aced a Top Leaderboard

8:49

How a Language Model Aced a Top Leaderboard

Переглядів 1,2 тис.4 місяці тому

How a Language Model Aced a Top Leaderboard

6:29

New Method Runs Big LLMs on Smartphones

Переглядів 1,9 тис.4 місяці тому

New Method Runs Big LLMs on Smartphones

10:02

Nemotron-4 is BIG in More Ways than One

Переглядів 8505 місяців тому

Nemotron-4 is BIG in More Ways than One

13:54

Testing Ollama on Hard Questions

Переглядів 1,2 тис.5 місяців тому

Testing Ollama on Hard Questions

8:28

Hacking Passwords with ChatGPT?

Переглядів 1,6 тис.5 місяців тому

Hacking Passwords with ChatGPT?

КОМЕНТАРІ

@AIMasterGroup День тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@amirhooshang1269 2 дні тому
No device such as AI or Deepmind google can beat a Pro TT Human. I strongly believe. Let us challenge. I can bet on any amount with AI or Google any time anywhere. Le me know please.
@amirhooshang1269 2 дні тому
price please?
@haleh.kia_ARTSTUDIO 4 дні тому
👏🏻👏🏻👏🏻👏🏻👏🏻bravo ❤️🔥🌹
@AIMasterGroup 4 дні тому
Here are links to the two papers I cited in the video. Reflexion: Language Agents with Verbal Reinforcement Learning arxiv.org/pdf/2303.11366 AgentCoder: Multiagent-Code Generation with Iterative Testing and Optimisation arxiv.org/pdf/2312.13010
@hamidekarimi-b3v 6 днів тому
Good job👍🏻👍🏻
@i4aneye618 7 днів тому
@lassetischer2045 9 днів тому
Brilliant video
@AIMasterGroup 9 днів тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@coney2010grads 9 днів тому
Good analysis. Talk about ttt also!
@AIMasterGroup 11 днів тому
As promised in the video, here are links to the two papers I featured. Sequoia Capital Generative AI’s Act o1: The Agentic Reasoning Era Begins www.sequoiacap.com/article/generative-ais-act-o1/ OpenAI Learning to Reason with LLMs openai.com/index/learning-to-reason-with-llms/
@AIMasterGroup 16 днів тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup 17 днів тому
Here's a link to the authors' GitHub page. AgentStore: Scalable Integration Of Heterogeneous Agents As Specialized Generalist Computer Assistant. chengyou-jia.github.io/AgentStore-Home/
@AIMasterGroup 23 дні тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup 28 днів тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
Audio Podcast on Spotify at spoti.fi/3XIf1te Audio Podcast on Apple at apple.co/48cS6eB
@AIMasterGroup Місяць тому
The AI Master Group Podcast launches this week on Friday! The show is an ideal place to meet people who are creating the future of AI in their work today. It features interviews with authors of recent papers, and with people deploying AI in real-world applications, including people in senior leadership roles. If you want a front row seat to the technology and craft of AI, this show will be perfect. You can Follow the show on Spotify at this link: spoti.fi/3XIf1te (The show will also be the other usual channels, such as Apple Podcasts.) LAUNCH SCHEDULE The first episode on Friday of this week will be a surprise guest. 😊 Following that, here’s a preview of upcoming episodes. Monday 10/7: Ramsu Sundararajan, PhD Ramsu is the Head of R&D at solus.ai, which powers Segment of One personalization. Roles prior to that included Senior Scientist at GE Global Research, and Principal at Sabre Airline Solutions, where he developed some of the original algorithms. Monday 10/7: Jodi Blomberg Jodi is the VP, Data Science at Cox Automotive. Prior to that, she’s held senior leadership roles at SAS, Charles Schwab and Waste Management. Tuesday 10/8: Shawn Goodin Shawn is the Global VP of Solutions at FirstHive, which is a customer data platform. Prior to that, he held senior leadership roles at Capgemini, Silicon Valley Bank, JPMorgan Chase, Clorox, Northwestern Mutual and SC Johnson. Shawn is also an advisory board member of the Customer Data Platform (CDP) Institute. Wednesday 10/9: Dave Stern Dave is a fractional CTO and devops engineer with over 25 years of experience in systems and software engineering. He's the President and Senior Solutions Architect of Stern Devops Group, which is a consultancy focused on early stage companies. He's also the author of a new book: Hackproof Your Startup, and that book is a key topic of the show. Thursday 10/10: Andrei Lopatenko, PhD Andrei is the Director of Search and the AI Lab at Neuron7. Prior to that, he was the VP Engineering and AI at Zillow. He’s also held key leadership roles at Google, Apple, Walmart and eBay. Friday 10/11: Celia Wanderly Celia is the Chief Innovation Officer at Bits In Glass (BIG), a top Canadian IT consulting firm. Prior to that, she held senior leadership roles at AltaML and at Deloitte Canada. Last year. Celia was recognized as the AI Innovator of the Year, by Women in AI. Monday 10/14: Sam Marks Sam is the Director of Business Strategy, Solutions & Analytics at the Boston Bruins & TD Garden. Prior to that, he directed strategy and analytics for the Arizona Coyotes, and at VaynerMedia. Tuesday 10/15: Ivan Pinto Ivan is an Associate VP of Delivery at Robosoft Technologies. His team does software development, engineering and QA for US clients. This conversation covers his experiments using Gen AI to improve the efficiency and quality of code produced by his 300-member team. Wednesday 10/16: Ray Pettit, PhD Ray is the Chief Data and Analytics Officer at Valhalla AI Solutions. Prior to that, he held senior leadership roles at the Advertising Research Foundation, at the Institute for Experiential AI, and at comScore. Thursday 10/17: Victor (Viko) Perrine Viko is the Global Director of New Growth Initiatives at Circle K. Prior to that, he’s held senior leadership roles at Delek US Holdings and at UGP Inc. The conversation covers a new initiative that he just launched last month in Europe. Friday 10/18: Nikhil Patel Nikhil is the Data Science Director at Sally Beauty Holdings. Prior to that, he held a senior leadership role at Harman International. Friday 10/25: Aida Farahani Aida is a doctoral candidate in Machine Learning at Technische Universität Chemnitz, in Germany. With a specialization in 3D deep learning, Aida is doing ground-breaking work creating simulations using implicit fields, which is a much more computationally-efficient approach than mesh. Friday 11/1: Tapan Khopkar, PhD Tapan is the Executive Director of Marketing Sciences at OMD, which is the world’s largest media agency network. Prior to that, he was VP and Head of Innovation Labs at Cartesian Consulting.
@timalete Місяць тому
STEM, Science, Technology, Engineering, Math are at the core of hard sciences upon which systems are built. I suggest an addition: STEMAI. Adding AI where once it was a domain of art, it is now a science. Call it STEMY for short. Y meaning "Why?" Math Symbol: ? AI giving a technical hard answer to the question that was once an art in the domain of Philosophy! ...Because Intelligent Science.....?.
@NotSoMuchFrankly Місяць тому
I want those 8 minutes back. How did this get upvotes?
@dipeoluayomide5097 2 місяці тому
Wow, that was really explanatory, thanks a lot for this
@AB-wf8ek 2 місяці тому
This doesn't make any sense to me. There are a bunch of options for remeshing that have been around for years. Instant Meshes comes to mind, which runs standalone free & open source. It also converts to quad typology, which is important for applying subdivisions. Of all the tools that are currently avaliable, why would you promote the one AI project that doesn't do anything new and doesn't even work?
@insane5769 2 місяці тому
the channel's called AI master group i imagine it's all promoting ai dogshit
@AB-wf8ek 2 місяці тому
@insane5769 I'm actually trying to be constructive. I do 3D and use AI tools, so I appreciate the effort that a person makes to record themselves and share this information, but at least marginally educate yourself on the subject before sharing, otherwise it's just slop content.
@insane5769 2 місяці тому
@@AB-wf8ek i guess sometimes i don't realise that there are some useful AI applications but 100% of what i see is slop because there's just so much more of it than anything else
@HoggyMayhem Місяць тому
This isn't about remeshing, it's about text-to-mesh generation and reducing the number of tokens required to represent a mesh in a transformer. The channel's content is a bit weird, I agree, but coming to an ai-focused channel and asking 'why you talk about ai' is also kinda unreasonable :D
@AIMasterGroup 2 місяці тому
Here’s a link to the code I ran in the video. github.com/buaacyw/MeshAnythingV2 And here’s a link to the Rodin website where I created the 3D character. hyperhuman.deemos.com/rodin
@Also_sprach_Zarathustra. 2 місяці тому
Thank you ! It's a really funny video, and I love your calm voice ^^
@-danR 2 місяці тому
"he" this, "him" that. Is that thumbnail narrator AI-generated?
@AIMasterGroup 2 місяці тому
That's me: Jim Griffin. I'm a real person, not AI generated. I hope the question implies that I sounded professional!!
@J.J.J.J.J.J.J 2 місяці тому
Could not deal with the massive spin of advanced players.
@thereistheonlyone 2 місяці тому
Maybe But once it gets the data ... The robot will strike very early So it'a all about the input.
@J.J.J.J.J.J.J 2 місяці тому
@@thereistheonlyone Yes indeed, if it was able to receive the input and calculate position AND movement needed at point of contact. Because it would also need movement options since just getting its racquet in the right position (with slight movement) is not enough to deal with some spins. We have to provide counter-spin at times, which is not a simple matter to determine or execute.
@J.J.J.J.J.J.J 2 місяці тому
@@thereistheonlyone And on rewatching some, I didn't even see it deal with basic back spins.
@AIMasterGroup 3 місяці тому
Here’s a link to the full paper: Achieving Human Level Competitive Robot Table Tennis arxiv.org/pdf/2408.03906
@AIMasterGroup 3 місяці тому
Here’s a link to the SharkEye website. sharkeye.org/#our-process And here’s a link to the documentation page for YOLO. docs.ultralytics.com/
@npc4416 3 місяці тому
they should use this to end world hunger and do cancer research pleaseeeeeeeeeeeeeeeeeeeeeeee
@AIMasterGroup 3 місяці тому
Here’s a link to the full article from Google DeepMind. AI achieves silver-medal standard solving International Mathematical Olympiad problems deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/
@AIMasterGroup 3 місяці тому
Here’s a link to the blog post I quoted from in the video. Open Source AI Is the Path Forward about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/
@AIMasterGroup 4 місяці тому
If you’re interested in digger deeper into this topic, I highly recommend this excellent article by Ed Bott on ZDnet, which includes some very helpful historical context. “What caused the great CrowdStrike-Windows meltdown of 2024? History has the answer.” www.zdnet.com/article/what-caused-the-great-crowdstrike-windows-meltdown-of-2024-history-has-the-answer/
@hausenmusic 4 місяці тому
Great video! Thank you for the clear explanation. The calm tone made it easier to undertand.
@AIMasterGroup 4 місяці тому
That's very nice of you. Thank you very much!
@AIMasterGroup 4 місяці тому
Here’s a link to the paper I featured in this video: “Mixture of A Million Experts,” by Xu He at Google DeepMind arxiv.org/pdf/2407.04153
@AIMasterGroup 4 місяці тому
At the very end of this video there’s a snippet from a brilliant performance of the song “Daisy Bell: A Bicycle Built For Two,” by permission from Julien Neel. (Thank you Julien!) Here’s his website: julienneel.sellfy.store/ All four singers in the barbershop quartet are Julien. It’s brilliant - and amazing. Here’s a link to the full song on UA-cam. ua-cam.com/video/JmiNlZAiDqo/v-deo.html
@AIMasterGroup 4 місяці тому
Here’s a link to the paper this video is about: “Discovering Preference Optimization Algorithms with and for Large Language Models” arxiv.org/pdf/2406.08414
@KiraSlith 4 місяці тому
Please check your microphone volume before upload, you are DRAMATICALLY quieter than any of the sound effects or your end card. I'd hazard a guess at around 12dB quieter.
@AIMasterGroup 4 місяці тому
Thanks for the heads-up. I need to study up on how to Normalize volume using Essential Sound in Adobe Premier Pro. (I know how to do that in Sound Forge.)
@KiraSlith 4 місяці тому
You could probably get it running at home through a smart scalar API like Ollama with 4 A6000s, 512gb of system RAM, and a pair of Intel Scalable CPUs with AVX-512 cores, it'll be around $10k to build on used parts. It won't be snappy, 2 minutes per response at least, but it's cheaper than renting a whole DGX for a month. A 22.3k entry synthetic dataset generated by continually inferring desireable results with Nemotron over a month should be good enough to start training a smaller model to a far higher standard than it's natural dataset counterpart could provide, and look at that, you'll also have hardware good enough for training 32B models already on-hand by the end.
@AIMasterGroup 4 місяці тому
That's an interesting idea. It's worth a try.
@kitastro 4 місяці тому
That's mixtral though isn't it just getting memory throttled
@AIMasterGroup 4 місяці тому
Yes, you're right. Most devices would certainly get memory throttled running Mixtral-47B. PowerInfer-2 apparently helps to mitigate that issue, both by optimizing the computation approach and by selecting processing units in a more planful way.
@ronilevarez901 5 місяців тому
No performance gains on CPU unless you use their models *and* you have avx2 instruct set, so meh. I'll stick to my current project of making ultra slim, CPU only mini models that challenge mainstream, thanks.
@AIMasterGroup 4 місяці тому
Sounds like an interesting area of focus: Ultra-slim, CPU-only mini models. You're right: PowerInfer-2 optimizes for various processing units, including CPU configurations that support AVX2 instructions. Although you're constrained on the CPU, you still might get a performance improvement from other aspects of PowerInfer-2 -- especially the approach they describe to handling sparse data, so it still might be worth exploring . . .
@vintagegenious 5 місяців тому
Very interesting. Is there a way to link powerinfer2 to sillytavern ?
@Getenari 5 місяців тому
don't make such loud noises in the video
@KJ-xt3yu 5 місяців тому
phone farms might take this on 🍿
@AIMasterGroup 5 місяців тому
Here’s a link to the paper I cited in the video. PowerInfer-2: Fast Large Language Model Inference on a Smartphone arxiv.org/abs/2406.06282
@Learntsomethingtoday 5 місяців тому
Agent smith.