- 57
- 27 969
Arthur
Приєднався 11 січ 2022
Arthur is the AI performance company. Our platform monitors, measures, and improves machine learning models to deliver better results. We help data scientists, product owners, and business leaders accelerate model operations and optimize for accuracy, explainability, and fairness.
Arthur’s research-led approach to product development drives exclusive capabilities in computer vision, NLP, bias mitigation, and other critical areas. We’re on a mission to make AI work for everyone, and we are deeply passionate about building ML technology to drive responsible business results.
Arthur’s research-led approach to product development drives exclusive capabilities in computer vision, NLP, bias mitigation, and other critical areas. We’re on a mission to make AI work for everyone, and we are deeply passionate about building ML technology to drive responsible business results.
AI Fest Presents: From Investment to Impact - The ROI of AI in the Enterprise (Stage 1)
Panelists:
- Amit V. Singh, Global Head of GTM - Gen AI Partnerships at AWS
- Pri Oberoi, Staff Data Scientist at Axios HQ
- Meinolf Sellmann, Chief Technology Officer at InsideOpt
- Matt Lynley, Founder of Supervised (Moderator)
Explore how AI investments are transforming into tangible business outcomes in this insightful session. Industry experts from top organizations discussed strategies for maximizing returns on AI initiatives, highlighting real-world examples of AI-driven growth and efficiency. Gain actionable insights into measuring the success of AI projects, from initial investment to long-term impact.
- Amit V. Singh, Global Head of GTM - Gen AI Partnerships at AWS
- Pri Oberoi, Staff Data Scientist at Axios HQ
- Meinolf Sellmann, Chief Technology Officer at InsideOpt
- Matt Lynley, Founder of Supervised (Moderator)
Explore how AI investments are transforming into tangible business outcomes in this insightful session. Industry experts from top organizations discussed strategies for maximizing returns on AI initiatives, highlighting real-world examples of AI-driven growth and efficiency. Gain actionable insights into measuring the success of AI projects, from initial investment to long-term impact.
Переглядів: 58
Відео
AI Fest Presents: Can Anyone Tell Me What an "AI Platform" Is? (Stage 1)
Переглядів 39День тому
Presented By: Donny Greenberg, CEO of Runhouse As companies focus on operationalizing their AI development and infrastructure, especially those coming off a long period of GenAI exploration, it’s worth taking a moment to identify the evolution of the AI stack and define just what an "AI platform" is. Part of the confusion is that the notion of an “AI Platform” has changed every ~3 years for the...
AI Fest Presents: Legal Considerations for the Use of AI in the Enterprise (Stage 1)
Переглядів 64День тому
Panelists: - Aaron Ogunro, Associate Attorney at Polsinelli - Lily Li, Data Privacy and AI Lawyer at Metaverse Law - Ian Eisenberg, Head of AI Governance Research at Credo AI - Joyce Chen, AI Lawyer & Former Data Privacy Officer at Pendo.io - Var Shankar, Chief AI and Privacy Officer at Enzai (Moderator) This panel explored the complex legal landscape surrounding AI adoption in business. Expert...
AI Fest Presents: Elevating Customer Experiences with ML & NLP (Stage 1)
Переглядів 38День тому
Panelists: - Seth Levine, Lead ML Scientist at Loris AI - Bitun Banerjee, AI Product Lead at JP Morgan Chase - George Davis, Founder & CEO of Frame.ai - Bear Douglas, Director of Developer Relations at Pinecone (Moderator) Join industry leaders as they delve into the transformative power of ML and NLP in enhancing customer experiences. This panel explored cutting-edge techniques for leveraging ...
AI Fest Presents: Evaluation is All You Need (Stage 1)
Переглядів 33День тому
Presented By: Jayeeta Putatunda, Senior Data Scientist at Fitch Ratings Large Language Models (LLMs) have transformed natural language processing (NLP), but their evaluation poses challenges due to the lack of standardized benchmarks for diverse tasks. The opaque, black-box nature of LLMs complicates understanding their decision-making processes and identifying biases. Effective evaluation metr...
AI Fest Presents: AI at Scale - Turning Models into Business Solutions (Stage 1)
Переглядів 92День тому
Panelists: - Vik Scoggins, AI/ML Product Lead at Coinbase - Tim Rich, Head of AI at Horizon Media - Adam Zhao, Co-Founder of SafeNest - Alejandro Fernandez, Product Manager - ML/AI at Square (Moderator) In this panel session, experts explored the transformative journey from AI models to impactful business applications. Discover strategies for scaling AI across organizations, overcoming operatio...
AI Fest Presents: Leveraging Data and AI for Human Centered Computational Reasoning (Stage 2)
Переглядів 18День тому
Presented By: Nicholas Mattei, Associate Professor at Tulane University In recent years there has been an explosion in interest in topics that sit at the intersection of applications of computing technology and societal issues. There has been significant work in the academic, industrial, and policy spaces to clarify and formalize best practices regarding the deployment of computational decision...
AI Fest Presents: Generative Al - A Phenomenal Science (Stage 2)
Переглядів 31День тому
Presented By: Raz Besaleli, AI Consultant
AI Fest Presents: You Also Need Good Hardware and Software (Stage 2)
Переглядів 20День тому
Presented By: Gabe Weisz, Fellow at AMD Attention is not all you need-running inference on large language models and other modern neural network topologies would be too slow to be useful without specialized computing devices. In this talk, Gabe discussed how model design, hardware design, and software interact, and provide a high-level overview of the accelerator space including GPUs, NPUs, and...
AI Fest Presents: Venture Perspectives: The Next Big Moves in AI (Stage 2)
Переглядів 39День тому
Panelists: - Tiffany Luck, Partner at New Enterprise Associates - Dylan Itzikowitz, Principal at South Park Commons - Leah Morris, Senior Director, Velocity Program at Radical Ventures - Daniel Chesley, Principal at Work-Bench (Moderator) In this session, leading venture capitalists explored the most promising trends and innovations shaping the future of artificial intelligence. Discover the ke...
AI Fest Presents: Embeddings Must Be Seen to Be Believed (Stage 2)
Переглядів 36День тому
Presented By: Ben Schmidt, VP of Information Design at Nomic AI Embedding models are a foundational part of all modern AI systems, and their representations of documents are of potentially great value to anyone with large uncategorized collections of text or images. But high dimensional spaces are also intrinsically hard to understand, which makes providing useful interfaces to embedding spaces...
AI Fest Presents: Considering the Psychosocial Impact of Harnessing Technology for Good (Stage 3)
Переглядів 35День тому
Presented By: Nakshathra Suresh, Co-Founder of eiris This talk explored the psychological, social, ethical, and safety risks of integrating emerging technologies, including AI, into daily life. Nakshathra, a cyber criminologist and co-founder of eiris, highlighted the growing cyber safety challenges posed by innovators who overlook end-user safety. She discussed non-technical risks like harm, b...
AI Fest Presents: Championing Ethical & Responsible AI: A Conversation with Leaders (Stage 3)
Переглядів 62День тому
Panelists: - Alyssa Lefaivre Škopac, RAI Strategist - Michael Brent, Director of Responsible AI at BCG - Shruthi Velidi, Founder of Communitek - Gurpreet Kaur Khalsa, Senior Product Manager - Securing GenAI at Palo Alto Networks - Abhinav Raghunathan, Founder of EAIDB (Moderator) In this panel, you'll hear from industry pioneers who are at the forefront of ethical AI development. This session d...
AI Fest Presents: Ethics, Equity, and Empowerment in AI - Fireside Chat w/ Renée Cummings (Stage 3)
Переглядів 53День тому
Join us for a thought-provoking fireside chat with Renée Cummings, renowned AI ethicist and Data Science Professor of Practice at the University of Virginia, as we explore the critical intersection of ethics, equity, and empowerment in AI. In this session, moderated by Arthur's very own Victoria Vassileva, Renée will discuss how AI technologies can both challenge and advance social justice, and...
AI Fest Presents: Environmental Challenges and AI: Shaping a Sustainable Future (Stage 3)
Переглядів 68День тому
Panelists: - Lily Xu, Postdoc at the University of Oxford - Maria João Sousa, Executive Director at Climate Change AI - Pranjal Bajaj, Senior Data Scientist at Boston Consulting Group - Teresa Datta, ML Research Scientist at Arthur (Moderator) In this session, our panelists explored the powerful role AI plays in addressing today’s most pressing environmental issues. This session highlighted how...
AI Fest Presents: Human Reactions to Being Erased in Generative AI (Stage 3)
Переглядів 5День тому
AI Fest Presents: Human Reactions to Being Erased in Generative AI (Stage 3)
AI Fest Presents: Work 2.0: AI’s Role in the Future of Employment (Stage 3)
Переглядів 101День тому
AI Fest Presents: Work 2.0: AI’s Role in the Future of Employment (Stage 3)
AI Fest Presents: LLM Representation of Personas (Stage 3)
Переглядів 22День тому
AI Fest Presents: LLM Representation of Personas (Stage 3)
AI Fest Presents: Style Over Substance - Failure Modes of LLM Judges in Alignment Benchmarking
Переглядів 24День тому
AI Fest Presents: Style Over Substance - Failure Modes of LLM Judges in Alignment Benchmarking
AI Fest Presents: What’s New with the Arthur Platform
Переглядів 38День тому
AI Fest Presents: What’s New with the Arthur Platform
AI Fest Presents: The Era of Inference - Efficient and Controllable Serving of LLMs
Переглядів 103День тому
AI Fest Presents: The Era of Inference - Efficient and Controllable Serving of LLMs
[Webinar] A Quick Primer on Agents: The Good, the Bad, and the Future
Переглядів 257День тому
[Webinar] A Quick Primer on Agents: The Good, the Bad, and the Future
[Webinar] LLMs and Misinformation: A Double-Edged Sword in the Digital Age
Переглядів 1102 місяці тому
[Webinar] LLMs and Misinformation: A Double-Edged Sword in the Digital Age
Ground Truth #6: OpenAI’s O1 Model Unpacked
Переглядів 5652 місяці тому
Ground Truth #6: OpenAI’s O1 Model Unpacked
Ground Truth Podcast #5: Amazon's Alexa Revamp, Ilya Sutskever's $1B Raise, & The Antitrust Circus
Переглядів 692 місяці тому
Ground Truth Podcast #5: Amazon's Alexa Revamp, Ilya Sutskever's $1B Raise, & The Antitrust Circus
Ground Truth Podcast #4: Nvidia Earnings, AI Code Completion, and California’s AI Regulations
Переглядів 712 місяці тому
Ground Truth Podcast #4: Nvidia Earnings, AI Code Completion, and California’s AI Regulations
Ground Truth Podcast #3: Generative AI, Model Monitoring, and AI in Healthcare
Переглядів 813 місяці тому
Ground Truth Podcast #3: Generative AI, Model Monitoring, and AI in Healthcare
Ground Truth Podcast #2: AI Privacy, Autonomous Vehicles, and Wall Street Insights
Переглядів 363 місяці тому
Ground Truth Podcast #2: AI Privacy, Autonomous Vehicles, and Wall Street Insights
[Webinar] Safeguarding AI Models: Exploring Prompt Injection Variants
Переглядів 3483 місяці тому
[Webinar] Safeguarding AI Models: Exploring Prompt Injection Variants
It was nicknamed Strawberry because it's an organic model, not a static GPT. What's buried in the straw, is their new jam. 🍎
Buried in the straw, or strraw, or strrraw, or ...
@@JohnPDickerson It's funny to watch people test this preview model, because it's not actually a test of o1's reasoning, it's a test of the users' reasoning. Those with imagination who can process information in creative ways, will see how different this model is compared to all precedent. And those who learned to just memorize information see it as an auto-complete or harder to use. They ask it to count letters in a word LMAO
I missed the live presentation, but this is just as good. Thanks for this incredible material. I am passionate about the disinformation and misinformation in the age of deluge of information, coupled with powerful tools at the disposal of everyday folks. It matters how these tools are deployed and how the unsuspecting vulnerable majority are protected from the potential harm. Thanks Cherie and Team!
It was nothing about the Finance.
Subscribed
I was looking for a good summary around LLM evaluation metrics.. I see a lot of them captured here well
💖 'Promo SM'
Who is Diego M. Oppenheimer?
Interesting
Awesome talk! So much cool info
Fantastic presentation, Max and Rowan! The depth of your analysis and the clarity with which you presented the complexities of evaluating LLMs is truly commendable. It's evident that a lot of thought and effort went into this research. I'm particularly intrigued by your approach to using LLMs as evaluators. It opens up a plethora of possibilities but also brings forth some ethical considerations. How do you account for systemic biases in evaluation metrics when using LLMs as evaluators? Given that traditional metrics might not capture the fairness aspect adequately, have you considered incorporating fairness metrics or mitigation methods in your evaluation process?
Excellent overview, Terry. The part about identifying age discrimination within machine learning models caught my attention. Could you share more about how Arthur.AI's platform sets the acceptable range for performance metrics in this context? Is it customizable based on industry or legal standards?
Any way to boost the audio on this? Barely audible in some parts
That's a resourceful conversation, folks. Thanks for hosting.
✋ þrðmð§m