- 141
- 160 364
NextGenLakehouse
France
Приєднався 1 сер 2022
If you are a Data Engineer, Data Scientist or Data Analyst this channel is the place to be.
I will make sure to create content around Lakehouse, Delta Lake, Mlflow and Apache Spark.
I will make sure to create content around Lakehouse, Delta Lake, Mlflow and Apache Spark.
Meet a Databricks MVP: Rishabh Pandey
The Databricks MVP Program is our way of thanking and recognizing the community members, data scientists, data engineers, developers and open source enthusiasts who go above and beyond to uplift the data and AI community.
Переглядів: 327
Відео
What's new in Databricks - December 2024
Переглядів 4579 годин тому
Databricks December 2024 Release Highlights. NextGenLakehouse
The perks of using Unity Catalog managed tables
Переглядів 37521 день тому
Documentation docs.databricks.com/en/tables/managed.html Linkedin: www.linkedin.com/in/cindyxjiang/
Simplified ETL with Rivery and Databricks
Переглядів 14328 днів тому
Taylor McGrath, who leads Solutions Engineering at Rivery, discusses the platform's capabilities and its integration with Databricks. Rivery is a modern integration platform operating in the ELT (Extract, Load, Transform) space, designed to simplify data pipeline building and maintenance. Key points: • Rivery offers a no-code interface for data engineers and analysts to extract and load data fr...
From Berkeley to a $B company Data+AI Startup - Lesson learn with Arsalan Tavakoli
Переглядів 570Місяць тому
In this interview, Databricks co-founder Arsalan Tavakoli discusses the challenges and rewards of being a leader in the rapidly changing tech industry. Tavakoli shares his insights on topics like: ●Building and scaling a company from the ground up ●Balancing work and personal life ●Making key decisions ●Hiring and developing a world-class team ●Adapting to the rise of AI Arsalan offers advice t...
Why you should consider using an Open source Semantic layer with David Mariani CTO at Atscale
Переглядів 276Місяць тому
In this interview, David Mariani, CTO of AtScale, discusses the importance of semantic layers, especially in the age of LLMs. Mariani argues that semantic layers are crucial for bridging the gap between the general knowledge of LLMs and the specific terminology and definitions used within companies. He emphasizes the difficulty of building semantic layers manually due to the complexity of busin...
Why you should start using Unity Catalog
Переглядів 285Місяць тому
This video features a discussion with Michelle Leon, Product Manager for Unity Catalog at Databricks. The conversation focuses on the open-source version of Unity Catalog, its features, and why users should consider adopting it. Michelle explains the core functionalities of a data catalog, including metadata management, governance, commit coordination, and support for diverse data and AI assets...
The Future of Data Engineering with Databricks Lakeflow ( Bilal Aslam)
Переглядів 8712 місяці тому
The Future of Data Engineering with Databricks Lakeflow ( Bilal Aslam)
OSS Discussion with Denny Lee Principal Developer Advocate at Databricks
Переглядів 2402 місяці тому
OSS Discussion with Denny Lee Principal Developer Advocate at Databricks
What's new in Databricks : Data Engineering and Governance
Переглядів 2692 місяці тому
What's new in Databricks : Data Engineering and Governance
Databricks Serverless Discussion with Josue
Переглядів 1542 місяці тому
Databricks Serverless Discussion with Josue
The future of Data Engineering with Michael Armbrust
Переглядів 8103 місяці тому
The future of Data Engineering with Michael Armbrust
The future of Data Warehousing with Reynold Xin Databricks Co-founder
Переглядів 2 тис.3 місяці тому
The future of Data Warehousing with Reynold Xin Databricks Co-founder
Getting started with Databricks and Salesforce
Переглядів 2324 місяці тому
Getting started with Databricks and Salesforce
Raito the Self Service Access Management Tool
Переглядів 724 місяці тому
Raito the Self Service Access Management Tool
Choose Openness not storage wars with Kyle Weller Head of Product at Onehouse
Переглядів 2014 місяці тому
Choose Openness not storage wars with Kyle Weller Head of Product at Onehouse
The Digital Natives podcast with Lexy Kassan from Databricks
Переглядів 974 місяці тому
The Digital Natives podcast with Lexy Kassan from Databricks
Getting started with Variant Data Type in Delta Lake and Apache Spark
Переглядів 4804 місяці тому
Getting started with Variant Data Type in Delta Lake and Apache Spark
Data Engineering Pro Tips from Fabrice Deseyn
Переглядів 2814 місяці тому
Data Engineering Pro Tips from Fabrice Deseyn
Open Sourcing Unity Catalog with Michelle Leon Product Manager at Databricks
Переглядів 4004 місяці тому
Open Sourcing Unity Catalog with Michelle Leon Product Manager at Databricks
What's new in Databricks - July 2024
Переглядів 2835 місяців тому
What's new in Databricks - July 2024
Streamline the migration process to Databricks with Alchemist
Переглядів 1995 місяців тому
Streamline the migration process to Databricks with Alchemist
Databricks AIBI Genie: The best Text2SQL AI System with Chao Cai, Sr Director Engineering
Переглядів 8005 місяців тому
Databricks AIBI Genie: The best Text2SQL AI System with Chao Cai, Sr Director Engineering
Getting Started with Databricks Connect and Serverless Compute
Переглядів 1,4 тис.5 місяців тому
Getting Started with Databricks Connect and Serverless Compute
Getting Started with Amperity: Data quality to fuel AI and engagement
Переглядів 3365 місяців тому
Getting Started with Amperity: Data quality to fuel AI and engagement
Power BI and Databricks SQL best practices
Переглядів 7465 місяців тому
Power BI and Databricks SQL best practices
The future of Delta Lake and Apache Iceberg with Tathagata Das
Переглядів 1,2 тис.5 місяців тому
The future of Delta Lake and Apache Iceberg with Tathagata Das
Data Warehousing Migration Talk With Laurent Leturgez
Переглядів 2565 місяців тому
Data Warehousing Migration Talk With Laurent Leturgez
Great podcast, personally enjoyed it.
Thanks for having me, it was a wonderful podcast experience.
Excellent insight!
Great insights.. as a co-founder myself in a tiny AI startup super useful.
great talk! thank you for sharing
What is sml? How is this generic or just databricks implementation
Thanks so much for this. Please how can I learn more
If i have an existing databricks account and workspace that where manually provisioned how can I extract all the environment setup into Terraform to know the state of our current deployment
The Video was too abstract and not for beginners, would great if we can put begineer videos to not especially around the authentication with SPN from local and CICD development with GitActions.
If you ever tried that you would know that this is bullshit
Why change tracking and not cdc?
Exciting update! Looking forward to this new format and the deep dives into Databricks and all things data. The bi-weekly topics sound like a fantastic way to stay on top of trends-can’t wait!
I love databricks but some of the features are available for all users. Example: workflows, any user with workspace access is able to create. With this it is hard to keep environment clean. Any idea when this feature will be available?
Merci Beaucoup
Bon courage ❤❤❤❤❤
❤❤❤❤❤❤❤❤❤❤
❤❤❤❤❤❤❤❤❤
Shields Throughway
White Manors
Deckow Summit
Adrianna Villages
How to use in spark SQL locally can you help with packages to add ?
I love this intro! :D
❤❤❤❤❤❤❤
❤❤❤❤❤❤❤❤❤❤❤
❤❤❤❤❤❤❤❤❤
❤❤❤❤❤❤❤❤❤❤❤
Engaging and intriguing session. Thank you for having Michael over. @Micheal You are a rockstarrrr man... :)
Awesome video and awesome guest! Michael is truly a delight to hear as a data engineer and is perfectly explaining the philosophy behind what Databricks is trying to build ♥ I'm so excited to one day see the open sourcing of DLT 🤩 Keep up the good work!
99927 Braun Prairie
Delia Trail
Emilia Views
Reynold mentioned that he was rapidly churning out code for Apache Spark around 2016 and 2017. During that time, my team adopted it for our applications, and I ended up creating many Jira tickets and Rennold often directly got involved to resolve the tickets. It was challenging to work through the bugs and issues in the framework back then, but since version 2.4, we haven’t encountered any major problems-we’re really happy with the improvements! The most recent issue I faced was with Spark v3.6, where converting a DataFrame to a Pandas DataFrame threw an exception. However, after downgrading the Pandas version, the issue was resolved. I didn’t create a ticket for this one. It’s great to hear from the interview that Reynold has many exciting ideas in the works. He’s no longer just an individual pilot-now he’s the captain of an entire fleet. I’m confident he’ll lead us from "big" to "huge."
Rogahn Extension
How did you do the first step of converting the PDF into chunks and into a database? I feel like we skipped a step here
Becker Mountain
Dereck Flats
Brad Row
Jaskolski Junctions
❤❤❤❤❤❤❤
❤❤❤❤❤❤❤❤
❤❤❤❤❤
Thank you for another great conversation
❤❤❤❤
Great video: * Intro who is Fabrice, a Data Practitioner and a Databricks Champion ua-cam.com/video/fwnhGDZABAk/v-deo.html * How to become a Databricks Champion ? ua-cam.com/video/fwnhGDZABAk/v-deo.html&t=97s * How hard is to convince business (C level) of the need for UC ua-cam.com/video/fwnhGDZABAk/v-deo.html&t=915s * When migration UC, how hard is to get access to the team that owns the Account Console ua-cam.com/video/fwnhGDZABAk/v-deo.html&t=1150s
Looking forward to this!
Can we use UC OSS to share data with consumers within the organization, especially with those who use Python based scripts to pull data from the Lakehouse? Also, would be good to see guidance around when to use UC OSS Vs Delta Sharing.
Unity Catalog open APIs is a two way road enabling read and write access to data through api
@@nextgenlakehouse ok, understood. So, I assume we can just grant READ access also. The consumer needs to clone the git repo, install UC, and access data, is that the high level procedure?
❤❤❤❤
Love it 😀
❤❤❤❤❤❤❤❤❤