Unity Catalog, Delta Sharing and Data Mesh on Databricks Lakehouse

Databricks Cost Management: Tips and Tools to Stay Under Budget

Advancements in Open Source LLM Tooling, Including MLflow

Get 10 Mega Boxes OR 60 Starr Drops!!

У Москві - про Курську область. Що кажуть на вулицях

Олександрія - Шахтар / УПЛ / 4 тур / Огляд матчу #Олександрія #Шахтар #уплтб

Photon for Dummies: How Does this New Execution Engine Actually Work?

Databricks

Переглядів 6 732

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 25 сер 2024

КОМЕНТАРІ • 9

@lezwon 10 місяців тому ⁺⁴
Wow! this was one of the best and fun talks I've listened to i a long time. I loved how Holly similplified the entire talk, so that even dummies like me can understand. Kudos to her 👏 Great job from starting with basics of how spark and the system works, to relating it to photon.
Thank you for the presentation Holly. This was very helpful. 🙏
@datasmithing_holly 8 місяців тому ⁺⁷
Hi everyone! Thanks for watching this video. Unfotunately the sources and credits were cut off at the end, so here they are if you would like to do any further reading.
[Paper] Alexander Behm, Shoumik Palkar, Utkarsh Agarwal, Timothy Armstrong, David Cashman, Ankur Dave, Todd Greenstein, Shant Hovsepian, Ryan Johnson, Arvind Sai Krishnan, Paul Leventis, Ala Luszczak, Prashanth Menon, Mostafa Mokhtar, Gene Pang, Sameer Paranjpye, Greg Rahn, Bart Samwel, Tom van Bussel, Herman van Hovell, Maryann Xue, Reynold Xin, Matei Zaharia. Photon: A Fast Query Engine for Lakehouse Systems. SIGMOD ’22
[Paper] Michael Armbrust, Reynold S. Xin, Cheng Lian, Yin Huai, Davies Liu, Joseph K. Bradley, Xiangrui Meng, Tomer Kaftan, Michael J. Franklin, Ali Ghodsi, Matei Zaharia. 2015. Spark SQL: Relational Data Processing in Spark. ACM SIGMOD
[Paper] Timo Kersten, Viktor Leis, Alfons Kemper, Thomas Neumann, Andrew Pavlo, and Peter Boncz. 2018. Everything you always wanted to know about compiled and vectorized queries but were afraid to ask.
[Lectures] CMU 15-721 Advanced Database Systems. 20 - Databricks Photon / Spark SQL, Andrew Pavlo
[Book] Code: The Hidden Language of Computer Hardware and Software, Charles Petzold
With special thanks to fact checkers and early reviewers: Alexander Behm, Sriram Krishnamurthy, Utkarsh Agarwal, Kent Marten, Tim Dikland, Grzegorz Rusin, Yassine Essawabi, Youssef Mrini, Erika Fonseca, Eoin O'Flanagan and Michael O'Kane
@wookiist 23 дні тому ⁺¹
That was amazing session. Thank you!
@rakeshreddy6630 11 місяців тому ⁺³
Holly Smith's voice is amazing..
explanation is giving so effectively...
@allthingsdata 9 місяців тому ⁺²
fantastic, probably gonna steal some slides for internal training
@youssefb.7406 11 місяців тому ⁺¹
Thanks a lot, could be interesting to showcase performance increase using the photon acceleration
@datasmithing_holly 8 місяців тому ⁺³
Hey Youssef, I toyed with the idea of including them, but the problem is that performance is very subjective to workloads, feature coverage and when the test is being run. If I was cherry picking, I would point to the 37x speed up for some text functions. On the other hand, not all workloads are photon-isable, so it could make no difference whatsoever. In general, as of 2023 I'd expect to see 2-3x speed up in a compatible workload, but by 2024 I'm anticipating 3-4x.
Benchmarks can be useful, but what matters are your personal ETL pipelines you're running. At 37:57 there's a list of good candidates to start with. I'd recommend testing Photon with those, and seeing what kind of a difference it makes.
Happy testing!
@maximerivest3501 10 місяців тому
Seems like lots of the problems could have been resolved by using julia instead of scala
@ScienceMinisterZero 9 місяців тому ⁺²
The jvm is for boomers, rewrite it in Rust.

Наступне

Автоматичне відтворення

Unity Catalog, Delta Sharing and Data Mesh on Databricks Lakehouse

Unity Catalog, Delta Sharing and Data Mesh on Databricks Lakehouse

Databricks Cost Management: Tips and Tools to Stay Under Budget

Databricks Cost Management: Tips and Tools to Stay Under Budget

Advancements in Open Source LLM Tooling, Including MLflow

Advancements in Open Source LLM Tooling, Including MLflow

Get 10 Mega Boxes OR 60 Starr Drops!!

Get 10 Mega Boxes OR 60 Starr Drops!!

У Москві - про Курську область. Що кажуть на вулицях

У Москві - про Курську область. Що кажуть на вулицях

Олександрія - Шахтар / УПЛ / 4 тур / Огляд матчу #Олександрія #Шахтар #уплтб

Олександрія - Шахтар / УПЛ / 4 тур / Огляд матчу #Олександрія #Шахтар #уплтб

How AI 'Understands' Images (CLIP) - Computerphile

How AI 'Understands' Images (CLIP) - Computerphile

JVM Anatomy 101

JVM Anatomy 101

[Webinar] LLMs for Evaluating LLMs

[Webinar] LLMs for Evaluating LLMs

Do NOT Learn Kubernetes Without Knowing These Concepts...

Do NOT Learn Kubernetes Without Knowing These Concepts...

Photon Technical Deep Dive: How to Think Vectorized

Photon Technical Deep Dive: How to Think Vectorized

What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata

What is generative AI and how does it work? – The Turing Lectures with Mirella Lapata

S2024 #18 - Databricks Photon / Spark SQL (CMU Advanced Database Systems)

S2024 #18 - Databricks Photon / Spark SQL (CMU Advanced Database Systems)

What’s New in Databricks Workflows -- With Live Demos

What’s New in Databricks Workflows -- With Live Demos

Deep Dive into the New Features of Apache Spark™ 3.4

Deep Dive into the New Features of Apache Spark™ 3.4

Олександр Усик подарував Президенту пояс WBC

Олександр Усик подарував Президенту пояс WBC

Мы сделали гигантские сухарики! #большаяеда

Мы сделали гигантские сухарики! #большаяеда

КТО ЛЮБИТ ГРИБЫ?? #shorts

КТО ЛЮБИТ ГРИБЫ?? #shorts

От первого лица: Лагерь 😱 УГНАЛИ ЯХТУ 🤯 РАЗГРОМИЛИ ЛАГЕРЬ 🥹 ВЫГНАЛИ из СТРАНЫ 😭 ГЛАЗАМИ ШКОЛЬНИКА

От первого лица: Лагерь 😱 УГНАЛИ ЯХТУ 🤯 РАЗГРОМИЛИ ЛАГЕРЬ 🥹 ВЫГНАЛИ из СТРАНЫ 😭 ГЛАЗАМИ ШКОЛЬНИКА

这三姐弟太会藏了！#小丑#天使#路飞#家庭#搞笑

这三姐弟太会藏了！#小丑#天使#路飞#家庭#搞笑

Яшин - интервью после тюрьмы / вДудь

Яшин – интервью после тюрьмы / вДудь

Типичный день рождения... @Lorenzo.bagnati @Margofood @maxmakesvideo @samsebesushist

Типичный день рождения... @Lorenzo.bagnati @Margofood @maxmakesvideo @samsebesushist

Білоруська армія ВСТУПИЛА В БІЙ НА КУРЩИНІ! Лавров ВИЗНАВ ОКУПАЦІЮ ТЕРИТОРІЙ! | НОВИНИ СЬОГОДНІ

Білоруська армія ВСТУПИЛА В БІЙ НА КУРЩИНІ! Лавров ВИЗНАВ ОКУПАЦІЮ ТЕРИТОРІЙ! | НОВИНИ СЬОГОДНІ