I appreciate the nods to the R community going on in here. Great video!
all 5 of them.
DuckDB is the most underused and underrated Python library. I started using it a couple of weeks ago and I'm blown away by the efficiency increase over Pandas. Plus, SQL is easier, and it forces you to think in vectorized operations rather than being tempted by Pandas' built-in loop methods, which are super slow.
How about DuckDB and SQLAlchemy? Do they shake hands? Can I do ORM with this?
Yep, here are the MotherDuck instructions for it: motherduck.com/docs/integrations/sqlalchemy
(It also works with vanilla OSS DuckDB, with the driver linked from there.)
Well, I had just started to learn Polars, but your video and another one comparing DuckDB and Polars are making me doubt my choice… DuckDB seems MUCH faster. Besides, SQL knowledge can be leveraged for everything. Why would one use Pandas or Polars over DuckDB? Am I missing something?
I understand the doubt :) Apart from features, there is the debate about the DataFrame vs. SQL approach.
While both Polars and DuckDB support DataFrames & SQL, DuckDB is primarily designed to be used through SQL.
So if you're a SQL lover, DuckDB is a no-brainer. Polars also has a SQL interface, but it's pretty recent.
@@mehdio Hmm, I'm not really a SQL lover; I just want to use what works best as a data scientist. Manipulating a DataFrame is really convenient when exploring data. Maybe DuckDB + Polars? But I like simplicity; I would rather use one tool only. Choices, choices…
Same here. Just finished a rewrite from Pandas to Polars and it's already out of date. Although I'll likely be using Polars for the in-memory stuff and DuckDB for out-of-memory persistent data. The differences in speed are not gigantic if you consider the bigger picture and Polars development is very active, they are getting faster with every minor version.
Polars is best for continuous operations on columns.
Also, it doesn't support indexes, so you can't do positional lookups (row i, column j) the way Pandas does.
@@armeyavaidya3464 Indexes can be simulated by using a column as an index.
what about DuckDB vs Dask?
Thank you for this valuable content!
Can you also explain Parquet datasets?
I used to create partitioned Parquet datasets using Pandas and Polars.
But I want to know how to read data from such partitioned Parquet datasets directly into the Polars LazyFrame format (not into Pandas, as the data size is larger than memory) to do some analytics.
import polars as pl
import pyarrow.parquet as pq
# Read data written to a Parquet dataset
# (pd_df_schema is defined earlier in my code)
pq_df = pq.read_table(r"C:\Users\test_pl", schema=pd_df_schema)
pl_df = pl.from_pandas(pq_df.to_pandas()).lazy()
Is there a better way to do this?
As per the Polars documentation, docs.pola.rs/py-polars/html/reference/api/polars.scan_pyarrow_dataset.html#polars.scan_pyarrow_dataset
You can use scan_pyarrow_dataset() to read from partitioned datasets.
Is DuckDB a query language, a real DB like SQLite, or both?
It's a real DB like SQLite! But it innovates a lot around SQL; read more here: duckdb.org/2022/05/04/friendlier-sql.html
Too many words, little information.
I guess I'm stating the obvious but for anyone who doesn't use SQL for data operations DuckDB is second class. And I surely do not like to use SQL for transformations and such.
I agree. DuckDB seems great for what it is but I find method chaining and the expression syntax of Polars much less cognitively demanding than SQL. But then I don't have a ton of experience with SQL so I'm not used to thinking in the way it requires.
SQLite is faster yo
Not for analysis. SQLite is OLTP, not OLAP.