Attrs, Pydantic, or Python Data Classes?

Поділитися
Вставка
  • Опубліковано 20 чер 2024
  • Data classes are a valuable tool in the Python programmer's toolkit. Despite their usefulness (I do like them a lot!), they do have limitations. In this video, I compare dataclasses with two alternative options, attrs and Pydantic, to help you decide which one to use in your code.
    GitHub repository here: github.com/ArjanCodes/2023-attrs
    ✍🏻 Take a quiz on this topic: www.learntail.com/quiz/qgtggs
    🚀 Next-Level Python Skillshare Class: skl.sh/3ZQkUEN
    💡 Get my FREE 7-step guide to help you consistently design great software: arjancodes.com/designguide.
    💻 ArjanCodes Blog: www.arjancodes.com/blog
    🎓 Courses:
    The Software Designer Mindset: www.arjancodes.com/mindset
    The Software Designer Mindset Team Packages: www.arjancodes.com/sas
    The Software Architect Mindset: Pre-register now! www.arjancodes.com/architect
    Next Level Python: Become a Python Expert: www.arjancodes.com/next-level...
    The 30-Day Design Challenge: www.arjancodes.com/30ddc
    🛒 GEAR & RECOMMENDED BOOKS: kit.co/arjancodes.
    👍 If you enjoyed this content, give this video a like. If you want to watch more of my upcoming videos, consider subscribing to my channel!
    💬 Discord: discord.arjan.codes
    🐦Twitter: / arjancodes
    🌍LinkedIn: / arjancodes
    🕵Facebook: / arjancodes
    📱Instagram: / arjancodes
    👀 Code reviewers:
    - Yoriz
    - Ryan Laursen
    - James Dooley
    - Dale Hagglund
    🎥 Video edited by Mark Bacskai: / bacskaimark
    💻 Code example by Henrique Branco: / henriqueajnb
    🔖 Chapters:
    0:00 Intro
    1:34 Example explanation
    4:20 Comparison
    7:07 Attribute validation
    11:26 Pros and cons dataclasses
    13:00 Pros and cons attrs
    14:14 Pros and cons Pydantic
    15:58 Final thoughts
    16:42 Outro
    #arjancodes #softwaredesign #python
    DISCLAIMER - The links in this description might be affiliate links. If you purchase a product or service through one of those links, I may receive a small commission. There is no additional charge to you. Thanks for supporting my channel so I can continue to provide you with free content each week!

КОМЕНТАРІ • 124

  • @ArjanCodes
    @ArjanCodes  8 місяців тому +1

    💡 Get my FREE 7-step guide to help you consistently design great software: arjancodes.com/designguide.

  • @BPopes
    @BPopes Рік тому +71

    Answering your question in the video about Pydantic validation (~ 14:53 ), pydantic's default mode is to validate on instantiation only. But you can set validate_assignment=True in the ConfigModel of your model to validate when you assign as well.

    • @drheaddamage
      @drheaddamage Рік тому +4

      I don't get the purpose of this. When would you want to validate on instantiation, but not on assignment? Sounds like a good way to complicate your debugging if you assume a variable is a particular type and then turns out to be overwritten with something completely different.

    • @matthiashomann6381
      @matthiashomann6381 Рік тому +13

      @@drheaddamage I guess the use case for only validating at instantiation is that after that you may trust it based on the code and not validating later may provide performance benefits. The instantiation could be with external data (user input, config file, API data) while the subsequent assignments are within the code itself (e.g. calculations)

    • @ripichipina
      @ripichipina Рік тому +6

      Also validation on every assignment is quite expensive operation for pydantic.
      There is a pattern when you create only unchangeable instances. Like using dataclasses ‘frozen’ init parameter.
      And if you want to change instance, you should create a new one.

    • @Nobody313
      @Nobody313 Рік тому +2

      @@drheaddamage Validation on assigment could be very expensive. If you use a functional approach, you will never modify data (objects) just create new ones, so makes sense.

    • @piotrgretkierewicz3906
      @piotrgretkierewicz3906 Рік тому +1

      @@drheaddamage It is not needed with frozen classes and those are extensively used in some projects. I think there was some argument that this is a bit more secure way of keeping data - creating new objects over editing existing one

  • @tjanos88
    @tjanos88 Рік тому

    Thank you, I learned a lot again. Have a nice weekend!

  • @hexomega9445
    @hexomega9445 Рік тому

    Thanks so much to bring this to the table!

  • @romanzaiev
    @romanzaiev Рік тому +3

    Thanks for the video! We use both dataclasses and pydantic in our projects for 2 different cases. Pydantic is more than a schema validation library today but we consider it should be used as a schema validation tool only. So, we use it extensively for the request/response schemas and some other things. On the other hand we have data models that we map on the corresponding DB tables and we use dataclasses for that. Basically, the standard flow is request -> pydantic validation -> some logic and dataclass models -> pydantic validation -> response. Sometimes it may be handy to map the schema to the model data automagically and for that case there is an orm_mode flag in pydantic. I just want to say they are not competitors at all.

  • @joaopedrorocha5693
    @joaopedrorocha5693 Рік тому +26

    Great video! The idea is great ... comparing tools and them giving your nuggets of wisdom on which one to choose for each job ... It's great to have a glimpse of your experience while choosing tools.
    Suggestion for a next one: comparison between web frameworks (such as fast api, django, flask)

  • @horoshuhin
    @horoshuhin Рік тому +14

    Arjan it would be awesome if you'd interview the creator of pydantic. It would be a fantastic episode of "Become A Better Software Developer" or something like that. I don't know if you've thought about this kind of format. I'd love to see pros dive into the nitty gritty and share ideas or the way to do stuff. Just a thought.

  • @rx97_mc
    @rx97_mc Рік тому +2

    You've been releasing banger videos one after another, i swear, I link or get linked your video hours after release!

  • @osjerick
    @osjerick Рік тому +1

    Awesome video! I use dataclasses and Pydantic a lot. I'll take a look at attrs now. Adding Mypy to my dev workflow has also helped me deliver better code.

  • @timorieber
    @timorieber Рік тому +13

    Hi Arjan, thanks for your effort you put in this videos, I got a lot of inspiration from them in the last years!
    I wanted to add, that Pydantic also has a dataclass decorator which is a drop-in replacement for the standard dataclasses, with all the validation features available as for the Pydantic BaseModel. Perhaps that would have been the more comparable choice.
    Keep up the great work!
    Timo

    • @evandrofilipe1526
      @evandrofilipe1526 Рік тому +1

      It has all the features of pydantic without inheritance?!

    • @timorieber
      @timorieber Рік тому

      @@evandrofilipe1526 it has all the validation features, but it is not a replacement for the Pydantic BaseModel. I recommend their docs for further details.

    • @matthiashomann6381
      @matthiashomann6381 Рік тому +4

      One thing to note is that the Pydantic dataclass has some restrictions compared to the Pydantic BaseModel features. Would be nice to compare the Python built-in dataclass, the Pydantic dataclass and the traditional Pydantic BaseModel.

  • @MedievalChips
    @MedievalChips Рік тому

    Thank you for the great video. Love the deep dive in new packages.

  • @guyonlead
    @guyonlead Рік тому +6

    I’m been rocking Pydantic for about a month now and I’m absolutely in love with it. It’s simpler to use than data classes IMO when using REST API. Granted I have a bunch of base models into another class then I use my own methods to manipulate the data but it works out nicely in the end.

    • @dpav02
      @dpav02 Рік тому

      Agreed. Pydantic also works extremely well when using an ORM like Sqlalchemy.

  • @dorb1337
    @dorb1337 Рік тому +2

    Great video ! a question that pops - Where are you getting your knowledge from ? reading the documentation is nice, and playing around with a module or a library is great - but aren't you afraid to miss out few of the features ? are there any good forums or knowledge resources you're using to stay up to date and understand all the possible features of a library or module you use ?

  • @grzegorzbajson3897
    @grzegorzbajson3897 Рік тому +4

    It's worth to mention the performance. Dataclasses are way faster than BaseModel (I learnt this the hard way :-/). There are some improvements expected with Pydantic 2.0, but for now:
    In [1]: from dataclasses import dataclass
    In [2]: @dataclass
    ...: class A:
    ...: a: int = 1
    ...: b: str = "one"
    ...:
    In [3]: %timeit a = A()
    80.5 ns ± 0.212 ns per loop (mean ± std. dev. of 7 runs, 10,000,000 loops each)
    In [5]: from pydantic import BaseModel
    In [6]: class B(BaseModel):
    ...: a: int = 1
    ...: b: str = "one"
    ...:
    In [7]: %timeit b = B()
    1.03 µs ± 3.7 ns per loop (mean ± std. dev. of 7 runs, 1,000,000 loops each)

  • @Patrick-hl1wp
    @Patrick-hl1wp Рік тому

    Super helpful information, thanks Arjan

  •  Рік тому

    As always, awesome video!
    Thank you for the great content. I'm improving a lot my developer skills with your lessons.

  • @TomTrval
    @TomTrval Рік тому

    Great overview thank you :) Personal preference: dataclass with `slots=True`, I hope we get build in object pooling like `pool=300` one day for for dataclasses

  • @domenechj
    @domenechj Рік тому +3

    Great video as always! By the way, what library do you prefer for performing validation on dataframes?

    • @babc142
      @babc142 Рік тому

      I've used pandera pretty heavily in production and it's very capable and the developer is super helpful. But I haven't explored pydantic extensively, and I feel there might be some advantages to using it in exchange for writing quite a lot more code

  • @pamdemonia
    @pamdemonia Рік тому

    Just a late thank you for your dataclasses video which taught me about the __post_init__() method which really came in handy recently.

  • @JohnFKenobi
    @JohnFKenobi Рік тому

    great stuff

  • @GigaMarou
    @GigaMarou Рік тому +1

    Hey Arjan, great video. How would you use dataclasses, attrs or pydantic to validate Tabular Machine Learning data. Greetings from Germany :)

  • @CarlosMorenoV
    @CarlosMorenoV Рік тому

    Nice video, I realised about attrs because of Fluent Python 2nd edition by Luciano Ramalho. There is an entire section in that book about Data classes. If you ask me, I prefer to use attrs or Pydantic. You can use dataclass to prototype an initial version of some tiny app. However, once you need to build something more professional, you definitely need to go beyond. And Attrs or Pydantic is the correct choice.

  • @viajeradelaluz
    @viajeradelaluz Рік тому

    Great Video! I actually started using dataclasses when I saw your video about it few months ago. I 'd like you to make a short video with pros and cons about your recommendation to use int() instead of float() type for prices fields. You left me thinking about that idea. Many thanks! 😋

  • @KoyaMel
    @KoyaMel Рік тому +5

    literally 5 minutes ago, I was in the situation to decide which to use. (you read my mind)

    • @ArjanCodes
      @ArjanCodes  Рік тому +1

      Haha, so I was just in time :).

    • @daverichardhadley
      @daverichardhadley Рік тому +2

      What did you choose?

    • @TheSzybas
      @TheSzybas Рік тому +3

      @@ArjanCodes because the wizard is not late nor early. Just in time.
      Like a compiler...

    • @KoyaMel
      @KoyaMel Рік тому +1

      dataclass for the win

  • @simmonslucas
    @simmonslucas Рік тому +2

    You already convinced me of dataclasses. They spread across our code base like a slime mold.

  • @matercomus
    @matercomus Рік тому

    Great video!

  • @danimperiale8545
    @danimperiale8545 Рік тому

    Dude, great video. Just really really great.

  • @EricChenx
    @EricChenx Рік тому

    Thanks!

  • @GeorgelPreput
    @GeorgelPreput Рік тому +1

    Would be great to have a follow-up which compares validation in pydantic, param and traitlets

  • @mithunmanoharmithun
    @mithunmanoharmithun Рік тому

    Awesome content as always! Could you please make video on how to persist these objects into a datbase and best practices to do so ?

    • @ArjanCodes
      @ArjanCodes  Рік тому

      Thanks! Your suggestion is noted :)

  • @helish_88
    @helish_88 Рік тому

    My english is not so good, usually I don't understand all but ur speech is so good, I understood almost everything, btw its very helpful video

  • @PavithraSuLa
    @PavithraSuLa Рік тому

    Awesome video! Thank you for the great content.
    Could you please make video about python metaprogramming and metaclass with real world example.

    • @ArjanCodes
      @ArjanCodes  Рік тому

      Thanks for the suggestion it's noted!

  • @whkoh7619
    @whkoh7619 Рік тому

    thanks for the compare! the real content is here as always ;)

  • @shS83
    @shS83 3 місяці тому

    Great video. Good explanation for using int in price. This, like your other videos, come in handy in lots of different scenarios. Thank you for your contribution.

    • @ArjanCodes
      @ArjanCodes  3 місяці тому

      I'm happy to hear you're enjoying the content!

  • @hargovind2776
    @hargovind2776 Рік тому

    I hit like even before starting your video. Already know the content's gonna be amazing, keep doing the awesome work!!!!

  • @realplod
    @realplod Рік тому

    Thanks

  • @evandrofilipe1526
    @evandrofilipe1526 Рік тому

    I would like to know what you think about typeguard module since I never seen you use it.

  • @mariamozgunova9584
    @mariamozgunova9584 10 місяців тому

    In Python, one can also use `Decimal` to represent monetary amounts

  • @NopeNotMeNoSiree
    @NopeNotMeNoSiree 7 місяців тому +1

    I had chosen pydantic for a few projects, but now I'm ripping it all out because of the incompatible API changes brought by 2.0. It appears the authors of pydantic are _very_ opinionated about how method names should be formatted, so they cavalierly replaced parse_raw by model_validate_json. Not only is the function name much longer, but it apparently does exactly the same thing. If I keep pydantic in the project, then it's like a ticking time bomb for future developers.

  • @cppsage
    @cppsage 2 місяці тому

    Hehe, this is REALLY a NIT. I was told that zero is neither negative or positive, that function in the attr example is non-negative, not positive. (Of course with IEEE-754 we can get zero BOTH negative or positive, but that is only useful for a zero that isn’t really zero, just too small)

  • @MrVernuk
    @MrVernuk 3 місяці тому

    Thank you fot this video! It's really helpfull ☺

    • @ArjanCodes
      @ArjanCodes  3 місяці тому

      I'm glad to hear the video was helpful!

  • @kosmonautofficial296
    @kosmonautofficial296 Рік тому

    When you printed that banana it totally made sense to me

  • @potens1
    @potens1 Рік тому

    About the representation in pydantic, instead of printing the banana, you can do print(repr(banana)), you will get Banana(name='banana', category='fruit'....).

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Рік тому

    4:50 Sounds like defining key fields in a database record.

  • @OzFush
    @OzFush Рік тому

    What brand is your hoodie? I want one but can’t find the logo in google image search 😂

  • @cppsage
    @cppsage 2 місяці тому

    And another nit in the attrs example, and this may very well be my misunderstanding of str, but instead of lower should rather we use casefold for comparisons? As I understand that is dedicated for that purpose.

  • @SimeonNedkov
    @SimeonNedkov Рік тому +1

    "Don't be an attr (etter)..." is a great way to start the week! 😁

  • @giovanniosorio4759
    @giovanniosorio4759 Рік тому

    I haven’t been able to understand Enum classes and what exactly are they useful? I've been looking for videos that explain it well but there aren't any. Could you please make one?

  • @ArtyomShakurov
    @ArtyomShakurov Рік тому

    Thanks! Can you please compare pydantic and marshmallow?

  • @proofit404
    @proofit404 Рік тому

    Лучший!

  • @SuperDoc3000
    @SuperDoc3000 Рік тому

    Thanks @ArjanCodes, quality content every time!

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Рік тому +1

    4:07 Why not use the Decimal type?

  • @WilliS1651686868
    @WilliS1651686868 Рік тому

    Are there any difference between those three options in terms of performance?

  • @cppsage
    @cppsage 2 місяці тому

    Can’t read all comments, but I think I saw a bug in the dataclass example. I think if your code would ever run through a midnight you’d need that default for the order date to also be dynamic with default_factory, otherwise it’ll be the date of the creation of the *class* and not the instance. So if I started my program December 2023, all my orders will come up with that date by default.

  • @WilliS1651686868
    @WilliS1651686868 Рік тому

    I really like the "frozen" option of dataclass that allow to have some sort of immutability. Does attrs or pedantic have something similar?

    • @johantoday
      @johantoday 9 місяців тому

      Pydantic supports frozen fields or frozen instances

  • @jkrigelman
    @jkrigelman Рік тому

    Now I need to learn what atter is in Dutch.

  • @Vijay-Yarramsetty
    @Vijay-Yarramsetty Рік тому

    I've been having this doubt since a long time. which one among them is better?

  • @CrazyDalekCaan
    @CrazyDalekCaan Рік тому

    You missed something important: pydantic has its own dataclass decorator which can be used much like a BaseModel, but is fully API compatible with a built-in dataclass

  • @PraecorLoth970
    @PraecorLoth970 Рік тому +1

    Nice video, cheers from a country where taxes aren't limited to 100% 😭

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Рік тому +1

    7:38 As soon as you said it uses subclassing, I immediately thought “there must be a metaclass involved”.
    Had a look at the source code, and yes, that is how it works.

  • @tanjt107
    @tanjt107 Рік тому

    I am not sure if why question is clear and related to this video. Sometimes in my classes there are a lot of lines in the init/post_init method. The reason of this is I don’t want the same thing being calculated many times and saving the result in memory can speed up the process. Maybe I should just use function instead?

    • @ArjanCodes
      @ArjanCodes  Рік тому +1

      If it's a complicated computation, I would suggest to move it out of the init method and into a separate function. You could also look into functools cached_property decorator. This computes a value once and then memoizes it.

    • @tanjt107
      @tanjt107 Рік тому

      @@ArjanCodes Thanks! I never know this decorator. It helps!

  • @Vodkarh
    @Vodkarh Рік тому +25

    ok now I want to know what attrs means in dutch

    • @RutNij
      @RutNij Рік тому +3

      The dutch word is 'etter', which is pronounced similar to attr in english. Etter translates more or less to 'jerk' in english 😂

    • @HanWechgelaer
      @HanWechgelaer Рік тому

      Actually attr sounds like the Dutch word "etter" which means a brat who is acting up.🙂

    • @samsung40_media87
      @samsung40_media87 Рік тому

      in afrikaans, it is pretty bad

    • @evandrofilipe1526
      @evandrofilipe1526 Рік тому

      ​@@samsung40_media87 what does it mean?

    • @JoergAsmussen
      @JoergAsmussen Рік тому

      he is een etter -> he's a jerk (according to Google translate)

  • @AngDmz
    @AngDmz Рік тому

    With pydantic you have super powerful objects factory to perform really good OOP without much effort

  • @LeftLib
    @LeftLib 5 місяців тому

    as a newcomer I was hoping to find The Answer, what I should be using beginning from now. After watching this I am not sure, it seems odd to have a choice of 3. What would it be like 5 years from now?

  • @ErikS-
    @ErikS- Рік тому

    I honestly used dataclasses for some time, but I stopped using them.
    I prefer having all classes written according to the standard rules for classes. So no instance variables in the part where you would standard see the class variables defined.
    Instead of mixing the dataclasses into regular classes, It would have been better if a new type of datastructure would be implemented in Python, and which is available by DEFAULT. Making the distinction with a decorator just doesn't do it for me... I like to say that pydantic is by the way a 'cleaner' solution imo...

  • @esmaelmohamed8209
    @esmaelmohamed8209 6 місяців тому

    Arjan how I communicate you officially for your courses?

    • @ArjanCodes
      @ArjanCodes  6 місяців тому

      To talk about my courses you can send an email to: support@arjancodes.com

    • @esmaelmohamed8209
      @esmaelmohamed8209 6 місяців тому

      @@ArjanCodes Okay Arjan I will. ❤

    • @esmaelmohamed8209
      @esmaelmohamed8209 6 місяців тому

      @@ArjanCodes Now I did that

  • @ac130kz
    @ac130kz Рік тому

    attrs is excellent, but it's kinda not compatible with everything else, so it's an all in approach

  • @julienyt1600
    @julienyt1600 Рік тому

    Pydantic's Soo good to ensure that your python objects can easily become dicts, serialize in JSON, parse_raw query results and validate at the same time.
    I use pydantic all the time so other developpers only need to learn them and not have to deal with the differences coming in dataclasses fields

  • @_Amilio_
    @_Amilio_ Рік тому +1

    I forgot that "attrs" in dutch was niet zo leuk 😂

  • @Blingblingblingling
    @Blingblingblingling Рік тому

    Curious. Why not, like price being int, make weight also int and represent the smallest unit we care about (like gram)?

  • @drheaddamage
    @drheaddamage Рік тому +2

    I like attrs a lot, but it is a bit funny to me that we say: "Python is great because it is dynamically typed", and then the first thing we do is strictly typing using one of these packages...

  • @nixonkutz3018
    @nixonkutz3018 Рік тому

    I always get tickled by the volume and popularity of code that attempts to bolt type checking onto a duck-typed language

  • @dalaymann
    @dalaymann 6 місяців тому

    한국인 중에 Pydantic 을 가르쳐 줄 수 있는 사람은 대개 미국에 살고 있으니... 감사합니다.

  • @HanWechgelaer
    @HanWechgelaer Рік тому

    Nice video, now don't be a attr and like the video 😂😂😂