ElixirDaze 2016 - Processing 2.7 million images with Elixir (vs Ruby) by David Padilla

Поділитися
Вставка
  • Опубліковано 17 січ 2025

КОМЕНТАРІ • 71

  • @amerispunk
    @amerispunk 7 років тому +10

    Great video. I like how you walked us through every step as you're learning all this for yourself. Very entertaining, as well! The color of your slides in a few cases could have been better, though, as was previously mentioned.

  • @johnjames282
    @johnjames282 8 років тому +51

    good talk, please consider not using white on yellow slides. Most of those slides were very unreadable.

  • @supernewuser
    @supernewuser 8 років тому +59

    Whoever told you bright yellow slides with white text was acceptable is acting against your best interests.

    • @labcrowd
      @labcrowd 8 років тому +4

      Agreed. It was like putting hot lava stones on my eyes..

  • @nonefvnfvnjnjnjevjenjvonej3384
    @nonefvnfvnjnjnjevjenjvonej3384 4 роки тому +8

    Another option would be to resize the images using css and get a background job to resize just those images so next time they will be the right size. That way you do it in chunks.

  • @sureseam
    @sureseam 8 років тому +3

    Communicated very well - thank you!

  • @balavarikuti
    @balavarikuti 8 років тому +3

    great talk. i have watched all the way. i liked the last piece especially erlang server handling other servers.

  • @AdrianoMitre
    @AdrianoMitre 8 років тому +12

    Wouldn't it be easier to simply have resorted to GNU parallel and ImageMagick?

    • @batlin
      @batlin 7 років тому

      Probably, but you'd have to wrap each ImageMagick invocation in calls to something like s3cmd get/put, and like he said in the talk, the overhead of starting and setting up s3cmd processes each time makes it slow.

    • @eafadeev
      @eafadeev 7 років тому

      he could have put the S3 put calls from threads, that would dramatically parallelize uploading of the resized images. Resizing can be done by multiprocessing to resolve the GIL issue. I.e. ruby (or python or any other language x) would do the job just as fast.

    •  4 роки тому +2

      He said he wanted to try Elixir.

  • @ThePhan7em
    @ThePhan7em 4 роки тому +6

    A better solution would have been to dynamically resize the image as they are requested. That way you only resize the images that users are actually requesting. Once the image has been resized, store that on S3 and serve that going forward. Still an informative presentation, thank you.

    • @Qrzychu92
      @Qrzychu92 3 роки тому

      there is one problem with that - it takes 2s to do that per image, right? So, even if you did that concurently, images still would all appear 2s after the page loaded

    • @ThePhan7em
      @ThePhan7em 3 роки тому +2

      @@Qrzychu92 Even if that were the case, that would happen once. To avoid this, you could show a thumbnail in its place until the resized image is ready. That is if you have a thumbnail(which you should) to show to begin with, if not, we are back to your point which is valid.

    • @Qrzychu92
      @Qrzychu92 3 роки тому

      @@ThePhan7em yes, once for every image :) if you were unlucky, you could be the user that triggers it on every image you see.
      It all depends whether this is acceptable in the app context. In this case, where users rely havily on images in decision making, I think parsing all images upfront wa s good decision.

    • @ThePhan7em
      @ThePhan7em 3 роки тому +1

      @@Qrzychu92 I dont agree tbh. You may end up paying for storing images that may only be seen once, then storing it at cost. Have a look at imgix.com. You can roll your own AWS Lambda or the likes relative ease.

    • @Qrzychu92
      @Qrzychu92 3 роки тому

      @@ThePhan7em then why even bother with scaling the images on upload when you can scale them on demand?
      It's possible that storage is cheaper than running AWS lambda. Like I said, it all depends. Imagine instagram scaling images when poeple first see them. Would everyone notice? Probably not.

  • @jpmohan96
    @jpmohan96 7 років тому +8

    Why wasn't CSS used for resizing images?

    • @goodcyrus
      @goodcyrus 6 років тому

      That's what I thought from the beginning. I thought someone there would stop his talk or in comments. Only 2 people mentioned this

    • @Itachi.Uchiha.Offical
      @Itachi.Uchiha.Offical 5 років тому +7

      1) Decrease loading time (browser)
      2) Save storage (some kB make a difference on 2.7m images)

    • @smonkey001
      @smonkey001 5 років тому +5

      CSS is the correct 1 day solution.

  • @morkhoudia9
    @morkhoudia9 7 місяців тому

    Great talk

  • @ferdinandenario9137
    @ferdinandenario9137 8 років тому +3

    Nice talk on Elixir! thanks

  • @elie2222
    @elie2222 8 років тому +1

    Why were you limited to 20 cores? Couldn't you have run the job on multiple machines? 4 * 20 for example (splitting up the images into 4 groups) and had it all done in a day?
    (I didn't watch the whole thing. Sorry if I missed the answer)

    • @vasumahesh3800
      @vasumahesh3800 8 років тому +6

      You are right, Elixir can be deployed to different machines and be interconnected as well. But I guess as per the video the guy didn't have much time. But still a wonderful video.

  • @nicanorperera
    @nicanorperera 8 років тому

    Gracias! Muy buena presentación.

  • @clintonmoshe9544
    @clintonmoshe9544 Місяць тому

    2024, great talk.

  • @InsanityNerve
    @InsanityNerve 8 років тому +1

    Great talk!

  • @christianrojasgar
    @christianrojasgar 8 років тому +1

    Awesome talk!, thanks

  • @BorisBarroso
    @BorisBarroso 8 років тому +1

    Great, Buena presentación, sigo aprendiendo Elixir y tratando de cambiar de Ruby a Elixir :)

  • @lostcodex
    @lostcodex 8 років тому +3

    Did you open source your work on github?

    • @btc-btc-net
      @btc-btc-net 7 років тому +8

      github.com/dabit/elixir_images

  • @Thetastygamers
    @Thetastygamers 8 років тому

    Perfect speech. Thank you

  • @emanuelquimper697
    @emanuelquimper697 8 років тому +5

    I want this color syntax :(

  • @ArquitectoR
    @ArquitectoR Рік тому

    Mogrify is just a wrapper of command line call of ImageMagic, so the same issues here like with the first attempt to upload to S3.
    Bindings to magick++ library or Vix should be used instead.

  • @iansoulful
    @iansoulful 8 років тому +10

    I wonder if using AWS lamda would have saved, money developer time, download and upload time of images. it would probably be cheaper and much faster. Nevertheless great talk and information on Elixir :)

    • @marksargento840
      @marksargento840 8 років тому +2

      I guess it's not possible to use lambda on Digital Ocean(they're not on EC2 from what I heard on the video) but I agree with you, they would have saved time and money. Great video though!

    • @omarlujan1809
      @omarlujan1809 8 років тому

      probs

    • @RistoNovik
      @RistoNovik 8 років тому +2

      Well the images are stored already in AWS S3, so there is no problem to spin of lambda to do the batch work. So there would be no need to extra download and upload time waste needed.

  • @정동호-i1r
    @정동호-i1r 5 років тому +1

    It's very helpful for me who new to the elixir.

  • @christopheverbinnen3626
    @christopheverbinnen3626 8 років тому +5

    Ever heard of Sidekiq?

    • @_tachyons
      @_tachyons 7 років тому

      Christophe Verbinnen AFAIK sidekiq still uses single cpu core unless you purchased their enterprise version. Or am I missing something

    • @christopheverbinnen3626
      @christopheverbinnen3626 7 років тому +1

      Unless you use jruby or you start on sidekiq process per core. Pretty straight forward to setup.

  • @klancaster1957
    @klancaster1957 8 років тому +1

    Nice talk!

  • @giftoflife6921
    @giftoflife6921 2 роки тому

    Before you upload your videos it might help if you view it from beginning to end to see if it's good.
    My point is this. You displayed some white-colored text against a bright yellow background, which makes for difficult reading.

  • @btc-btc-net
    @btc-btc-net 7 років тому +1

    Good talk.

  • @michaelkohlhaas4427
    @michaelkohlhaas4427 4 роки тому

    *Why doing it the easy way if we can make it complicated?*

  • @hvgpilaatkaiken2300
    @hvgpilaatkaiken2300 5 років тому +3

    Fun talk, but the story about time pressure is obviously BS. If you have to solve a trivially data parallel task asap, you don't spend 12 days learning a new language because it's designed for massive parallelism and run it on a single node for 4 days.
    You split the data to N batches, where N is the number of cores, and run N instances of whatever global interpreter locked language you're comfortable with and be done 12 days faster. Or split the data to 100 patches, make a shell script to kick up 100 instances in EC2, run your program on each and be done before lunch...

  • @RafidelisMaker
    @RafidelisMaker 8 років тому +2

    I'm sintax guy too! Nice talk, thanks for sharing.

  • @romenigld
    @romenigld 7 років тому

    Nixe talk, congrats!

  • @subratamajumdar13
    @subratamajumdar13 10 місяців тому

    It is HTTPoison not HTTPotion!

  • @pookachu64
    @pookachu64 8 років тому

    Flaw in the logic of using digital ocean - you don't pay for transfer costs between s3 and ec2. Do the whole thing in aws!

  • @blu3h4t
    @blu3h4t Рік тому

    maybe if he typed how to upload to s3 with erlang instead of elixir he would find something, if you are already on the otp maybe was worth a try

    • @blu3h4t
      @blu3h4t Рік тому

      ah ok i c i was too fast :D

  • @ultort
    @ultort 7 місяців тому

    Really cool, I love Elixir, but using it for a one shot to resize images it really overkill, specially if as told this was something needed ASAP. 3M records is manageable, extract it in a file, split it in N*k files, write a batch script that download the images, process them and upload them, send it to N machines, start k processes that run concurrently on each machine and wait. If you have 5 machines with 20 cores, even at 1.6s/image (probably can be improved with batch processing), it will only take 13 hours, the next day your images would be ready. Enjoy your free time to learn more about Elixir.

  • @ThrashAbaddon
    @ThrashAbaddon 8 років тому

    Good and fun talk. :)

  • @haystackdmilith
    @haystackdmilith 8 років тому +2

    Like for that Ruby-Threading troll

  • @juliolinarezescobar
    @juliolinarezescobar Рік тому

    Now this 15:20 it is so different

  • @leo11877
    @leo11877 7 років тому +2

    or maybe use responsive CSS images?

  • @lr5867
    @lr5867 7 років тому +1

    Arguably he might've done the same batch execution w/ C++, using a Ruby supervisor task to spawn as many small processes to accomplish the same task. Speed differential .... maybe gain another 33% - 50% in performance??

    • @lr5867
      @lr5867 7 років тому

      As for resizing images ... well, they'll load faster, but he could've just as well changed the CSS to scale the images instead. I think I could've figured a way to do it in a half hour but tell the boss it took 2 days ... :)

    • @batlin
      @batlin 7 років тому

      Yeah I was wondering why the images needed to be resized at all... maybe the quality of dynamically scaling them down in the browser is slightly less than you can get with ImageMagick.

    • @techorb5799
      @techorb5799 7 років тому

      This is exactly how I handled it with Ruby 1.9. In my case I was creating millions of animated 2 frame banner ads. I used mini-magik to create the frame, then a cmd shell all out to `convert' utility to animate the frames together. Each batch gets fed to a process which is Fork 'd and processes the batch sequentially. Get a big AWS instance with lots of cores handle all the processes and start spawning forks until the machine falls over, then dial back a couple. could generate millions of images in a matter of hours.

  • @NikolajLepka
    @NikolajLepka 8 років тому

    I don't get why he explains elixir features at an elixir conference... you'd assume people at an elixir conference already know how to use elixir

    • @_tachyons
      @_tachyons 7 років тому +2

      Niko L Not really, In every conference there will people from different levels of expertise. Especially in new programming language like elixer,there will many people who came to explore the basics of Elixir

  • @torvic99
    @torvic99 6 років тому

    Latin America #1

  • @edugonch2
    @edugonch2 7 років тому

    Next time you just use Cloudinary XD, nice talk by the way.

    • @pablocacaster
      @pablocacaster 7 років тому

      So you wanna pay 140 a month for a slow service?