Top Object Detection Models in 2023 | Model Selection Guide sponsored by Intel

Поділитися
Вставка
  • Опубліковано 1 чер 2024
  • Description:
    Discover the top object detection models in 2023 in this comprehensive video. We compare models like YOLOv8, YOLOv7, RTMDet, DETA, DINO, and GroundingDINO based on metrics like Mean Average Precision, community support, packaging, and licensing for you to decide which is best for your production AI applications. The video also details the challenges in comparing model speed and highlights important nuances within the realm of object detection, like choosing the right model for the right hardware and use case. It's an essential watch for anyone interested in computer vision and model selection. This research was sponsored by Intel.
    #ObjectDetection #ComputerVision
    Chapters:
    - 00:00 Introduction
    - 00:35 Object Detection
    - 01:42 Mean Average Precision
    - 02:28 Speed
    - 03:40 Paper, Packaging, and License
    - 04:35 YOLOv8
    - 05:21 YOLOv7
    - 06:06 YOLOv6-v3
    - 07:01 RTMDet
    - 07:46 RT-DETR
    - 08:50 DETA
    - 10:02 GroundingDINO
    - 10:37 Model Community Comparison
    - 11:46 Conclusion
    Resources:
    - 🌏 Roboflow: roboflow.com
    - 📚 Roboflow Notebooks Repository: github.com/roboflow/notebooks
    - 🌌 Roboflow Universe: universe.roboflow.com
    - 📈GPU vs Intel HPU (new hardware options for AI): blog.roboflow.com/gpu-vs-hpu/
    Stay updated with the projects I'm working on at github.com/roboflow and github.com/SkalskiP! ⭐
    Remember to like, comment, and subscribe for more content on AI, computer vision, and the latest technological breakthroughs! 🚀
  • Наука та технологія

КОМЕНТАРІ • 73

  • @hawkingradiation3774
    @hawkingradiation3774 8 місяців тому +16

    would also like to see a video for comparison for segmentation tasks as well.

    • @Roboflow
      @Roboflow  8 місяців тому +6

      Awesome idea! I'm curious if there are more people who would like to see that. It's a lot of work to create video like that.

    • @sozno4222
      @sozno4222 8 місяців тому +2

      @@RoboflowI would also be interested in a video like that

    • @visuality2541
      @visuality2541 7 місяців тому

      Indeed

  • @ferneutron
    @ferneutron 4 місяці тому

    Super great job, thanks!

  • @AlainPilon
    @AlainPilon 8 місяців тому +5

    Thanks for the work! I am wondering if the COCO metric is actually useful. It is an interesting comparison point but I am not sure how it translates in practice given that most people only train detector for a very few set of classes. Community support, documentation and how the model integrates in the current ecosystem is much more impactful and I am glad you added these to your chart.

    • @Roboflow
      @Roboflow  8 місяців тому

      I'm glad you agree with my methodology. I think the license is also very important. After all you must be able to use the model in your project. As for mAP, 100% agree. I'd love to have other metrics that I could use. We developed RF100 metric - paperswithcode.com/dataset/rf100, but I didn't have enough datapoints to compare all the models.

  • @rperezalejo
    @rperezalejo 8 місяців тому +1

    Great video, right now I am working on a real time sport object detection and this video comes like a charm for me. I can test other posibilities I did not had in mind.

    • @Roboflow
      @Roboflow  8 місяців тому

      Awesome we came at the right one! I’d love to here about your results once you finish tests.

    • @rperezalejo
      @rperezalejo 8 місяців тому

      ​@@Roboflow I have to retrain on 4k images first and then see the performance of the models in real time, I am not sure If i am going to be able to test the all, but the idea of having the golden cub gives a sign on where to start. It is a great guide anyways

  • @mpty2022
    @mpty2022 8 місяців тому +2

    thanks for sharing your research

    • @Roboflow
      @Roboflow  8 місяців тому +1

      My pleasure!

  • @juanolano2818
    @juanolano2818 8 місяців тому +3

    Will go with Yolo8 for my current microbes identification project :) Thank you Piotr!

    • @Roboflow
      @Roboflow  8 місяців тому

      If it is open source or academic good choice! 👍🏻

    • @saurabhgupta7148
      @saurabhgupta7148 3 місяці тому

      I am also working with images of micro organisms. Did you get good results with YoloV8?

  • @romroc627
    @romroc627 8 місяців тому +1

    Excellent video, really useful.

    • @Roboflow
      @Roboflow  8 місяців тому

      I’m so happy to see such a positive feedback!

  • @satellite-image-deep-learning
    @satellite-image-deep-learning 7 місяців тому +1

    Fantastic summary, thank you for the effort that went into this! MMdetection has come up before and I would love an intro video on it 😊

    • @Roboflow
      @Roboflow  7 місяців тому

      We already have MMDetection. :) Take a look on our channel

    • @segheysens
      @segheysens 7 місяців тому

      They already created a video here! 🙌 ua-cam.com/video/5kgWyo6Sg4E/v-deo.html

  • @johannesmokami5760
    @johannesmokami5760 8 місяців тому +1

    Thanks for the info
    I will definitely try them as well.

    • @Roboflow
      @Roboflow  8 місяців тому +1

      Awesome! Which detector are you going to try?

    • @johannesmokami5760
      @johannesmokami5760 8 місяців тому

      I'm currently trying out YOLOv8 but I'd like to try YOLOv7 and GroundingDINO @@Roboflow

  • @user-fj1zz9mi5r
    @user-fj1zz9mi5r 7 місяців тому

    I was looking for performance over the time inference for edge devices. I was trying to use Yolov8 for edge deployment into STM32 but at the end, i realized this model was too big for this card. What do you think is a good model for a good ratio between inference time / model size? Thanks for your response

  • @Seethis-HD
    @Seethis-HD 7 місяців тому +2

    Excellent video. Thanks for the efforts. I was wondering why you didn't consider YOLO-NAS in the list?

    • @Roboflow
      @Roboflow  7 місяців тому

      I considered it but ultimately decided not to include it. I’m pretty confident those models are better choices.

    • @PhilippBlum
      @PhilippBlum 3 місяці тому

      @ow What was the reason not to include it? Accuracy?

  • @user-rx4wd3xn2c
    @user-rx4wd3xn2c 5 місяців тому

    can we use yolov8 pretrained weights for commercial use?

  • @seanolivieri4829
    @seanolivieri4829 6 місяців тому +1

    Do you have a video on licenses? I dont understand any of those and which one should I use if I want to be able to sell my program or call it my own

  • @EliSpizzichino
    @EliSpizzichino 8 місяців тому +3

    We need a platform to fully compare them on real datasets on real training on the same device.
    Is also important to keep track if a version change produce worsening quality. I've noticed for example that between one minor version and another of the ultralitics codebase the quality of the final trained model worsened by a lot.

    • @Roboflow
      @Roboflow  8 місяців тому

      That's super interesting! Could you share more insights on what the versions were? I'd love to do more investigation.
      As for the "platform to fully compare them on real datasets", have you seen RF100? paperswithcode.com/dataset/rf100

    • @EliSpizzichino
      @EliSpizzichino 8 місяців тому

      @@Roboflow The last tested good version was 8.0.103. Unfortunately I had no time since then to do further investigation my self but I remember trying a couple of training out with some versions after that and got worse results
      I haven't tested the RF100 yet, it's a good effort and I like what you do as company (I never left such a good comment to anyone in my life :)

  • @mr_tpk
    @mr_tpk 5 місяців тому

    Thank you ❤

  • @rafael.gildin
    @rafael.gildin 8 місяців тому +1

    Great Video 🎉

    • @Roboflow
      @Roboflow  8 місяців тому

      My pleasure! 🔥

  • @tryingtobeproductive
    @tryingtobeproductive 8 місяців тому +1

    This video is extremely useful 10/10

    • @Roboflow
      @Roboflow  8 місяців тому

      Thank you! Awesome to hear such a positive feedback 🔥

  • @EastAfrica_Vehicle_Importers
    @EastAfrica_Vehicle_Importers 8 місяців тому +1

    I love your videos.
    Am working on object tracking and counting, which the best tracking algorithms can you advice me to employ

    • @Roboflow
      @Roboflow  8 місяців тому

      Do you need to run in real time?

    • @EastAfrica_Vehicle_Importers
      @EastAfrica_Vehicle_Importers 7 місяців тому

      Yes bro ,am doing my thesis . i collected vehicle dataset ,now I need to detect and track in order to count@@Roboflow

  • @pleison111
    @pleison111 3 місяці тому +1

    I have a question, I am working on a OCR project, I am using a fastrcnn with resnet50 as object detector, and then I need something like a conv + GRU or ViT to decode the text, do you have some suggestions regarding OCR?

    • @Roboflow
      @Roboflow  3 місяці тому

      First of all why Fast RCNN? As for OCR did you try Tesseract?

  • @zskater1234
    @zskater1234 7 місяців тому +2

    I’m currently porting GroundingDINO to the transformers library so buckle up

    • @Roboflow
      @Roboflow  7 місяців тому

      Uuu! Awesome! I can’t wait to see that happening. Being able to setup GroundingDINO with single pip install.

    • @zskater1234
      @zskater1234 7 місяців тому

      @@Roboflow the model is already ported, now I’m tackling the tests and documentation if everything goes well by next weekend I’ll finished it and will await the HF review

  • @shubh722
    @shubh722 8 місяців тому +2

    I am doing an object detection task and get 97.4% accuracy on the dataset using yolov5 and will be running it on an edge device. Is yolov5 too old and Should I train a yolov8 model for faster inference? As I think accuracy will be almost similar as it’s already 97.4%. Or is it task specific. If yolov5 is performing good then is there any need to change. If anyone can suggest please

    • @Roboflow
      @Roboflow  8 місяців тому +1

      I don’t think you can expect better accuracy than that. The main issue here is that YOLOv5 did not have proper Python packaging so integrating it into larger projects was problematic.

    • @omigator
      @omigator 7 місяців тому +2

      We switched from Yv8 to Yv5 because it gave better performance without any loss in accuracy on our edge devices.

    • @shubh722
      @shubh722 7 місяців тому

      @omigator What do you think about the inference times between v8 and v5? Is it real time? Also idk I found yolov5 easier to use as well. I was training it in Azure ml so was much easier to tweak the files for v5 to train there rather than v8. And the accuracy is pretty good as well.

  • @tyronetyrone2652
    @tyronetyrone2652 8 місяців тому +1

    Which framework is better to use in embedded chips?

    • @Roboflow
      @Roboflow  8 місяців тому

      which board are you using?

    • @sarathkumar-gq8be
      @sarathkumar-gq8be 3 місяці тому

      Which model will perform better in raspberry pi

  • @satyajitpanigrahy7742
    @satyajitpanigrahy7742 8 місяців тому

    Kindly, Update the ultralytics package for YOLOv4 model

    • @Roboflow
      @Roboflow  8 місяців тому

      Hi @satyajitpanigrahy7742 👋 Ultralytics is a separate team. Kindly, try to submit bug report in their repository: github.com/ultralytics/ultralytics

  • @8eck
    @8eck 8 місяців тому

    Where is DETA video? Couldn't find DETA with 100k stars... Could you please add github link here.

    • @Roboflow
      @Roboflow  8 місяців тому

      For now we only have DETR. You can find it here: ua-cam.com/video/AM8D4j9KoaU/v-deo.html
      As for star count, DETA is distributed via transformers library and that's what I used to measure community size.

  • @appliedml8665
    @appliedml8665 8 місяців тому +1

    Yolo gold is now available.

    • @Roboflow
      @Roboflow  8 місяців тому

      Yeah I know... This video was recorded before YOLO GOLD was released. I didn't have time to play with it yet. Have you?

  • @titusfx
    @titusfx 8 місяців тому +1

    In 2:28 why not just do asymptotic analysis (computational complexity analysis).

    • @Roboflow
      @Roboflow  8 місяців тому

      Hi 👋🏻! You mean use FLOPS to asses complexity?

  • @sanchaythalnerkar9736
    @sanchaythalnerkar9736 8 місяців тому +1

    Can you show actual code and real time comparison of these?

    • @Roboflow
      @Roboflow  8 місяців тому

      You mean independent time benchmark comparing the speed of all of those models?

    • @sanchaythalnerkar9736
      @sanchaythalnerkar9736 8 місяців тому +1

      Yes Exactly , a side by side comparison@@Roboflow

    • @Roboflow
      @Roboflow  8 місяців тому

      @@sanchaythalnerkar9736 not sure if it really can be side by side. To truly measure the model speed we need to make sure there is no other heavy process running on the machine. But sure we can try to make that happen. I'll add it to our TODO list.

  • @8eck
    @8eck 8 місяців тому +1

    Found only DETA with 198 stars, not 100k like in your table...

    • @Roboflow
      @Roboflow  8 місяців тому

      I responded to that question under your other comment :)

  • @8eck
    @8eck 8 місяців тому +1

    RT-DETR have 355 stars, 20k+

    • @Roboflow
      @Roboflow  8 місяців тому

      To be honest no one use implementation from original repository. RT-DETR is distributed via PaddlePaddle package. That's why we use 20k+ star count. I know it is not perfect... but like I said I decided to use the top repo that make the model accessible.

    • @8eck
      @8eck 8 місяців тому +1

      ​@@Roboflowcan you please drop some links? Thank you.

    • @Roboflow
      @Roboflow  8 місяців тому

      @@8eck take a look here: github.com/PaddlePaddle/PaddleDetection/tree/develop/configs/rtdetr and here: huggingface.co/docs/transformers/main/en/model_doc/deta

  • @adurks4846
    @adurks4846 8 місяців тому +1

    Personally, I've found yolov8 to be disappointing in the real world. I work in aerial/satellite imaging and yolov8 performs ~10% worse than scaled-yolov4. Most of the others on that list perform similarly. Overall, it seems like once you leave the types of images/targets in the COCO dataset, the metrics mean less and less for what will do well on your project.

    • @Roboflow
      @Roboflow  8 місяців тому

      Absolutely agree! I even said that in the video. I'd love to have other metric to compare models, not just mAP on COCO. The moment to start to fine-tune the model on your dataset that number means nothing. Do you care about the speed when you process aerial/satellite imaging?

    • @adurks4846
      @adurks4846 8 місяців тому

      @@Roboflow We don't care that much about speed. However, we don't typically have much data which means that the larger models seem to do worse.
      Do you guys have in-house metrics for some of these models using the roboflow-100?