Image segmentation with Yolov8 custom dataset | Computer vision tutorial

Поділитися
Вставка
  • Опубліковано 20 січ 2025

КОМЕНТАРІ • 241

  • @WelcomeToMyLife888
    @WelcomeToMyLife888 Рік тому +12

    Another awesome tutorial, showing all the necessary steps! Wish you all the best.

  • @dmitrium12
    @dmitrium12 Рік тому +2

    This is a very cool manual, thank you for it, this is exactly what I wanted to see. I have always been surprised in your channel that you post all these materials and videos for free, because you could sell them.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      Hey, glad you find it helpful! I really enjoy sharing my knowledge of computer vision with everyone! 😃💪 Selling courses is not a bad idea, though. Maybe I will do it in the future. 😊

  • @ComputerVisionEngineer
    @ComputerVisionEngineer  Рік тому

    Did you enjoy this video? Try my premium courses! 😃🙌😊
    ● Hands-On Computer Vision in the Cloud: Building an AWS-based Real Time Number Plate Recognition System bit.ly/3RXrE1Y
    ● End-To-End Computer Vision: Build and Deploy a Video Summarization API bit.ly/3tyQX0M
    ● Computer Vision on Edge: Real Time Number Plate Recognition on an Edge Device bit.ly/4dYodA7
    ● Machine Learning Entrepreneur: How to start your entrepreneurial journey as a freelancer and content creator bit.ly/4bFLeaC
    Learn to create AI-based prototypes in the Computer Vision School! www.computervision.school 😃🚀🎓

  • @AlexFilozof1996
    @AlexFilozof1996 Рік тому

    I must say that only your videos are helping me with computer vision project. All others do not work. Thank you, from Serbia

  • @TrevorSullivan
    @TrevorSullivan 4 місяці тому

    You're doing an amazing job! This is a really good video. Keep making more videos like this one!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  4 місяці тому

      Thank you for your support! I will keep on making videos like this one! 😃🙌

  • @vishalpahuja2967
    @vishalpahuja2967 Рік тому

    Wow! was waiting for this video.
    Thank you!

  • @zeynox8983
    @zeynox8983 6 місяців тому

    hi tenk u for ur video but i got a problem here. The labels in the val part should be the same as the labels we did for the images in the same directory, right? Should we convert them from the image format to numerical format in the code? Or when I downloaded the dataset, the labels for the val part came in this format: "duck 56.96 124.29848700000001 1023.36 421.58926299999996" for each image. Is this the correct format? When I did the second one, I got an error in VS Code. When I did the first one, I got the following output in Google Colab:
    "Epoch GPU_mem box_loss seg_loss cls_loss dfl_loss Instances Size
    10/10 0G 0 0 75.79 0 0 640: 100%|██████████| 5/5 [01:23

  • @daw70772
    @daw70772 6 місяців тому

    Thank you! Your tutorials are awesome!

  • @neptunelearning9249
    @neptunelearning9249 10 місяців тому

    Nice class sir. You explained to finetune YOLO in a simple way. Thank you

  • @lopezbryan7589
    @lopezbryan7589 11 місяців тому

    please , how can i get the code at 11:40 in the video? type in by myself?

  • @sugarbycand2845
    @sugarbycand2845 Рік тому

    Great tutorial, everything explained very well. You saved me :D

  • @faizalbarrisi7254
    @faizalbarrisi7254 Рік тому +3

    can you give me a step, on how to download the dataaset from the openimages and the annotation mask

    • @ialbornoz
      @ialbornoz Рік тому

      I have the same question

    • @felipe_gf
      @felipe_gf 7 місяців тому

      @@ialbornoz Did you ever figure out how to do it?

    • @haseebkhawaja1050
      @haseebkhawaja1050 6 місяців тому

      @@felipe_gf ...

  • @loaykatinah184
    @loaykatinah184 6 місяців тому +2

    Thank you for this video.
    I have a multiclass problem 10 classes + background. How can I convert the masks to yolo labels considering the right arrangement of the labels?

    • @haseebkhawaja1050
      @haseebkhawaja1050 6 місяців тому

      did you get the answer please can you help me with it I also have a multi class problem and cat seem to find the code to convert the masks to labels please !!!!!!!!!!!!!!!!!

    • @haseebkhawaja1050
      @haseebkhawaja1050 6 місяців тому

      Also how can we add background I mean if its not a separate class then how we can specify it as a background

    • @loaykatinah184
      @loaykatinah184 6 місяців тому

      @@haseebkhawaja1050 I exported my Annotation as JSON file. Then you can import the Data with JSON file to roboflow annotater. Then from the robowflow annotater the txt. files for yolov8 can be exported. Also you will finde alot of codes online that can transforme the JSON file to yolov8 txt. format or to a binary mask

    •  2 місяці тому

      i can make a hot encoded with classes numbers and a label map. class 1: 1 , class 2: 2. 1:(0,0,0) 2:(255,255,255)

  • @samb23692
    @samb23692 8 місяців тому +4

    HI, I have a project wherein, I have to segment multiple classes, how do i go about it? What changes do I need to make in the code?

    • @Γιαννηςοπαλίδης
      @Γιαννηςοπαλίδης 7 місяців тому +1

      actually this one bothers me to

    • @haseebkhawaja1050
      @haseebkhawaja1050 6 місяців тому

      if you find anything on it please help. I found one solution of ROBOFLOW which actually generates auto lables after annotating no need to go through to masks and then convert to lables (polygons)

    • @prasadsuryawanshi2351
      @prasadsuryawanshi2351 4 місяці тому

      add classes in code. if u have multiple object to detect then u need to add more class in code

    • @prasadsuryawanshi2351
      @prasadsuryawanshi2351 4 місяці тому

      you need to edit the config file in which nc = number of labels and names = names of ur labels like
      nc : 4
      names : ['duck' , 'cat' , 'cow' , 'horse' ]

    • @Switchkey1000
      @Switchkey1000 5 днів тому

      Use the color of the different classes to define which segment has which color:
      import os
      import cv2
      import numpy as np
      # Define color-to-label/class mapping
      color_to_label = {
      (25, 25, 77): 0, # Red
      (61, 245, 61): 1, # Green
      (189, 18, 0): 2, # Black
      # Add more colors and labels as needed for more different classes
      }
      input_dir = 'myinput_path'
      output_dir = 'myoutput'_path
      for j in os.listdir(input_dir):
      image_path = os.path.join(input_dir, j)
      # Load the color mask
      mask = cv2.imread(image_path)
      H, W = mask.shape[:2]
      polygons = []
      for color, label in color_to_label.items():
      # Create a binary mask for the current color
      binary_mask = cv2.inRange(mask, np.array(color), np.array(color))
      contours, _ = cv2.findContours(binary_mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
      for cnt in contours:
      if cv2.contourArea(cnt) > 200:
      polygon = [label]
      for point in cnt:
      x, y = point[0]
      polygon.append(x / W)
      polygon.append(y / H)
      polygons.append(polygon)
      # Save the polygons with labels
      with open('{}.txt'.format(os.path.join(output_dir, j)[:-4]), 'w') as f:
      for polygon in polygons:
      f.write(' '.join(map(str, polygon)) + '
      ')

  • @vikashkumar-cr7ee
    @vikashkumar-cr7ee Рік тому

    Dear Tutor,
    Greetings! I am getting the following error while running the code at time stamp 44:35
    'for j, mask in enumerate(result.masks.data):
    AttributeError: 'NoneType' object has no attribute 'data'
    Can you please help me out ?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, it may be that you are not detecting any objects in that image. Have you tried with other images?

    • @vikashkumar-cr7ee
      @vikashkumar-cr7ee Рік тому

      @@ComputerVisionEngineer I tried on three or four other images in val set, but I am getting the same error

    • @SethmiyaAbeyrathna
      @SethmiyaAbeyrathna Рік тому

      Same problem for me

  • @aravindnag1803
    @aravindnag1803 6 місяців тому

    Thank you so much for the label creation code!!

  • @bbb-xu7wx
    @bbb-xu7wx Рік тому +1

    thank you for the video again amazing job

  • @LiuJason-m3n
    @LiuJason-m3n Рік тому

    Thank you so much for your kindness tutorial! Hope you everything well!!!!!!

  • @FazriGading
    @FazriGading 8 місяців тому

    Wow. you are amazing bro. Thank you so much for teaching me this!!!

  • @Copelion
    @Copelion 5 місяців тому

    Thanks for the nice tutorial. :D

  • @aakashbhosale9140
    @aakashbhosale9140 Рік тому

    Thanks for the detailed explanation.

  • @ArushiGupta-t4j
    @ArushiGupta-t4j Рік тому +1

    When I'm exporting the annotated files using segmentation mask 1.1, the zip I'm getting is just a single text file. Any idea what else I can do?

  • @adhammahmoud5574
    @adhammahmoud5574 8 місяців тому

    Thank you so much for this wonderful video. It helped me so much.

  • @4Tjohny
    @4Tjohny Рік тому

    Thank you. Awesome tutorial.

  • @haseebkhawaja1050
    @haseebkhawaja1050 6 місяців тому

    hey there does this code work for two classes or more than 1 class (mask to data points etc labels) please help how can we modify it to include more than 1 class please

  • @zaidahmed4069
    @zaidahmed4069 5 місяців тому

    If I have a segmented mask can I zoom in or out the masked object while the unmasked remaining image stays same ?

  • @johnton96
    @johnton96 4 місяці тому

    I need samples of C. elegans nematodes. Are they existing in the dataset?

  • @aneerimmco
    @aneerimmco 6 місяців тому

    this was very informative. Thank you

  • @epicfishguy
    @epicfishguy Місяць тому

    Hey do you have a video or some tipps for Yolov11 Segmentation?

  • @rosemutegi8830
    @rosemutegi8830 Рік тому

    Getting this error what could be the issue.
    IndexError: list index out of range when i train my .yaml file

  • @yassinebouchoucha
    @yassinebouchoucha Рік тому

    @18:50 starting with `epoch=1` to make sure everything is well established is a Golden Rule !

  • @lewislord9341
    @lewislord9341 Рік тому

    please sir i have an error and i didnt find the solution
    its not possible to find the images, i dont know if the problem is on my config file. i have a big problem with it

  • @muhammadgulfam1869
    @muhammadgulfam1869 9 місяців тому

    In the config file, what is is "nc" variable referring to? Also what are the instructions for the "names" variable, does it contains object names? do we decide what we want to call the object?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  9 місяців тому

      nc is the number of classes, yes you can name the objects whatever you want, the names won't affect the training process 🙌

    • @muhammadgulfam1869
      @muhammadgulfam1869 9 місяців тому

      @@ComputerVisionEngineer Thank you

  • @Di0n-r5u
    @Di0n-r5u Рік тому +1

    Hi, I was just wondering how you installed the specific segmentation masks of ducks from open images, I couldn't figure it out on my own. I would greatly appreciate it if you lend me a hand

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, take a look at the 'Annotations and metadata' section of this page storage.googleapis.com/openimages/web/download_v7.html 🙌

    • @Di0n-r5u
      @Di0n-r5u Рік тому

      @@ComputerVisionEngineer Should I only download the mask data? If so, after that have you written a script which creates a list of the image ID's and extracts the necessary annotations. Like in your object detection video, could you share it with me?

    • @Di0n-r5u
      @Di0n-r5u Рік тому

      I have resorted to another solution after I couldn't figure out the exact thing I wanted to make: I downloaded all the zip files containing all the masks, all mask files have a notation like __.png for example: 00a3d94534a1b356_m0k4j_97c014cd.png I then wrote a script in python to filter only the masks with the wanted class label utilising multiprocessing and multithreading, the script saves the files as .png omitting the other parts. After running the script, I was able to obtain the binary masks only for my selected class. Now I'm going to annotate them using the polygonization script which you've provided, I might modify it a little. Anyway, thanks for your help!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@Di0n-r5u Glad you solved it!

    • @abhisheknegi2888
      @abhisheknegi2888 8 місяців тому

      @@Di0n-r5u hey can you please share the script

  • @zaidahmed4069
    @zaidahmed4069 Рік тому

    Can you help me with exporting this model because I'm facing an error while exporting the model into tflite or pb format

  • @inquisitiverakib5844
    @inquisitiverakib5844 Рік тому

    how can we get .json file from this annotated image which will carry the co-ordinates of the polygon mask as text format???

  • @zaidahmed4069
    @zaidahmed4069 6 місяців тому

    Does YOLO supports 3d image segmentation ? I have some 3d images in .obj format and want to segment them

  • @hamedzeinaly
    @hamedzeinaly 5 місяців тому

    great. What if we have 2 objects? can you help with that too?

  • @nomaanrizvi6561
    @nomaanrizvi6561 Рік тому

    i need the code for polygons to mask...can i get it?..please

  • @jacobmars1902
    @jacobmars1902 8 місяців тому

    could i use A111 inpaint anything to create the masks for the training?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  8 місяців тому

      Not sure if I understand, but no, I don't think you could do that.

    • @jacobmars1902
      @jacobmars1902 8 місяців тому

      @@ComputerVisionEngineer ok, thanks

  • @alphagenerativeai
    @alphagenerativeai 9 місяців тому

    How can I labeled 2 classes and configured training. We hope to receive feedback from you! Thank you, from Viet Nam

  • @mohamedkeddache4202
    @mohamedkeddache4202 Рік тому

    thanks you are very helpful, keep going bro

  • @afjamo
    @afjamo 6 місяців тому

    Thank you for this cool instruction!! I am annotating mice in a cage group-housed. Sometimes, animals go under the bedding and are scarcely visible. In case of no annotation made for an image, I do not get an annotation file created for the image. As a result, I have a mismatch in the number of image files and annotation files. Would this be a problem? I could delete the images that do not have any annotation. But I would rather keep them since no annotation is also a kind of annotation I guess. What would be your suggestion? Thank you in advance!!

    • @afjamo
      @afjamo 6 місяців тому

      Hi Philippe! I already tried without deleting any images. So the mismatch stayed. But it worked :)

  • @ahmadfaraz1288
    @ahmadfaraz1288 10 місяців тому

    Hey, thanks for the tutorials. I am new to computer vision. Currently, I am preparing a data set on the dry wall construction process. We have to detect dry wall stages like: Stud Installation, Gypsum Panelling, electrical works and plastering. However, I have some confusion about labelling the data. My question is: do I need to label each object in the image at the same time? Or should I focus on a single object in each image? Besides, we have only 250 images from the construction sites; are these enough for training?

  • @Jugeenias
    @Jugeenias Рік тому

    What could be the reason for the following issue:
    for j, mask in enumerate(result.masks.data):
    AttributeError: 'list' object has no attribute 'masks'

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      If no detections were found result may be an empty list. Print result and see how it looks like. Let me know how it goes. 🙌

  • @simonbaumgartner6612
    @simonbaumgartner6612 8 місяців тому

    Great video! I wonder if you could speak to the data vizualization. How do you create the masks? And say I only have the Yolov8 annotation format (.png and corresponding .txt), any recommendation on how to visualize it by chance?

  • @דניחלבניקוב
    @דניחלבניקוב Рік тому

    thanks for the response earlier, how can I expend it to a data set with multiple segmentation classes?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      You would need to edit the config file and create the annotations accordingly. I may do a video about multiclass detection in the future.

  • @santarosajoe2
    @santarosajoe2 11 місяців тому

    it might seem simple to you, but I'd love to see just one example of utilizing a webcam.
    It seems using the webcam is the most interesting use of YOLO, instead of cycling thru a bunch of still jpg images.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  11 місяців тому

      Ok, thank you for your feedback, next time I work with Yolo I will use a Webcam. 🙌

  • @usamasherazi4598
    @usamasherazi4598 Рік тому

    can i use CVAT Anotation website for yoloV5

  • @amjoode2
    @amjoode2 7 місяців тому

    When creating the labels, do you devide by the dimensions of the mask or to the whole image?
    I am trying to adapt your label creation process to handle multiple masks in one image.

  • @joshteixeira6750
    @joshteixeira6750 Рік тому

    You are a hero

  • @vladimirkuzmenkov7285
    @vladimirkuzmenkov7285 Рік тому

    Good stuff, thanks

  • @DoBaMan77
    @DoBaMan77 5 місяців тому

    Hi Philippe, awesome tutorial! I really like your style 😊 And I have a question for you. What is the best way to make a dataset for damage detection on cars, machined products, or imperfections/dirt on railroads? Semantic segmentation like you did in this video or object detection like you did with tha alpacas? Regards and keep up your great work. Dom😊

  • @khoahuynh2809
    @khoahuynh2809 Рік тому

    My dataset has two classes and after using your python file to convert, I found that it just has only one class in txt file (class which labels 0) although in the image has clearly two objects in two classes. How can I fix this error?

  • @aadilarsh.s.r4098
    @aadilarsh.s.r4098 7 місяців тому

    Hello sir, just watched your video and it is great in case of a simple query let us take training should we include images which doesnt have ducks also? or is it fine that we use all images with ducks for training which is best for fine tuning the model to detect ducks in the images. In simple words for purpose of training should dataset contain all imgaes with ducks in it or a mixture of images with and without ducks.

  • @logos1396
    @logos1396 Рік тому

    Amazing video 👍🏻👍🏻👍🏻

  • @I77AGIC
    @I77AGIC Рік тому

    But how do we export from cvat into yolo format if we have more than just one class?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      I will try to make a video about multi class image segmentation in the future 🙌

    • @I77AGIC
      @I77AGIC Рік тому

      I ended up exporting as COCO then using roboflow to convert to YOLO format. Now I'm just using roboflow to annotate. I like it better and it actually lets me export to YOLOv8.

  • @Abhinavnair1103
    @Abhinavnair1103 8 місяців тому

    hey, your way is working perfectly!!, but when I am taking multiple objects it is classifying all of them as one label. I believe the problem is in masks_to_polygon.py, I did the same thing as instructed by you for config.yaml. Can you tell me where I can be wrong

  • @SethmiyaAbeyrathna
    @SethmiyaAbeyrathna Рік тому

    Hello my weights folder is empty how can I overcome this problem it may be an issue of my train coding

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      Probably your training process is not being completed. Do you see any error? Have you tried to train the model from a google colab?

    • @SethmiyaAbeyrathna
      @SethmiyaAbeyrathna Рік тому

      @@ComputerVisionEngineer thank you very much no I didn't try using colab I used Spyder

    • @SethmiyaAbeyrathna
      @SethmiyaAbeyrathna Рік тому

      Thank you I used colab and I got the weights

    • @SethmiyaAbeyrathna
      @SethmiyaAbeyrathna Рік тому

      I trained model in colab then I downloaded but when I use the Spyder for prediction using that last weight model got an error called no attribute 'data' how can I solve this

    • @SethmiyaAbeyrathna
      @SethmiyaAbeyrathna Рік тому

      Hello can we use Spyder for this without using colab

  • @adithin740
    @adithin740 9 місяців тому

    i am not able to download dataset

  • @mpfmax0
    @mpfmax0 Рік тому

    would the segment anything model be better for this task? I'm trying to segment plants from a herbarium collection, they are full dried plants pressed on white paper sheets and scanned into digital images, but there is a paper label with collection data and a stamp getting in the way of my automatic segmentation attempts. Im a bit confused on what would be the best method to accomplish the task of extracting the plant from the background ( I also may want to segment pieces of the plant, like leaves, flowers, stems). So far it seems to me the best method would be to train YOLO to detect the plant and draw a bounding box around it and then use SAM to make a mask of the plant inside the box (or multiple masks for the pieces of the plant) . Does this make sense?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      It makes sense, although you would need a yolo model trained on detecting plants, do you have one?

    • @mpfmax0
      @mpfmax0 Рік тому

      @@ComputerVisionEngineer I do now. I trained it to draw bounding boxes around the plants using your other tutorial video, it performs really well. Now I'm going to try to use the bounding boxes as prompts for SAM (segment anything model) to extract detailed masks of the plants. Wish me luck!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@mpfmax0 Good luck! Let me know how it goes! 😃

  • @juliogomez6065
    @juliogomez6065 Рік тому +2

    Hola Felipe, estuve trabajando sobre los archivos que nos compartes, lo adapté a mis necesidades, previamente hice todo el etiquetado en CVAT, pero me queda una duda ya que el training al parecer no me está funcionando: En el archivo "config.yaml", hay dos líneas que no explicaste: "nc:1" (que supongo es la cantidad de classes generadas en CVAT, y la línea "names:['...'] (Supongo que son los nombres asignados a las classes en CVAT). El problema es que asumiendo esto, lo adapto a mi necesidad (nc:7 names: ['Sin arandela', 'Arandela OK', 'Arandela rota', ...ETC ], y en el archivo run - weights, con el collage de imágenes que me arroja, solo me aparece la primera etiqueta, siendo "Sin arandela". ¿Es posible que me digas qué puedo estar haciendo mal? Hice 100 epochs, donde en TRAIN tengo 187 fotografías, y en VAL tengo 46 fotografías.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hola Julio, 187 + 46 imagenes parecen pocas para entrenar un algoritmo de este tipo, especialmente considerando que tenes 7 clases. Adaptaste las mascaras para trabajar con 7 clases?

    • @juliogomez6065
      @juliogomez6065 Рік тому +1

      @@ComputerVisionEngineer Hola, Felipe. Cuantas consideras que pueden ser una buena base para desarrollar un buen script? El problema es que en este momento soy únicamente yo en el proyecto, por lo que no puedo tener una cantidad muy grande, al menos hasta demostrar resultados y que me asignen una persona adicional. Sobre adaptar las máscaras, no entiendo a qué te refieres, apenas inicio en este mundo. No sé si responda tu pregunta, pero a lo largo del etiquetado en CVAT, hice uso de todas las etiquetas, habiendo aproximadamente de 2 a 3 etiquetas por imagen.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@juliogomez6065 El tutorial de este video es para semantic segmentation de una sola clase. Para segmentación multi-clase hay que estudiar la documentación de yolov8 para ver cómo hacer las máscaras. Las máscaras que usé en el video son binarias y solo sirven para segmentación de una sola clase (blanco=objeto, negro=fondo). Sobre la cantidad de imágenes, todo depende... pero te sugiriía al menos unas miles, por ejemplo en este video uso ~3000 imágenes para hacer segmentación de una sola clase.

  • @saikiran3964
    @saikiran3964 Рік тому

    Is there any reference on how to save the segmented object as it's own image?

  • @stevegreen5638
    @stevegreen5638 Рік тому

    what is speed on segmentation, will I be able to use it on live video?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      You can make it work on ~real time if not mistaken, let me know how it goes. 🙌

  • @shafagh_projects
    @shafagh_projects Рік тому

    Thank you so much for your lovely content. Indeed, it is very informative. However, I have a question about data handling. I noticed in the images folder you shared, the duck images are in both the train and val folders. Shouldn't it be that the train folder contains only duck images and the val folder is with non-duck images? Looking forward to your clarification. Thanks!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      The val folder contains the validation data, this is how you validate the model. If the model detects ducks, it's appropriate to use images with ducks as validation data.

    • @shafagh_projects
      @shafagh_projects Рік тому

      many thanks for your prompt response but I have a big challenge of using a webcam to detect ducks using the following line:
      model.predict(source=0, show=True, conf=0.2)
      it has a huge lag.
      can you help me how to resolve this to be real-time detection?

  • @fawazmirza4646
    @fawazmirza4646 Рік тому

    I followed everything exactly but for some reason my val_batch0_pred has no segmentations on it. Even though the val_batch0_labels is segmented perfectly. I think this is probably the reason why I'm getting "AttributeError: 'NoneType' object has no attribute 'data'" when I try running the code. The object I'm trying to detect and the images given are very simple and easy, the model should not be struggling with this at all. What can I do?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, are you using the same dataset as I did in the video? How many epochs are you training?

    • @fawazmirza4646
      @fawazmirza4646 Рік тому

      @@ComputerVisionEngineer no I'm using my own data set which is alot smaller, because I don't have many images of the thing I'm trying to detect, because it's of a proprietary ph indicator test so not many images exist, and so getting more is not an option.
      I have 6 images for training, and 4 for validation. I tried with 10, 50 and 100 epochs but still not a single detection on val_batch0_pred

    • @fawazmirza4646
      @fawazmirza4646 Рік тому

      @@ComputerVisionEngineer I've seen other people on github who have had more images and everything have the same issue, but non of them really got an answer. Or at least not one that is relevant in my case.
      My validation pictures and very similar to the training ones so the model should have no issues, idk what's wrong.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@fawazmirza4646 oh I see. 10 images is usually not enough to train this type of model. Training for that many epochs on 6 images will produce overfitting.

    • @fawazmirza4646
      @fawazmirza4646 Рік тому

      So what do you suggest I do with the small data I have? What machine learning method, if any should I try? Or is there a way to make yolov8 work for my case?

  • @MdFaysalAhamed-q1m
    @MdFaysalAhamed-q1m Рік тому

    I have a question, I see you used the masked label (txt) data for training, What is the process to train the model directly using mask and original samples without any txt data on YoloV8? I have mask image but don't have any text data.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      You need to convert the masks into the txt files in order to train the model with yolov8. I have a Python script in this project's github repository that may help you to do that. 🙌

    • @mdfaysalahamed4605
      @mdfaysalahamed4605 Рік тому

      Thank you brother. I found the file named as "masks_to_polygons". ❤

  • @penguinie4325
    @penguinie4325 Рік тому

    hello! thank you for your video! I have a question regarding using the prediction to predict segmentation from the image. From my results, It states it indicates 2 ducks in my image (which has 2 ducks) however, the outcome only displays 1 image segmentation. What should I do if I want both image segmentations to be predicted? Thank you!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, when you say the outcome only displays 1 image segmentation you mean it only covers one of the two ducks?

    • @penguinie4325
      @penguinie4325 Рік тому

      ​@@ComputerVisionEngineer yes, can the code detect 2 ducks instead? or is it only for one duck segmentation detection...

  • @hongquangnguyen3230
    @hongquangnguyen3230 6 місяців тому

    This is very cool, thanks a lot! Could you make a similar video for the SAM?

  • @lobo5727
    @lobo5727 Рік тому

    no attribute called data... Error

  • @sto2779
    @sto2779 Рік тому

    8:33 - Its called duck feet. The arms of the duck is the wings.

  • @jesusmtz29
    @jesusmtz29 Рік тому

    do you need an account with cvat even when you host it locally?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      If I am not mistaken when hosted locally you don't need an account with cvat, but each user needs to create an account in your locally hosted cvat app. 🙌

  • @kagadevishal5008
    @kagadevishal5008 Рік тому

    what if there are more than 1 classes, will the same method to convert to polygon will work?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +2

      If there are more than one classes, the same script will not work, you would need to adjust it to deal with multi class masks. 🙌

    • @kagadevishal5008
      @kagadevishal5008 Рік тому

      @@ComputerVisionEngineer any Idea how can we do that, I tried but not able to find concrete solution.

    • @haseebkhawaja1050
      @haseebkhawaja1050 6 місяців тому

      @@kagadevishal5008 me also tried many things but at the end went with ROBOFLOW which automatically does this

  • @안녕-k8u3t
    @안녕-k8u3t Рік тому

    Hi! thanks for the video, it's helping too much!!!
    I have a question,
    What version of tensorboard and numpy do you use?

  • @craftman147100
    @craftman147100 Рік тому

    I noticed that images\train has around 1800 files - unlike in the video, and labels\train has 3965 files. Is that an issue?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, you should have the same number of images and label files. Yolov8 will probably trigger an error in any other case.

  • @larafischer420
    @larafischer420 Рік тому

    I have a question: How do you download the images from google datasets? Can you make a video explaining that process? Seems like a dumb process, but I really don't know how to do that

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      I am currently preparing a Python script to download a semantic segmentation dataset from the google open images dataset. It will be available in my Patreon soon. 🙌

    • @larafischer420
      @larafischer420 Рік тому

      @@ComputerVisionEngineer

  • @mithactatus
    @mithactatus Рік тому +1

    I am predicting watermelons, pineapples and blackberries. My model can predict the objects but call them all watermelons. Do you have any idea?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Take a look at your training data. Perhaps you need to train the model with more data. 🙌

    • @mithactatus
      @mithactatus Рік тому

      i fixed it. it was because of the mask transformation to yolo files. the txt files had all 0 as the class (the very first number of the .txt file). and i manually changed 0's according to the images with the same name @@ComputerVisionEngineer

  • @zeinabelsharkawy9014
    @zeinabelsharkawy9014 Рік тому

    Hi, thank you for this video. can you convert the Yolo label to a binary mask?

  • @akifakbulut765
    @akifakbulut765 Рік тому

    Each frame reading interval is 53 milliseconds in ultralytics, how can we reduce this interval to 33 milliseconds.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, do you mean the inference is taking 53 ms per frame?

    • @akifakbulut765
      @akifakbulut765 Рік тому

      @@ComputerVisionEngineer Yes, the elapsed time between each frame is 53 milliseconds

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@akifakbulut765 are you using a GPU?

    • @akifakbulut765
      @akifakbulut765 Рік тому

      @@ComputerVisionEngineer No

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@akifakbulut765 using a GPU would be a good way to try to reduce the execution time, if you don't have a GPU in your local computer you could consider using something like an EC2 instance from AWS.

  • @revuriakhil3148
    @revuriakhil3148 Рік тому

    Is it mandatory to convert masks to polygon or we can directly do labeling in polygon template and can we train that

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Converting masks to polygon is necessary in order to do semantic segmentation with yolov8. 🙌

  • @aurum9864
    @aurum9864 Рік тому +1

    Thanks for the video! Do you have a masks_to_polygons script that would also work for multiple segmentation classes? Or do you know where I would find one? Have been looking for ages..

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      I don't have a multiclass masks_to_polygons script, but I think you could create one taking my one class script as baseline. Maybe chatgpt can help you adapting the script to multiclass. 💪💪

    • @quillaja
      @quillaja Рік тому

      I'd annotate my images in Inkscape or Illustrator, using paths as the masks, save it to SVG, then just convert SVG to YOLO format. Straight text-to-text conversion, more or less. All the info you need to normalize the vertices is in the SVG. To create multiple classes, you could group the classes, or probably the better thing would be to give each path a custom xml attribute for the object class.

  • @govindaagrawal816
    @govindaagrawal816 Рік тому

    Just a query, If I wanted to train it on multiple classes, how would I go about editing the config file?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Edit the 'nc' field to your number of classes, and edit the 'names' field so it contains all your class names. In case of multiclass segmentation you also need to edit your masks. 🙌

    • @haseebkhawaja1050
      @haseebkhawaja1050 6 місяців тому

      @@ComputerVisionEngineer please can you share code for masks to labels also please for multi class. Help would be much appreciated

  • @enricobovo2184
    @enricobovo2184 Рік тому

    Hi and thank you for your video! I have noticed that, when annotating a dataset of multiple images with CVAT with two labels, the export phase goes wrong and not all the segmentation masks are created correctly. Some of them contain for example just one class of objects even though I had previously annotated objects of different classes in that picture. Do you know how to solve it? Is there any other annotation tool that allows to export the images of the segmented masks?
    Thank you

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, I just tried to do it and everything seems fine with a couple of images. I annotated two labels and exported it as 'Segmentation mask 1.1', are you using this export format?

    • @mithactatus
      @mithactatus Рік тому

      you can just manually correct the .txt files. first number in the file represents the class. all of them might be 0 in your case.

  • @muhteguhadhiputra6041
    @muhteguhadhiputra6041 Рік тому

    penjelasan yang bagus👍👍

  • @ЦинциннатЦ-т8х
    @ЦинциннатЦ-т8х Рік тому

    Great job, man! Thanks a lot! Btw, does this segmebtation project work in c2.capread? Or can i use it for segmentation objects in video ?

  • @santarojoe1
    @santarojoe1 Рік тому

    Very well done , however my code still errors that .
    The only things you aren't explaining very well is WHAT goes inside each of the directories. You've explained that its " images to train the model" and " to validate the model", however I cant tell if IMAGES\TRAIN contains
    1) images the masks were trained from
    2) images of the masks
    3) or bulk unknown images to be analyzed
    IMAGES\VAL
    you said contains "images to validate the training model" , however I dont know which images those might be - 1)original bulk of all ducks known and unknown, 2) images of masks 3) or just the images that were used to create the masks)
    ...all same above questions with
    LABELS\TRAIN
    LABELS\VAL (you never mentioned that any files are inserted into this directory, or if this is the output )
    Then,
    are any of the folders we created empty?
    finally, it would be great to see the results found in your runs\detect\train folder.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hi, you can download the data from the github repository if I am not mistaken.

    • @santarojoe1
      @santarojoe1 7 місяців тому

      @@ComputerVisionEngineer the images are not in github, nor any folders

  • @jay_9070
    @jay_9070 11 місяців тому

    Is the validation dataset a separate data set?? Is it necessary?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  11 місяців тому

      It is not absolutely necessary, but it is a good practice to use a different dataset as validation set.

  • @mscussiatto
    @mscussiatto Рік тому

    Thank you for the tutorial, it's one of the best i've seen in yolo. Would you be able to provide me some support on how to get the RGB masks from the inferences crop results? Cheers

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Do you mean how to crop the original rgb image in the region given by the predicted mask?

    • @anisiobinzubechi
      @anisiobinzubechi Рік тому

      This is one of the most informative I have seen on this topic. Yes, I would also like to know how to crop out the original region given by the predicted mask. I have been having a hard time with that
      @@ComputerVisionEngineer

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@anisiobinzubechi I will try to make a video about it soon.

  • @chispun2
    @chispun2 Рік тому

    That part of the duck is just called the webbed foot (palmeado). I have some questions I would like to ask you, computer vision related, because I don't know who else could I ask

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      Oh, webbed foot! 🦆 Cool, thank you!
      Sure, you can ask me on discord. 💪🙌

    • @chispun2
      @chispun2 Рік тому

      @@ComputerVisionEngineer No sabía lo del discord! Allá voy! Gracias!!

  • @marsrover2754
    @marsrover2754 Рік тому

    @ComputerVisionEngineer Can you make a video on finetune SAM model(Segment Anything Model) on custom dataset.

  • @OceanAye
    @OceanAye Рік тому

    Thanks a lot for the tutorial, however, I seem to run into the same problems as @dmitrium12. Somehow the runs/segment/train file does not mask predictions and thus the graphs with train/loss and val/loss is just a dot in the middle of the grapth.
    I have used your dataset and followed every step.

    • @OceanAye
      @OceanAye Рік тому

      sorry meant @guillemcobos1987

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      If evaluation plots are only a dot in the middle of the graph it means you are training for only 1 epoch. Increase the number of epochs and you should be able to see a different plot. 🙌

  • @guillemcobos1987
    @guillemcobos1987 Рік тому

    Hello! I found your video very interesting, and it's helping me a lot in my new job as a vision engineer. I managed to train the duck-segmenting algorithm following your steps - amazingly clear! I can see how it makes some batch predictions in the 'runs' folder for some of the images in 'val'. However, when I import the model 'last.pt' and I try to make predictions, I consistently get 'no detections' and 'masks: None'. Do you know what could be going on? Thanks a million😊

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey Guillem, I am glad the video is helping you in your new job! 😃 How many epochs did you train the model for? Are you using the exact same dataset as I use in the video?

    • @sarthakdas815
      @sarthakdas815 Рік тому

      @@ComputerVisionEngineer Hi getting the same error as stated above. No masks are gnerated for the predictions after 10 ephors using the same data set and code you have given not sure whats going wrong.

    • @ebrahim-nourmohammadi
      @ebrahim-nourmohammadi Рік тому

      @@sarthakdas815 I faced the same issue but solved it. That was because I used the masks that are in the "SegmentationObject" folder, we should use the masks that are in the "SegmentationClass" folder.

  • @gizem3166
    @gizem3166 Рік тому

    Hi, thanks for the video, its helping too much. Can we crop the segmented object instead of taking a mask? Do you have another video for this?

  • @santarojoe1
    @santarojoe1 Рік тому

    and would be cool if had small cv2 script that you waive a picture of a duck under cam, and it highlights the duck.

  • @vishalpahuja2967
    @vishalpahuja2967 Рік тому

    Hi,
    Can you make a video of getting output such as on your thumbnail?
    Thank you!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      Hey, next time I make a video about semantic segmentation I will make the output to look like that 💪

  • @thisurawz
    @thisurawz Рік тому +1

    hello, im new to computer vision and I have a question. what is the most suitable algorithm/s or method/s for image steganalysis to detect the changed pixels in the stego image? i want to segment only the changed pixels in the stego image? can I use semantic segmentation also for this kind of problem?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, I don't think that is a problem you can solve with semantic segmentation 🤔, but you can try! 😃 Regarding what are the most suitable methods for image steganalysis, I recommend you do a {Google, Github, Google Scholar} search, it is a field I haven't been involved in. 💪💪

    • @thisurawz
      @thisurawz Рік тому

      @@ComputerVisionEngineer Okay, i'll search. thank you. btw, I really appreciate your effort in making really valuable videos related to CV for free. I learned so much from your channel. this is one of the best channels with real-world implementations for CV that I've seen on UA-cam. keep going 💪💪!!

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      @@thisurawz 😃 Thank you so much for your support! 💪🙌

  • @joydeepkundu509
    @joydeepkundu509 Рік тому

    Can anyone share the dataset?

  • @pavanmhamsa
    @pavanmhamsa 9 місяців тому

    Thanks a Lot

  • @oi4252
    @oi4252 Рік тому

    have you seen meta's SAM

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      I have! Although I haven't tested so far. I should make a video about it later on! 💪

    • @oi4252
      @oi4252 Рік тому

      @@ComputerVisionEngineer thank you for your videos. they are truly amazing and very very educative and fun to watch

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      @@oi4252 😊 I am so happy you enjoy them!

  • @vishalpahuja2967
    @vishalpahuja2967 Рік тому +1

    Here if we want to infer on an image , so how to do it?
    I tried doing:
    from ultralytics import YOLO
    # Load a model
    model = YOLO('/content/yolov8n-seg.pt') # load an official model
    model = YOLO('/content/runs/segment/train/weights/best.pt') # load a custom model
    # Predict with the model
    results = model('image.jpg') # predict on an image
    Output:
    image 1/1 /content/gdrive/MyDrive/segmentation/data/images/train/11-03-22-ROHAN SANGHVI-DAUGHTER_S BEDROOM_page-0001.jpg: 480x640 (no detections), 10.6ms
    Speed: 0.6ms preprocess, 10.6ms inference, 0.5ms postprocess per image at shape (1, 3, 640, 640)
    it ran successfully but cannot see where is it saved.
    if the code is wrong pls update on how i can change it for inferencing on a single image?
    Thank you.

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому

      Hey, take a look at the tutorial. In the last chapter I show you how to make predictions with the model you trained. 🙌

    • @vishalpahuja2967
      @vishalpahuja2967 Рік тому +1

      @@ComputerVisionEngineer yes after doing that masks can be seen of that shapes , what to do if i want want my segmentation on my test image or actual image so that bounding box can be seen in my output with segmented part?

    • @ComputerVisionEngineer
      @ComputerVisionEngineer  Рік тому +1

      @@vishalpahuja2967 I see, you would like a visualization as the one in the thumbnail, right? img + mask on top + bounding box, is that it? you can visualize the mask on top of the image by applying an overlay, take a look on how to do that, and about the bounding box take a look at my video on object detection with yolov8 + object tracking, the first part is about how to get bounding boxes with a yolov8 model and how to draw the bounding box on the image. 💪🙌