PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Поділитися
Вставка
  • Опубліковано 18 гру 2024

КОМЕНТАРІ • 41

  • @guruprasadsomasundaram9273
    @guruprasadsomasundaram9273 2 роки тому +8

    I remember this being such a landmark achievement in 3D computer vision. I used to work with point clouds and meshes and this technique just opened a lot of doors for us.

  • @qudezheng
    @qudezheng 7 років тому +15

    Brilliant paper! While 2D image classification is more or less a solved problem. There're lots of exciting things happening in the 3D space

  • @tksl9236
    @tksl9236 6 років тому +4

    读了这个paper,感觉很有创新性

  • @erfanmoshiri3117
    @erfanmoshiri3117 2 роки тому

    such a string and backed theory! probably will be a lead to many other important projects

  • @JeracVideo
    @JeracVideo 9 місяців тому

    Hi, I'm searching for software that can detect eyes, nose, and ears on a human face in a point cloud and return the coordinates of those features. Then, I'd like to merge a second point cloud according to these coordinates. I'm not sure where to start. Can someone give me some hints? Thanks!

  • @endingalaporte
    @endingalaporte 4 роки тому

    a new wide range of applications open, ahead of us

  • @jimmyshenmusic
    @jimmyshenmusic 6 років тому +8

    This work is cool and it creates a new world for the 3D vision understanding. Surprisingly, it even did not use the CNN.

    • @gledsonmelotti7497
      @gledsonmelotti7497 6 років тому

      Hello, could you tell me why this PointCloud method is better than using a 3D CNN (advantages and disadvantages)?

    • @jimmyshenmusic
      @jimmyshenmusic 6 років тому +4

      3D CNN is computation expensive. We do have some works are using 3D CNN. You can check the leader board of the ModelNet 40. Currently, we have method: based on voxel(typical using 3D CNN), mesh(seems not so popular), raw point cloud(it is becoming popular) and project 3D image to 2D then using multi-view 2D CNN. We can not say which one is better. From the lead board, it seems that the multi-view 2D CNN can get the best result. However, the approach based on point cloud is interesting as in the RGB-D and outdoor LiDAR sensor, the data representation is either point cloud itself or very friendly to raw point cloud. That is one of the most important reasons that this work is interesting. Another important reason is that this work shows some theoretical analysis which is very useful to help us have a deeper understanding about their framework.

    • @gledsonmelotti7497
      @gledsonmelotti7497 6 років тому

      Hi, how are you? Thank you very much for the explanations about my doubts. Thank you very much. His explanations are very interesting and useful.

    • @mattiafucili7597
      @mattiafucili7597 6 років тому

      What do you mean by "it even did not use the CNN."? If you look at the implementation code in TensorFlow they use a 2Dconv layer everywhere before the max pooling operation. So, the problem is mine and I can't understand the code or did you say that because in the image of the net they wrote MLP instead of 2Dconv?

    • @jimmyshenmusic
      @jimmyshenmusic 6 років тому +1

      Mattia Fucili To be honest, I didn't look the code. Based on the paper, the author mentioned only FC is used.

  • @yifeipeng389
    @yifeipeng389 Рік тому

    Does the model takes labels to do segmentation?

  • @quantummath
    @quantummath 7 років тому +1

    Great Paper! I'm using it for a project I'm currently taking part in.

    • @noemi9351
      @noemi9351 6 років тому +1

      Hi, I am starting to use Point Net for a project of mine as well and just saw your comment here. Do you have any information available online about your project, e.g. Research Gate? I'm curious to see all the different projects with PN at the moment. Best

    • @chillfuturist9668
      @chillfuturist9668 5 років тому +2

      @@noemi9351 I am working on a project using this as well. Are you still working in this space?

  • @金属光泽
    @金属光泽 5 років тому +5

    感觉这两篇paper的引用爆炸了

  • @vivekgr3001
    @vivekgr3001 4 роки тому

    Excellent Explanation!

  • @gracebear
    @gracebear 3 роки тому +1

    Go, Charles Qi ! :)

  • @sanjanakambar2947
    @sanjanakambar2947 4 роки тому

    What happens if we add Global Average Pooling after feature transform?

  • @charlieli7486
    @charlieli7486 5 років тому

    Great work!

  • @shindepratibha31
    @shindepratibha31 2 роки тому

    Brilliant paper and work. I am thinking of incorporating rgb in pointnet++ model. How can I do that? Could you please help with some suggestions?

    • @JamesDear-b3q
      @JamesDear-b3q 3 місяці тому

      did you manage to figure this out? it would greatly help me thanks

  • @kalpanashowroom2576
    @kalpanashowroom2576 7 років тому

    What is the meaning of critical points?

    • @bindukarki5616
      @bindukarki5616 6 років тому

      Not entirely sure but I think those are the points that are distinct and best represents/distinguish the objects OR in another word key-points in point cloud library... check this: stanford.edu/~rqi/pointnet/ or check for key points in point cloud library for some understanding..apart from being distinct key-points and critical points might be same

    • @Rorypandanda
      @Rorypandanda 6 років тому

      Yeah, I believe those are the points most responsible for influencing the classification decision of the network.

    • @jimmyshenmusic
      @jimmyshenmusic 6 років тому

      The critical points are inferred from the learned global features after the max pooling. From the paper, the critical points will provide the main contribution on the classification task. From the vis result, critical points can well represent the geometric shape of the cloud points.

    • @gledsonmelotti7497
      @gledsonmelotti7497 6 років тому

      Hello, but how to get the critical points?

    • @jimmyshenmusic
      @jimmyshenmusic 6 років тому

      traverse all the raw cloud points. Check whether it will give a contribution to the max pooling result, if it does, then it is a critical point, if not, then it is not a critical point.

  • @operaworker
    @operaworker 7 років тому +13

    It's truly a great paper and great explanation, however the chomping sound, which is made by the speaker at the end of each sentence, is horribly disgusting and annoying.

    • @Imandabbagh
      @Imandabbagh 5 років тому +4

      I didn't notice until reading your comment, now I can't focus on the topic anymore!

    • @saisachinnekkanti7763
      @saisachinnekkanti7763 5 років тому

      @@Imandabbagh ;(

    • @williambonvini5806
      @williambonvini5806 4 роки тому +4

      "horribly disgusting"? Don't be so dramatic please 😅 great paper anyways!