(Speaker Identification)--Speaker Recognition from raw waveform with SincNet

  • Published 17 Nov 2024

COMMENTS • 10

  • @tanmay_ds 4 years ago

    This isn't specifically about this video, but can you suggest a tutorial for learning about speaker verification using i-vectors?

    • @KrishnaDN 4 years ago +2

      I would suggest a tutorial by Najim Dehak (inventor of the i-vector).
      www.superlectures.com/odyssey2016/i-vector-representation-based-on-gmm-and-dnn-for-audio-classification

  • @ismaeelkrayyan2488 5 years ago

    I want to know how to implement this in my project, and I would like the dataset, please.

  • @yashtiwari4705 4 years ago

    Hi. Is it possible to implement this model on Windows? The documentation mentions Linux.

  • @osakajay480 4 years ago

    Can't I get the code?

    • @KrishnaDN 3 years ago

      github.com/mravanelli/SincNet

  • @suryan3479 4 years ago

    Can you do a video on RawNet?

    • @KrishnaDN 4 years ago

      ua-cam.com/video/9lOkPtilD74/v-deo.html

  • @karthikgali 3 years ago

    Hi Krishna, thanks for this video. I have a question:
    As mentioned here ua-cam.com/video/sendxu-rHlY/v-deo.html, would you be predicting the speaker for each 200 ms of audio?

    • @KrishnaDN 3 years ago

      Not for each 200 ms. They use a 200 ms window with a 10 ms shift to obtain frame-level classification posteriors, then pool them to obtain the utterance-level classification.
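
      To make the windowing and pooling in this reply concrete, here is a minimal Python sketch. It is not the SincNet code: the 16 kHz sampling rate, the function names, and the toy_frame_posteriors stand-in for the trained network are all assumptions made for illustration, and averaging is just one plausible way to "pool" the frame posteriors.

```python
import math

SAMPLE_RATE = 16000   # assumed sampling rate in Hz (not stated in the thread)
WINDOW_MS = 200       # window length quoted in the reply
SHIFT_MS = 10         # window shift quoted in the reply


def frame_signal(signal, sample_rate=SAMPLE_RATE):
    """Split a waveform into overlapping 200 ms windows taken every 10 ms."""
    win = sample_rate * WINDOW_MS // 1000
    hop = sample_rate * SHIFT_MS // 1000
    return [signal[s:s + win] for s in range(0, len(signal) - win + 1, hop)]


def toy_frame_posteriors(frame, n_speakers):
    """Hypothetical stand-in for the trained network: softmax over toy scores."""
    energy = sum(x * x for x in frame) / len(frame)
    scores = [-(energy - k) ** 2 for k in range(n_speakers)]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_utterance(signal, n_speakers, frame_model=toy_frame_posteriors):
    """Pool frame-level posteriors and pick the most likely speaker."""
    frames = frame_signal(signal)
    if not frames:
        raise ValueError("signal is shorter than one 200 ms window")
    posteriors = [frame_model(f, n_speakers) for f in frames]
    # Pooling step: mean posterior per speaker (averaging is an assumption).
    pooled = [sum(p[k] for p in posteriors) / len(posteriors)
              for k in range(n_speakers)]
    speaker = max(range(n_speakers), key=lambda k: pooled[k])
    return speaker, pooled


if __name__ == "__main__":
    one_second = [1.0] * SAMPLE_RATE
    print(len(frame_signal(one_second)))            # number of windows
    print(predict_utterance(one_second, n_speakers=3)[0])
```

      The point of the sketch is that one utterance yields many overlapping frame predictions, and a single speaker decision is made only after they are pooled. In a real system, frame_model would be replaced by the trained network's softmax output.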