AI Summarize HUGE Documents Locally! (Langchain + Ollama + Python)

Поділитися
Вставка
  • Опубліковано 8 січ 2025

КОМЕНТАРІ • 15

  • @jakubzakowski7422
    @jakubzakowski7422 12 днів тому

    one of the best videos i have ever seen. I just want to tell you Thank you and good job

  • @srivenkateswaraswamy3403
    @srivenkateswaraswamy3403 2 місяці тому +4

    what if images of tables and equations are there in that case?

  • @DebugVerseTutorials
    @DebugVerseTutorials  4 місяці тому +6

    Source code
    github.com/debugverse/debugverse-youtube/tree/main/summarize_huge_documents_kmeans

  • @danila8823
    @danila8823 3 дні тому

    Using gemini vision to describe the video?? Nice technique

  • @ajays6393
    @ajays6393 13 днів тому

    Thank you very informative!

  • @thingX1x
    @thingX1x 15 днів тому

    Will this work for a procedurally generated file containing a conversation? Or should I look at another method?

  • @mightyboessu
    @mightyboessu 2 місяці тому +4

    Why do you use the HuggingFaceBgeEmbeddings and not OllamaEmbeddings?

  • @meereslicht
    @meereslicht 3 місяці тому

    Excellent, thank you! A very clever strategy for large documents. However, I am a little at a loss in the search of a good embedding model for texts in Spanish. I am not sure whether the BGE models are the best option for these. Can you suggest one that could be integrated seamlessly within your code?

    • @DebugVerseTutorials
      @DebugVerseTutorials  3 місяці тому +2

      Hi, for Spanish language take a look at jinaai/jina-embeddings-v2-base-es . In your code simply replace the model_name variable and everything should work.

    • @meereslicht
      @meereslicht 3 місяці тому +1

      @@DebugVerseTutorials Thank you very much for your kind answer. I'll do that 😊🤗🤗

    • @igorcastilhos
      @igorcastilhos 2 місяці тому

      @@DebugVerseTutorials Hi, if I would to use the Ollama model, how can I know the exact name necessary to put in the model_name?

    • @mukeshkund4465
      @mukeshkund4465 27 днів тому

      ​@@igorcastilhosdo ollama list to see the model available and copy the name.

    • @allok501
      @allok501 17 днів тому

      you can use latest jina embeddings v3 as it is multilinugal.