Exciting and inspiring! Thanks for sharing.
Thanks for your support :)
I have been working on a big research project for over two months, and I finally found the perfect video for my graph! It turned out amazing!!
Thank you.
If you found this content helpful, please consider sharing it with others who might benefit. Your support is greatly appreciated :)
You did a great job of explaining the LLM Graph Transformer clearly.
Thanks for your support :)
Fantastic explanation. Thank you!
If you found this content helpful, please consider liking, subscribing, and sharing it with others who might benefit. Your support is greatly appreciated :)
Amazing work, thank you so much!
Thanks for your support. Please consider sharing it with communities who might benefit. Your support is greatly appreciated :)
Very useful! Thanks so much!
Thanks for your support :)
Hello, the link in the description is not working anymore.
The main challenge I see with knowledge graphs is on the retriever side. Unless we (or the LLM) generate a Cypher query using exactly the same nodes and relationships that are present in the graph database, we are not going to get a proper response to the question we asked.
Yes, these tools and technologies are still improving every day. Sometime in the not-too-distant future, we will reach the minimum acceptable accuracy required...
Hi... Do you provide any courses on machine learning or AI?
I don't have any structured course on ML yet, but I'm planning to do one.
Did you share the notebook? I can't find it. It would be helpful to be able to run it on our own.
Pls check the link in the description; it's a link to the tutorial.
If you found this content helpful, please consider sharing it with others who might benefit. Your support is greatly appreciated :)
With structured data, as in your previous pgsql examples, can we still apply graph DB concepts to structured data for optimal RAG retrieval?
Structured data (a SQL DB) is usually easy to query; if we have a good text-to-SQL model, then we don't need any additional algorithmic techniques. Querying unstructured data is complex, and hence we have RAG, knowledge graphs, and combinations of them...
@SridharKumarKannam What would be your recommendation for writing optimal few-shot query examples to further optimise the SQL LLM agent, and also to reduce LLM token overhead and context size from bloated info?
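A minimal sketch of few-shot prompting for text-to-SQL, assuming LangChain's FewShotPromptTemplate; the employees table and the example questions are hypothetical. Short, schema-specific examples keep token overhead low.
----
# Sketch: few-shot examples for a text-to-SQL prompt. The `employees`
# table and the questions are hypothetical placeholders.
from langchain_core.prompts import FewShotPromptTemplate, PromptTemplate

examples = [
    {"question": "How many employees are there?",
     "query": "SELECT COUNT(*) FROM employees;"},
    {"question": "Who is the highest paid employee?",
     "query": "SELECT name FROM employees ORDER BY salary DESC LIMIT 1;"},
]

example_prompt = PromptTemplate.from_template("Question: {question}\nSQL: {query}")

prompt = FewShotPromptTemplate(
    examples=examples,
    example_prompt=example_prompt,
    prefix="Given this schema, write one SQL query for the question.\n{schema}",
    suffix="Question: {input}\nSQL:",
    input_variables=["schema", "input"],
)

print(prompt.format(schema="employees(name TEXT, salary INT)",
                    input="What is the average salary?"))
----
To trim the context further, an example selector (for instance LangChain's SemanticSimilarityExampleSelector) can pick only the most relevant examples per question instead of sending all of them.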
When do you expect GraphRAG to be production-ready?
I would think in the next few months, definitely within 6 months. There is a lot of work going on in converting free text to graphs.
@SridharKumarKannam Nice, thank you for sharing your knowledge. One of the challenges with GraphRAG seems to be that it requires a lot more tokens/time (from testing, it took about 40-60 seconds for one answer compared to 1-4 seconds for regular RAG, but the answer was 40% better). Do you think these challenges can be solved with the rise of LPUs / increasing inference speed?
Sridhar, Great job! Thanks
thank you :)
You can view your graphs with Neo4j Bloom also.
That's right. Thanks for your support :)
How do I use multiple Neo4j databases at the same time? Is this possible?
You can use only one DB in the free version.
I was so confused earlier. I am a student and recently got a job where I have to derive insights from unstructured raw text, and I was so confused since I didn't know Neo4j, and all the other videos were so confusing.
Thanks.
Also, what do you suggest for keeping updated with new topics or updates like these?
Thank you.
Pls follow the Neo4j blog and Medium.
Ollama-based LLMs don't work with the LLMGraphTransformer; do you know why?
I've not tested that. What error are you getting?
@SridharKumarKannam Thanks for your response!
----> 9 llm_transformer = LLMGraphTransformer(llm=llm)
215 schema = create_simple_model(allowed_nodes, allowed_relationships)
--> 216 structured_llm = llm.with_structured_output(schema)
217 self.chain = prompt | structured_llm
108 warned = True
109 emit_warning()
--> 110 return wrapped(*args, **kwargs)
199 @beta()
200 def with_structured_output(
201 self, schema: Union[Dict, Type[BaseModel]], **kwargs: Any
202 ) -> Runnable[LanguageModelInput, Union[Dict, BaseModel]]:
203 """Implement this if there is a way of steering the model to generate responses that match a given schema.""" # noqa: E501
--> 204 raise NotImplementedError()
Apparently Ollama doesn't produce structured output compatible with LLMGraphTransformer. I couldn't find a way around it.
@SridharKumarKannam AttributeError on calling LLMGraphTransformer.convert_to_graph_documents; I am having this error.
It runs the Ollama server but ends with this error:
@SridharKumarKannam Traceback (most recent call last):
File "/home/jelcke/dev/test/txt2graph/openai-graph.py", line 38, in
llm_transformer_filtered = LLMGraphTransformer(
File "/home/jelcke/dev/test/txt2graph/venv/lib/python3.10/site-packages/langchain_experimental/graph_transformers/llm.py", line 216, in __init__
structured_llm = llm.with_structured_output(schema)
File "/home/jelcke/dev/test/txt2graph/venv/lib/python3.10/site-packages/langchain_core/language_models/base.py", line 208, in with_structured_output
raise NotImplementedError()
NotImplementedError
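One workaround that has been reported for this NotImplementedError (untested here, so treat it as an assumption): wrap the model in OllamaFunctions from langchain_experimental, which emulates function calling on top of Ollama models and, in recent versions, implements with_structured_output, which LLMGraphTransformer needs. A minimal sketch:
----
# Untested sketch: OllamaFunctions emulates OpenAI-style function
# calling for Ollama models; recent versions of langchain_experimental
# implement with_structured_output on it, unlike the plain ChatOllama.
from langchain_experimental.llms.ollama_functions import OllamaFunctions
from langchain_experimental.graph_transformers import LLMGraphTransformer

llm = OllamaFunctions(model="llama3.1", temperature=0, format="json")
llm_transformer = LLMGraphTransformer(llm=llm)
----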
Congratulations on the video. I'm facing some issues that look like a package version problem. Could you provide the requirements for this experiment, please?
The LangChain library is being updated very frequently; pls check the latest docs/APIs. Did you resolve the issue? What's the error, pls...
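Until a pinned list is shared, here is a hedged sketch of the packages the code in this thread needs; the exact versions are deliberately left unpinned since these APIs change quickly, so pin your own working set (e.g. with pip freeze > requirements.txt):
----
# Sketch of a requirements.txt; versions intentionally unpinned.
langchain
langchain-community
langchain-experimental
langchain-openai
neo4j
python-dotenv
----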
Instead of text, can we do it from 2 tables? Maybe two columns in two tables?
You can if you have text in those columns. The source can be anything, as long as you format it as text...
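A minimal sketch of that formatting step, using two hypothetical tables loaded with pandas; each row is flattened into a sentence and the result is wrapped in a Document:
----
# Sketch: flatten rows from two hypothetical tables into sentences and
# wrap them as a Document; the real source could be any SQL result.
import pandas as pd
from langchain_core.documents import Document

people = pd.DataFrame({"name": ["Marie Curie"], "field": ["physics"]})
awards = pd.DataFrame({"name": ["Marie Curie"], "award": ["Nobel Prize"]})

sentences = []
for row in people.to_dict("records"):
    sentences.append(f"{row['name']} works in the field of {row['field']}.")
for row in awards.to_dict("records"):
    sentences.append(f"{row['name']} received the {row['award']}.")

documents = [Document(page_content=" ".join(sentences))]
# documents can now be passed to llm_transformer.convert_to_graph_documents
----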
Will it be possible using a Gemini model?
Yes, it should work with Gemini and most LLMs as long as their output format is as expected.
If you found this content helpful, please consider sharing it with others who might benefit. Your support is greatly appreciated :)
How can we process large documents? It takes so much time.
And how can we use the aconvert_to_graph_documents function?
What is the document size? It is expected to take a long time for large documents. For example, if it's a PDF book, then all the pages need to be converted to text; a typical book can have thousands of chunks, and for each chunk an embedding needs to be created and then stored in the index. Anyway, it's mostly a one-off task. First check the end-to-end workflow with a short text, then load the entire docs for usage.
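A minimal sketch of the splitting step, plus the async variant asked about above; it assumes `llm_transformer` was created as in the video, and "book.txt" stands in for your large source document:
----
# Sketch: split a large text into chunks, then use the async variant
# aconvert_to_graph_documents so the chunks are processed concurrently.
import asyncio
from langchain_core.documents import Document
from langchain_text_splitters import RecursiveCharacterTextSplitter

long_text = open("book.txt").read()  # hypothetical large source document

splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents([Document(page_content=long_text)])

graph_documents = asyncio.run(
    llm_transformer.aconvert_to_graph_documents(chunks)
)
----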
after the code:
llm = ChatOpenAI(temperature=0, model_name="gpt-4-0125-preview")
I receive the error:
ValidationError: 1 validation error for ChatOpenAI
__root__
Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass `openai_api_key` as a named parameter. (type=value_error)
But I cannot find where you add such variables.
You can create a .env file and paste your OpenAI key into it: OPENAI_API_KEY=your_api_key
You can also add the key directly in the code with os.environ["OPENAI_API_KEY"] = "YOUR_KEY"
I've set up the key in my environment variables in the bashrc file.
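Both approaches from the replies above, in one short sketch (the .env route needs the python-dotenv package):
----
# Sketch of the two approaches mentioned above.
import os

# Option 1: set the key directly in code (quick experiments only).
os.environ["OPENAI_API_KEY"] = "YOUR_KEY"

# Option 2: put the line  OPENAI_API_KEY=your_api_key  in a .env file
# next to the script and load it (requires: pip install python-dotenv).
from dotenv import load_dotenv
load_dotenv()
----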
Hello, thank you for this video, it was very helpful. How do I link the project to the Neo4j Desktop client?
This should do
----
import os
from langchain_community.graphs import Neo4jGraph
os.environ["NEO4J_URI"] = "bolt://localhost:7687"
os.environ["NEO4J_USERNAME"] = "neo4j"
os.environ["NEO4J_PASSWORD"] = "password"
graph = Neo4jGraph()
Using the same code... my llm_transformer.convert_to_graph part is not working.
It gives an error that the list index should be an integer and not a str.
Using a Hugging Face LLM.
The output format of the LLMs can be the issue. Pls test using OpenAI with a small text; if it's working, then the issue is the HF LLM output format, so choose a different model.
Hi sir, thank you for your great video. Two questions for you (or for anyone else):
1) What exactly does langchain Document do? Why not just feed raw text straight into the LLM Graph Transformer and let it extract relationships/identify entities?
2) What are some resources I can use to learn specifically about how the LLM Graph Transformer works?
Thank you kindly.
(1) That's the format llm_transformer expects; your text content is still the same. (2) llm_transformer as shown in the video is specific to LangChain, so refer to the documentation. There is a ton of info on converting text to KGs; my channel also has a number of videos. All the best...
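A minimal sketch of point (1): the transformer takes a list of Document objects, so the raw text just gets wrapped first (assumes `llm_transformer` exists as in the video):
----
# Sketch: wrap raw text in the Document format the transformer expects.
from langchain_core.documents import Document

text = "Marie Curie won the Nobel Prize in Physics in 1903."
documents = [Document(page_content=text)]

graph_documents = llm_transformer.convert_to_graph_documents(documents)
print(graph_documents[0].nodes)
print(graph_documents[0].relationships)
----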
Very impressive output on this content volume.
I do have a question, though. While this is great for building knowledge graphs, I'm curious how complicated these relationships can get in these knowledge graphs.
I have more experience with formal logic, and more sophisticated logical frameworks like temporal logic, fuzzy logic, etc., try to explain relationships with more accuracy.
It would be great if these more complicated relationships could be represented in these knowledge graphs. It would help so much in discovery and improving logical accuracy/soundness in written text.
It's better to define the schema explicitly for production use cases, and to do some post-processing of the LLM's output for sanity checks. I have several videos on these concepts. I'll add more content on this topic...
Amazing video! It clearly explains the concept. I have one question: The nodes and relationships are created by the LLM, so every time the code is run, it generates a different output. How do we handle that? Additionally, you spoke about allowed nodes and relationships. In a real-time scenario, when we don't have much knowledge about the input file, how can we extract all the entities and relationships from the document so that we don't miss any information? Your suggestions on this would be very helpful. Thank you!
1. First run without any schema; this will result in a lot of node and relationship types.
2. Analyse the nodes and relationships to find out which ones are important for your use case.
3. Now run it with a fixed schema (a sketch of this step is below).
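A minimal sketch of step 3, using the allowed_nodes/allowed_relationships parameters of LLMGraphTransformer; the node and relationship types listed here are only examples, to be replaced with the ones found in steps 1 and 2:
----
# Sketch of step 3: rerun with a fixed schema. The types below are
# illustrative placeholders, not a recommended schema.
from langchain_experimental.graph_transformers import LLMGraphTransformer
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(temperature=0, model_name="gpt-4-0125-preview")
llm_transformer = LLMGraphTransformer(
    llm=llm,
    allowed_nodes=["Person", "Organization", "Award"],
    allowed_relationships=["SPOUSE", "WORKED_AT", "WON"],
)
----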
If you found this content helpful, please consider sharing it with others who might benefit. Your support is greatly appreciated :)
Great! Appreciate this!
thank you very much for your support :)
I am writing this query to get the output:
response = chain.invoke({"query": "what was the name of SPOUSE of Marie Curie?"})
but in the output it is giving 'result': "I don't know the answer.", although in Neo4j it is showing the relationship.
Did you run the query multiple times and get the same output? All LLMs are stochastic (random) in nature; sometimes strange results are expected.
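One way to debug this, as a hedged sketch: rerun with verbose=True and inspect the Cypher the LLM generates. If its node/relationship names don't match what's actually in the DB, that mismatch (rather than randomness) is often why "I don't know" comes back. Depending on your LangChain version, GraphCypherQAChain may live under a different import path.
----
# Sketch: inspect the generated Cypher. Import path and required
# arguments may differ by LangChain version; newer releases may also
# require allow_dangerous_requests=True.
from langchain.chains import GraphCypherQAChain

chain = GraphCypherQAChain.from_llm(llm, graph=graph, verbose=True)
response = chain.invoke({"query": "what was the name of SPOUSE of Marie Curie?"})
print(response["result"])
----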
Hi, what happens if I run this code again for a second text? Does it add it to the same KG database? If not, what should I do to add a new text to the same database?
It will overwrite the DB. I'm sure Neo4j has the capability to add new information, but I'm not sure if that's implemented yet in the LangChain wrapper. If your database is not too large, you can re-create it by adding the new information to the raw data before extracting nodes/relationships...
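A minimal sketch of that re-create route; first_text and second_text are hypothetical placeholders for your original and new raw texts, and llm_transformer/graph are assumed to exist as earlier in the thread:
----
# Sketch: rebuild the graph from the combined raw texts rather than
# appending to the existing database.
from langchain_core.documents import Document

texts = [first_text, second_text]  # hypothetical: old and new raw texts
documents = [Document(page_content=t) for t in texts]

graph_documents = llm_transformer.convert_to_graph_documents(documents)
graph.add_graph_documents(graph_documents)
----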
What software is he using?
Neo4j
Thanks for your support :)
Yes it is.
Good stuff! Thank you!!! Has anybody tried with Ollama open-source models? Consistently I am getting nodes, but no relationships (other than MENTIONS from document to an entity). llm_transformer = LLMGraphTransformer(llm=llm, node_properties=True, relationship_properties=True, strict_mode=False), and we define llm = ChatOllama(model="llama3.1", temperature=0, format="json"). I even increased temperature to >0, but that does not help either???
There are function-calling issues with Ollama models. Try the solution suggested here; I've not tested it though:
github.com/langchain-ai/langchainjs/issues/6051
Am I the only one who met the error at step 6? My Python told me that there is an error in LLMGraphTransformer.process_response(self, document):
593 nodes_set = set()
594 relationships = []
--> 595 parsed_json = self.json_repair.loads(raw_schema.content)
AttributeError: 'str' object has no attribute 'content'
The output format from LLM models is important for these things.
stackoverflow.com/questions/78521181/llmgraphtransformer-convert-to-graph-documentsdocuments-attributeerror-str
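For what it's worth, a commonly reported resolution in that direction (hedged, untested here): pass a chat model, whose responses are message objects with a .content attribute, rather than a plain completion LLM that returns bare strings. A minimal sketch, assuming langchain_openai:
----
# Sketch: use a chat model (message objects with .content) instead of
# a plain completion LLM (bare strings), which triggers this error.
from langchain_openai import ChatOpenAI   # chat model: has .content
# from langchain_openai import OpenAI     # completion LLM: bare strings
from langchain_experimental.graph_transformers import LLMGraphTransformer

llm = ChatOpenAI(temperature=0, model_name="gpt-4-0125-preview")
llm_transformer = LLMGraphTransformer(llm=llm)
----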
Sir, can you please explain the basics too? I mean how to install Neo4j correctly on your PC. I tried setting up the system env for Neo4j on macOS Big Sur and struggled a lot with connecting it to my Jupyter Notebook in Python 3.11... and I see no improvement on my side. If you could enlighten me on how to establish a similar environment to run the app on my Mac/Win system, I would be grateful. I am watching all of your vids on Knowledge Graphs and LLMs and I have learnt a lot.
Is it resolved? What error are you getting? After installing Neo4j Desktop, you also need to install a couple of plugins from Neo4j Desktop.
@SridharKumarKannam The error is that the bolt://localhost:7687 URL is not found or not running. I want to connect my Neo4j Desktop to my Python script/notebook and try connecting both, so that I can use knowledge graphs to visualize information from given text.
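A minimal connectivity check with the official neo4j Python driver, independent of LangChain; the credentials below are Neo4j defaults and may differ on your setup, and the database has to be started in Neo4j Desktop first:
----
# Sketch: verify the bolt endpoint is reachable before involving
# LangChain. Adjust the credentials to your own setup.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687",
                              auth=("neo4j", "password"))
driver.verify_connectivity()  # raises if the server is unreachable
print("Connected to Neo4j")
driver.close()
----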
Bro, you're the best; you have saved my thesis. My only question is how to retrieve.
thanks for your support :)
You explained it well but the results are always inconsistent.
With LLMs the results are not always the same; they are stochastic. Set the temperature to a very low value. You can add some post-processing to the output of LLMs.
@SridharKumarKannam I did, but still no consistent results 😢
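A minimal sketch of the post-processing idea mentioned above: keep only whitelisted node and relationship types so repeated runs stay comparable. It assumes `graph_documents` came from convert_to_graph_documents and that the GraphDocument fields are mutable in your version:
----
# Sketch: drop anything outside a fixed whitelist before loading the
# results into the database. The type names are illustrative.
ALLOWED_NODES = {"Person", "Organization", "Award"}
ALLOWED_RELATIONSHIPS = {"SPOUSE", "WORKED_AT", "WON"}

for gd in graph_documents:
    gd.nodes = [n for n in gd.nodes if n.type in ALLOWED_NODES]
    gd.relationships = [r for r in gd.relationships
                        if r.type in ALLOWED_RELATIONSHIPS]
----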
Can you provide the code?
The link is in the description...
What API key are you using, and where do you get it from?
Many frameworks use OpenAI by default if we don't specify a model explicitly. The key is in my config files.