William Lyon
William Lyon
  • 6
  • 55 225
Getting Started With The Apache Sedona Docker Image
Apache Sedona is an open-source framework for working with large scale geospatial data. It adds spatial functionality to distributed data processing frameworks like Apache Spark and Apache Flink to enable developers and data scientists to work with spatial data at scale. Apache Sedona exposes native types for representing complex geometries like points, lines, polygons and implements geospatial indexing and partitioning for fast lookups and efficient distributed processing of spatial data at scale. Geospatial querying functionality is available with Spatial SQL by implementing the SQL-MM3 and OGC SQL standards. We can work with Apache Sedona via Python, R, Spatial SQL, and other tooling - such as a Jupyter Notebook environments and via seamless integration with the PyData ecosystem.
There are many ways to leverage Apache Sedona whether incorporating into an existing data pipeline or building a new greenfield analytics application. For example Apache Sedona can be deployed into a Databricks cluster, run in AWS EMR, it works with Snowflake, or can be run on your own infrastructure. In this video we'll be using the Apache Sedona Docker image to get a cluster running locally and perform some basic geoprocessing tasks.
Resources:
* Apache Sedona: sedona.apache.org
* Apache Sedona docker image: hub.docker.com/r/apache/sedona
* Follow Will on Twitter: lyonwj
* Connect with Will on LinkedIn: linkedin.com/in/lyonwj
* Will's blog / newsletter: lyonwj.com/
* Wherobots: www.wherobots.ai/
Переглядів: 1 458

Відео

Building a Real-time Recommendation Engine With Neo4j - Part 1/4 - William Lyon - OSCON 2017
Переглядів 22 тис.7 років тому
All data and code available here: neo4j.com/sandbox-v2/oscon github.com/johnymontana/oscon-neo4j-recommendations/blob/master/graph-based-rec-sys.ipynb Description William Lyon demonstrates how to build a recommendation engine using Neo4j and Python. The solution will be a hybrid that makes use of both content-based and collaborative filtering to come up with multilayered recommendations. Willia...
Building a Real-time Recommendation Engine With Neo4j - Part 4/4 - William Lyon - OSCON 2017
Переглядів 3 тис.7 років тому
All data and code available here: neo4j.com/sandbox-v2/oscon github.com/johnymontana/oscon-neo4j-recommendations/blob/master/graph-based-rec-sys.ipynb Description William Lyon demonstrates how to build a recommendation engine using Neo4j and Python. The solution will be a hybrid that makes use of both content-based and collaborative filtering to come up with multilayered recommendations. Willia...
Building a Real-time Recommendation Engine With Neo4j - Part 2/4 - William Lyon - OSCON 2017
Переглядів 6 тис.7 років тому
All data and code available here: neo4j.com/sandbox-v2/oscon github.com/johnymontana/oscon-neo4j-recommendations/blob/master/graph-based-rec-sys.ipynb Description William Lyon demonstrates how to build a recommendation engine using Neo4j and Python. The solution will be a hybrid that makes use of both content-based and collaborative filtering to come up with multilayered recommendations. Willia...
Building a Real-time Recommendation Engine With Neo4j - Part 3/4 - William Lyon - OSCON 2017
Переглядів 3,7 тис.7 років тому
All data and code available here: neo4j.com/sandbox-v2/oscon github.com/johnymontana/oscon-neo4j-recommendations/blob/master/graph-based-rec-sys.ipynb Description William Lyon demonstrates how to build a recommendation engine using Neo4j and Python. The solution will be a hybrid that makes use of both content-based and collaborative filtering to come up with multilayered recommendations. Willia...
Loading JSON with Neo4j
Переглядів 19 тис.7 років тому
In this quick tutorial we'll learn how to import JSON files into Neo4j, allowing us to model them as a graph and query across documents.

КОМЕНТАРІ

  • @adilmajeed8439
    @adilmajeed8439 4 місяці тому

    Thanks for the video. How to integrate Apache Sedona with Microsoft Fabric. There is no documentation available where we can follow it properly, till so far we are in the dark and not able to use this fantastic software.

    • @lyonwj
      @lyonwj 4 місяці тому

      Hi - this GitHub issue has some details on running Apache Sedona in MS Fabric: github.com/apache/sedona/issues/1198

  • @Stream_Squad
    @Stream_Squad 7 місяців тому

    Hi Will, nice intro! However, when I replicate the same code, at line 9, I get the following error: AnalysisException: [TABLE_OR_VIEW_NOT_FOUND] The table or view `basins` cannot be found. Verify the spelling and correctness of the schema and catalog. If you did not qualify the name with a schema, verify the current_schema() output, or qualify the name with the correct schema and catalog. To tolerate the error on drop use DROP VIEW IF EXISTS or DROP TABLE IF EXISTS.; line 1 pos 49; 'Project ['ST_Centroid('geometry) AS centroid#226, *] +- 'UnresolvedRelation [basins], [], false

    • @lyonwj
      @lyonwj 7 місяців тому

      Did you run hucs_df.createOrReplaceTempView("basins") ? That line will create the view. Also, feel free to create an issue at community.wherobots.com where it might be easier to troubleshoot

    • @Stream_Squad
      @Stream_Squad 7 місяців тому

      hucs_df.createGlobalTempView("basins") hucs_df.printSchema() This one is working fine!

  • @Gett37
    @Gett37 9 місяців тому

    Including instruction on how to install and configure apoc would be nice. And still it doesn't work, because it treats windows filepath as relative for no reason at all. Very unpleasant experience.

    • @Gett37
      @Gett37 9 місяців тому

      I've moved json to neo4j inner dir and now I'm getting `java.lang.OutOfMemoryError`. These simple tasks work flawlessly in other dbms's.

    • @Gett37
      @Gett37 9 місяців тому

      setting `server.memory.heap.max_size=10000m` helped... 💢💢

  • @supreethmenon1259
    @supreethmenon1259 Рік тому

    Link for the JSON data ?

  • @sercanyildirimtugcann
    @sercanyildirimtugcann 2 роки тому

    Hi William, I am too confused about Neo4j and wanted to ask you directly. Any help would be appreciated. What I want to create is search engine for documents ok. I am also interested in grap machine learning and recommendation engines but also want my platform seem a bit interactive and stylish so I will use react js. The question is that which way is much more compatible with Neo4j? Node. js+react or Django+react? What do you suggest me as beginner on graph databases? I want to fully dive into this topics. Thanks in advance!!!

  • @masterjp227
    @masterjp227 2 роки тому

    Hey man, the slides that you use for Neo4j are different from the ones shown in the github, what happened?

  • @luisramos1977
    @luisramos1977 2 роки тому

    in minute 21 otherGroups is a variable that is bound to the groups which have the keyword topics. That can be use to find not declared relations of the key topics with other groups.

  • @mihirhardas8297
    @mihirhardas8297 2 роки тому

    Very clear and brief!!!

  • @kebman
    @kebman 2 роки тому

    Need to approve imports, but apoc.conf isn't in the conf folder :p

  • @diarmuidbrady8987
    @diarmuidbrady8987 2 роки тому

    Where can I get the data for this if it is not on the sandbox

    • @rickyS-D76
      @rickyS-D76 Рік тому

      Here in his github link: github.com/johnymontana/oscon-neo4j-recommendations/blob/master/graph-based-rec-sys.ipynb

  • @diarmuidbrady8987
    @diarmuidbrady8987 2 роки тому

    Where can I find the data for this??

  • @krishnachaitu17
    @krishnachaitu17 3 роки тому

    will ,great session thank you for it

  • @arnavsshah
    @arnavsshah 3 роки тому

    this is too good!!!!

  • @Skandawin78
    @Skandawin78 3 роки тому

    But I'm not sure how the applications can use this data from neo4j query ? Can BI or visualization groups connect to neo4j and create reports?

  • @Skandawin78
    @Skandawin78 3 роки тому

    Very good presentation

  • @supa9029
    @supa9029 3 роки тому

    Thank you.

  • @stewiefre
    @stewiefre 4 роки тому

    I have a question: When i try to invoque call apoc.load.json(path) it says that i have illegal characters. I have my json file at this path: C:\Users\Joao\Desktop\ProjetoFinal_8150133\DadosExtraidosGOV ew.json What should be my full path? This is the error: Failed to invoke procedure `apoc.load.json`: Caused by: java.lang.RuntimeException: Can't read url or key file:/C:/Users/Joao/.Neo4jDesktop/neo4jDatabases/database-ae613b5d-df8e-4180-af6f-d99159d18e86/installation-4.0.3/import/Users/Joao/Desktop/ProjetoFinal_8150133/DadosExtraidosGOV/new.json as json: C:\Users\Joao\.Neo4jDesktop eo4jDatabases\database-ae613b5d-df8e-4180-af6f-d99159d18e86\installation-4.0.3\import\Users\Joao\Desktop\ProjetoFinal_8150133\DadosExtraidosGOV ew.json (system could not find the specified path)

  • @LokoKolima
    @LokoKolima 4 роки тому

    Someone has an idea why my importation query is taking so long? (33504419ms -> 9.3068 Hrs ). Similarly, none Node appears on Node Labels. Help me :( BTW, I'm using the same JSON (YELP)

  • @abhishekanand1939
    @abhishekanand1939 4 роки тому

    can you explain how to export file from the neo4j

  • @PriyaAggarwal24
    @PriyaAggarwal24 5 років тому

    Hello Will, Great session!! :) I just have one small question if you help answer that please. You mentioned that there could be multiple tags associated with a node. Could you please share an example where having multiple labels for a node would be useful?

    • @dhananjayghanwat
      @dhananjayghanwat 4 роки тому

      Let's say you have a Service Node which represents every service (train) in a plan. Now in a typical plan for 6 months, you might have hundreds of thousands of services which run on different days. Now instead of having just one label "Service", you can also label them as SundayService, MondayService and so on to represent Services which run on a given weekday. By doing that you will have the total number of SundayService nodes far less than Sevices nodes. So now you can use label SundayService in your cypher queries to work on smaller data set rather than huge dataset of Service nodes, which will make you query much faster. Hope it helps

  • @VinayNarayanaK
    @VinayNarayanaK 5 років тому

    Hey William, The cypher Query at 27:40 Considering the direction of relationship Is it *Match (t)<-[:HAS_TAG]-(other:Book)* or *Match (other:Book)<-[:HAS_TAG]-(t)* ?

  • @vanniubi
    @vanniubi 6 років тому

    Great tutorial. How do you deal with nested (multi-layer) JSON?

    • @simpleman1546
      @simpleman1546 4 роки тому

      It doesn't work with nested JSON (wait). The properties can only support one level information however if you have nested JSON and you want to cherry-pick something out the nested part and bring it to one-level. goessner.net/articles/JsonPath/ This is what they give you traverse the nested JSON.

  • @studentstudent7209
    @studentstudent7209 6 років тому

    Thanks, Will for this great tutorials. :)

  • @Anonymous-lw1zy
    @Anonymous-lw1zy 6 років тому

    Neo4J desktop throws the error: Neo.ClientError.Procedure.ProcedureCallFailed: Failed to invoke procedure `apoc.load.json`: Caused by: java.lang.RuntimeException: Import from files not enabled, please set apoc.import.file.enabled=true in your neo4j.conf but when I add that to /etc/neo4j/neo4j.conf it does not fix the error. So I am dead in the water.

    • @studentstudent7209
      @studentstudent7209 6 років тому

      I was getting the same error. It's gone after Neo4j restart. Restart Neo4j after any change. If you have done it already, Please share the new error.

    • @fabianodesouzadasilva9322
      @fabianodesouzadasilva9322 Рік тому

      Step:1 To traverse to the Dbms folder, click the three dots (...), go to "open folder" -> DBMS, the dbms folder will open, then navigate to "conf" folder inside it & you could find the "neo4j.conf" file. Now you need to create a new "apoc.conf" file. Step2: In the dbms folder -> open “conf” folder -> create a new “apoc.conf” file by opening a notepad, then type the following 2 lines and save files as “.conf” file under “all files” in the same “conf” folder. apoc.import.file.enabled=true apoc.import.file.use_neo4j_config=true Step: 3 Restart the dbms

  • @anicethounkpe582
    @anicethounkpe582 6 років тому

    Hey William Lyon! Thanks for your presentation. The link for your slides is not available anymore.

    • @lyonwj
      @lyonwj 5 років тому

      Sorry about that - looks like the dropbox link died :-( I've added the slides to the Github repo here: github.com/johnymontana/oscon-neo4j-recommendations

  • @nawazuddin9683
    @nawazuddin9683 6 років тому

    how can i get this slides

  • @vjaybio7
    @vjaybio7 6 років тому

    Can you please share the link to download the json data files for yelp... the link in the video doesn't open

    • @jokerddlv
      @jokerddlv 6 років тому

      apoc.import.file.enabled=true Put this in your config file and make sure you get the right file path

  • @eolhovsky2003
    @eolhovsky2003 6 років тому

    where do you get apoc.load.jason procedure from?

    • @sholavandan81
      @sholavandan81 6 років тому

      I too have the same question...

    • @lyonwj
      @lyonwj 6 років тому

      You can download the APOC library here: github.com/neo4j-contrib/neo4j-apoc-procedures/releases or using Neo4j Desktop there is a one-click install button in the Plugins tab.

    • @simpleman1546
      @simpleman1546 4 роки тому

      @@lyonwj Also, in Linux tar installation. Place the downloaded jar apoc file at this place /var/lib/neo4j/plugins/ make sure to change your neo4j.conf (read the docs) and restart the database

  • @dziadzia992
    @dziadzia992 6 років тому

    William, do you have rsvp's ready :)?