Unity Catalog setup for Azure Databricks

  • Published 19 Jul 2024
  • In this video I walk through setting up Unity Catalog on Azure and quickly explore the cataloging features for a couple of tables with a workflow. This includes setting up storage and an access connector, then a quick walkthrough of lineage and other metadata tracked at the table level.
    * All thoughts and opinions are my own *
    Learn more about Unity Catalog with one of the videos below:
    Short intro from Databricks - • Databricks Unity Catal...
    Data Lineage - • Automated Data Lineage...
    Deeper dive introduction - • Introduction to Unity ...
    More from Dustin:
    Website: dustinvannoy.com
    LinkedIn: / dustinvannoy
    Github: github.com/datakickstart
    CHAPTERS
    00:00 Intro
    01:35 Setup Metastore
    04:58 Assign workspace + explore
    09:20 Outro
  • Science & Technology

COMMENTS • 11

  • @kacho2580 11 months ago

    Thanks for the video, Dustin. I just wanted to ask: will the data and metadata that already exist in hive_metastore be affected after I create the Unity Catalog metastore?

    • @DustinVannoy 9 months ago

      That data will still be available under a catalog named hive_metastore. So you would now have a three-part name available: {catalog}.{schema}.{table}
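
The three-part name in the reply above can be sketched in Python as follows. This is only an illustration: the helper names, the `sales` schema, and the `orders` table are hypothetical and do not come from the video. It assumes the usual Spark SQL convention of quoting identifiers with backticks.

```python
def quote_identifier(name: str) -> str:
    """Wrap one identifier in backticks, doubling any embedded backtick."""
    return "`" + name.replace("`", "``") + "`"


def three_part_name(catalog: str, schema: str, table: str) -> str:
    """Join the parts into a {catalog}.{schema}.{table} name."""
    return ".".join(quote_identifier(part) for part in (catalog, schema, table))


# Hypothetical legacy table that was addressed as sales.orders before
# Unity Catalog; afterwards it sits under the hive_metastore catalog.
legacy = three_part_name("hive_metastore", "sales", "orders")
query = f"SELECT * FROM {legacy} LIMIT 10"
print(query)
```

In a notebook, a string like `query` would typically be passed to `spark.sql(...)`; that call is left out here so the sketch runs without a cluster.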

  • @abisheksubramanian8069 1 year ago

    Awesome content Dustin

  • @thedatamaster 1 year ago

    Thanks for this video, Dustin.
    When I click on Manage Account in the Azure Databricks workspace, I am not able to get to the account console. Could you please help me with that?
    Thanks in advance.

    • @thekydang5720 1 year ago +2

      You need the Azure AD Global Administrator role to access Manage Account.

  • @maira9648 1 year ago

    Thanks for this video! Just wondering if Unity Catalog allows us to create functions to flag invalid data? I was thinking of having all business rule validation in Unity Catalog rather than spread across each individual ETL solution.

    • @DustinVannoy 9 months ago +1

      For doing checks when processing the data, I like to create Python libraries stored alongside notebooks in Databricks Repos so you can reuse the logic. You can also create UDFs that are stored in Unity Catalog: learn.microsoft.com/en-us/azure/databricks/udf/unity-catalog
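
A minimal sketch of the reusable-library idea from the reply above, in plain Python. The rule names and record fields (`amount`, `customer_id`) are hypothetical and only illustrate the pattern; this is not the author's actual library. The same predicates could presumably be wrapped as UDFs by following the linked documentation.

```python
from typing import Any, Callable, Dict, List

Record = Dict[str, Any]
Rule = Callable[[Record], bool]

# Hypothetical business rules: each predicate returns True when the record passes.
RULES: Dict[str, Rule] = {
    "amount_non_negative": lambda r: r.get("amount") is not None and r["amount"] >= 0,
    "customer_id_present": lambda r: bool(r.get("customer_id")),
}


def failed_rules(record: Record, rules: Dict[str, Rule] = RULES) -> List[str]:
    """Return the names of the rules that the record violates."""
    return [name for name, check in rules.items() if not check(record)]


def flag_invalid(records: List[Record]) -> List[Record]:
    """Return copies of the records with a 'failed_rules' list added to each."""
    return [{**r, "failed_rules": failed_rules(r)} for r in records]


rows = [
    {"customer_id": "c1", "amount": 10.0},
    {"customer_id": "", "amount": -5},
]
flagged = flag_invalid(rows)
for row in flagged:
    print(row)
```

Keeping the rules in one importable module means every notebook or job applies the same checks, which is the reuse the reply describes.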

  • @TheDataArchitect 1 month ago

    Can Delta Sharing work with hive_metastore?

  • @nielshoogeveen3767 1 year ago +1

    I have an admin account. However, I do not see "Create catalog". What could be wrong?

    • @DustinVannoy 1 year ago

      Has a Unity Catalog metastore already been created and assigned?

  • @TJ-hs1qm 8 months ago

    Only 2 min in and he already lost me. At 1:53 I can't see the referenced screen 😆?!
    For future videos: it would be greatly appreciated if the necessary prerequisites could at least be listed in the description box.
    This -> ua-cam.com/video/M7C-MyVHyrU/v-deo.html