Azure Synapse Analytics - Introduction to Azure Purview

  • Published 19 Aug 2024

COMMENTS • 39

  • @CameronNeale
    @CameronNeale 3 years ago +1

    Love the videos! The way you present all of the details is very engaging. What a change from the dull videos we are used to watching from other channels!

  • @carltonpatterson5539
    @carltonpatterson5539 3 years ago

    I’m usually very lazy at making comments, but I feel I should say a big thanks for this excellent demonstration on Purview. Cheers Simon

  • @benjaminbong9262
    @benjaminbong9262 2 years ago +1

    Great video! Very engaging and informative. Also clear and easy to understand. Thank you!

  • @DarkMisley
    @DarkMisley 3 years ago +3

    Cheers Simon, hopefully they'll put more effort into this service than Data Catalog v1 (which never seemed to move beyond an MVP state)

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago +1

      Yeah... The Data Catalog that couldn't understand non-relational sources... Given Purview is Atlas-based that's already a good start. I imagine we'll see quite a few rough edges as it matures!

  • @karnasaurav
    @karnasaurav 3 years ago

    Thanks, man. Great stuff. The explanation is very informative. Appreciate it.

  • @VINITSANSARE
    @VINITSANSARE 3 years ago

    Beautiful demonstration of Azure Purview 👍

  • @radekou
    @radekou 2 years ago +1

    As things stand right now, would you use Purview for your company's metadata management and governance, or would you rather build something of your own (using APIs, custom pipelines, Dataverse, Power BI, ...)? I'd ask the same of the Databricks Unity Catalog.

  • @28nov82
    @28nov82 3 years ago

    Very nice overview!

  • @NeumsFor9
    @NeumsFor9 3 years ago +3

    Dataverse, Purview, Synapse.....I am getting marketing rename fatigue, but at least metadata is getting more love.
    Of course, there is always Kimball's "Meta Meta Data Data" article to fall back on. I swear, sometimes I feel as though companies read his older articles...apply 55% of stated functionality.....wait for MVPs to fill in 20% more.....then, the final 25% comes from community pain and feedback. Ha. Rinse. Repeat.

  • @jdr9861
    @jdr9861 2 years ago

    Useful content -- thank you. Moving background is distracting and drives me crazy though.

  • @firstch7801
    @firstch7801 3 years ago

    Could you please make content about custom classification in Purview? I really need it :)

  • @PicaPauDiablo1
    @PicaPauDiablo1 3 years ago

    Thank you

  • @jinlinxu5109
    @jinlinxu5109 3 years ago

    Great video, Simon! Did you ever get the chance to look at the performance impact to the data provider?

  • @NeumsFor9
    @NeumsFor9 3 years ago

    Simon,
    Thanks for showing this out of the box. It makes me miss consulting. As I search for a new position, these videos do a great job of helping people stay abreast. Does it feel as though this is a good step in the "DataOps" direction as part of the puzzle?

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago

      It does... but it's not perfect yet. There's been a big focus on control & security here (the name Purview alone speaks volumes), whereas for me, enabling DataOps is all about ease of data discovery and connectivity. It has features that align to this vision, and I think we'll get there fairly quickly, but the initial push is certainly more on the "govern & control" than "enable & encourage" path of data gov

    • @NeumsFor9
      @NeumsFor9 3 years ago

      @AdvancingAnalytics Maybe they can mix in the IDEAR tool from the Team Data Science Process.

  • @renatofreitas3877
    @renatofreitas3877 3 years ago

    Great introduction Simon, good to see Microsoft investing more in this field. With the Atlas APIs we might be able to use it as a schema registry for the data platform, to control the data that is ingested and processed, and the schema evolution. Is my thinking correct?

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago

      Aha, I have a master plan along those lines, although I believe some of the auto-scanning functionality is disabled if you override entities manually... But there's certainly potential to use it as an engineering metastore!

    • @renatofreitas3877
      @renatofreitas3877 3 years ago

      Good to hear that. I'll play a little bit with Purview. Thanks again Simon.

    • @iphadkegmail
      @iphadkegmail 3 years ago +1

      It's a little weird. They just added a schema registry feature directly into Event Hubs. Now, if they integrated that into Purview, that would be the absolute killer use case!

  • @singhrakeshr
    @singhrakeshr 2 years ago

    Could you share the tool you use for diagrams?

  • @pankajsingh23UTube
    @pankajsingh23UTube 3 years ago

    I scanned Azure Synapse Workspace but it does not show lineage information. Am I missing any permission?

  • @saurabhkp89
    @saurabhkp89 3 years ago +1

    I have been using the Babylon data catalog for some time but it's never been stable. Hopefully with Purview things will be stable enough to do something.

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago

      Yeah, but that's the nature of internal/private preview - testing features not SLAs! Not to be cynical, but I imagine proper stability will come a few months after GA...

    • @gauravkumar796
      @gauravkumar796 3 years ago

      I have had a similar experience with the Babylon data catalog.

  • @rajeevsharma2664
    @rajeevsharma2664 3 years ago

    Nice video. Does it mean Azure Data Catalog is out? Second, in Azure SQL DB we've seen the data classification/confidentiality settings. Will those be deprecated?

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago

      Yep, what was Data Catalog V2 is all wrapped into Purview. It's "out" as public preview, but it isn't fully out yet.
      It uses a lot of the same "scanning" technologies that you see with the auto-classification features in SQL, Cosmos etc, no idea about deprecation & merging of features in the future though!

  • @pini22ki
    @pini22ki 3 years ago

    Thanks for the review.
    Is it all scans or workspace per subscription?
    Do we have an option to do it all over my account?

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago

      Hi! You currently set up scans on specific objects that you have connections to - this could be within the same subscription, other accounts you have access to, or even via service principals connecting to external sources. I don't believe there's a "scan my whole subscription" function built in, and I honestly don't know if they're going in that direction, or if it's always going to be a manual inclusion process.
      Simon

    • @MarkVersteegh
      @MarkVersteegh 3 years ago

      @AdvancingAnalytics Microsoft shows the option to register multiple sources at the same time, see ua-cam.com/video/27bA4KFiEKk/v-deo.html
      However, I have not been able to find the option myself (and I tried all the regions where Purview is available), so I guess it's still in private preview.

  • @iphadkegmail
    @iphadkegmail 3 years ago

    Thank you Simon.. One thing not very clear is, how are they scanning for data lineage? Scanning tables alone won't do that. Did not see a place to scan ADF flows or SSIS flows. Thoughts?

    • @AdvancingAnalytics
      @AdvancingAnalytics 3 years ago +2

      I'm not sure if it's changed since I looked at the preview - There's a fair bit about the ADF & Data Share setup in the docs, but when I had a quick look I couldn't see it - might not be in the public preview just yet!
      Expect it'll follow what's described here: docs.microsoft.com/en-gb/azure/purview/catalog-lineage-user-guide

    • @iphadkegmail
      @iphadkegmail 3 years ago

      @AdvancingAnalytics ty!

    • @NeumsFor9
      @NeumsFor9 3 years ago

      You should check out the blog of Mr. Paul Andrew, who put together a nice metadata framework for Azure Data Factory in a similar way that sqlmetadata.codeplex.com did about 10 years ago.

    • @iphadkegmail
      @iphadkegmail 3 years ago

      @NeumsFor9 Can you share the link? Ty!

    • @somediver9925
      @somediver9925 3 years ago

      You can also do API calls against ADF, and I would suggest you store the data as key-value pairs since there are a HUGE number of hierarchies. You can also do API calls within ADF; just be aware of paging the result sets.
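
The suggestion in the last reply (flatten the hierarchical responses into key-value pairs, and follow the paging) could look roughly like the sketch below. It is only an illustration, not the commenter's code: it assumes each page is a JSON object with its items under `value` and an optional `nextLink` pointing at the next page, and it uses a hypothetical in-memory `fake_fetch` with made-up pipeline data instead of a real authenticated HTTP call.

```python
from typing import Any, Callable, Dict, Iterator, Tuple


def flatten(node: Any, prefix: str = "") -> Iterator[Tuple[str, Any]]:
    """Yield (path, value) pairs for every leaf of a nested dict/list."""
    if isinstance(node, dict):
        for key, child in node.items():
            yield from flatten(child, f"{prefix}.{key}" if prefix else key)
    elif isinstance(node, list):
        for index, child in enumerate(node):
            yield from flatten(child, f"{prefix}[{index}]")
    else:
        yield prefix, node


def iter_pages(fetch_page: Callable[[str], Dict[str, Any]], first_url: str) -> Iterator[Dict[str, Any]]:
    """Yield every item across all pages, following 'nextLink' until it is absent."""
    url = first_url
    while url:
        page = fetch_page(url)
        yield from page.get("value", [])
        url = page.get("nextLink")  # assumed paging field; None ends the loop


# Hypothetical stand-in for the real API: two pages of made-up pipeline definitions.
FAKE_PAGES = {
    "page1": {
        "value": [{"name": "CopySales", "properties": {"activities": [{"type": "Copy"}]}}],
        "nextLink": "page2",
    },
    "page2": {"value": [{"name": "LoadDim", "properties": {"activities": []}}]},
}


def fake_fetch(url: str) -> Dict[str, Any]:
    return FAKE_PAGES[url]


# One (pipeline, key, value) row per leaf, ready to load into a key-value table.
rows = [
    (pipeline["name"], path, value)
    for pipeline in iter_pages(fake_fetch, "page1")
    for path, value in flatten(pipeline)
]
for row in rows:
    print(row)
```

Storing one row per leaf path keeps the table schema fixed however deep the nesting goes, which is the point of the key-value suggestion; to use it for real, `fake_fetch` would be replaced by a function that performs the authenticated request and returns the parsed JSON.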