- 19
- 34 199
Brent Wesler
United States
Приєднався 18 лип 2023
This is a duplicate site
All content has been moved to youtube.com/@brent_wesler
All content has been moved to youtube.com/@brent_wesler
Parsing through the JSON Payload Returned by Microsoft Azure Prebuilt Document Intelligence
In this short video see how the Info Input solution can parse through the JSON payload that is returned from Microsoft Azure AI Document Intelligence.
By default Azure does not return sub nodes of an address field (remittance, customer, vendor) such as house number, street, postal code, etc. These sub node fields are not natively returned by Azure, so having the ability within Info Input (using standard JavaScript) to extract just the fields we need to perform validation is why using Info Input with Azure is such a powerful solution.
Info Input provides a great UI to allow junior or non-programmers to interrogate and transform data returned by Azure.
By default Azure does not return sub nodes of an address field (remittance, customer, vendor) such as house number, street, postal code, etc. These sub node fields are not natively returned by Azure, so having the ability within Info Input (using standard JavaScript) to extract just the fields we need to perform validation is why using Info Input with Azure is such a powerful solution.
Info Input provides a great UI to allow junior or non-programmers to interrogate and transform data returned by Azure.
Переглядів: 1 298
Відео
Microsoft Azure Document Intelligence Neural vs. Template Models
Переглядів 1,5 тис.Рік тому
In this session we will take a 12 document sample set, all with varying document types and formats, some handwritten correspondence, standard text-based forms, mortgage, insurance claims, medical claims and the like. We train 5 samples of the same document and then create a template and neural model in Azure Document Intelligence studio. We then test the same document set with both models. Ther...
Microsoft Azure AI Document Intelligence New Analyze Options
Переглядів 2,3 тис.Рік тому
Microsoft Azure AI Document Intelligence What's New in 3.1 specifically the new analyze options for font, high definition and formula data extraction. Additionally, we will explore neural vs template model building, how long it takes to build, limitations and language support.
Infuse Headless Smart Connected Scanning of Patient Onboarding using UiPath Unattended Bots
Переглядів 77Рік тому
In this short video, see how Kodak Alaris headless, computer less scanning solutions hands off a healthcare questionnaire, drivers license and insurance card to Info Input for automated classification, validation logic to ensure all three document types are present, and then extraction of patient data from each document. The data is then sent to UiPath unattended bot as part of UiPath Automatio...
Kodak Infuse Smart Connected Scanning with Base64.ai Unstructured Engine
Переглядів 235Рік тому
Infuse Smart Connected Scanning with cloud based Base64.ai unstructured document processing solution with backend data integration with hundreds of ERP, CRM and line of business systems . Its a really unique Kodak Alaris product which allows the scanner to "plug-in" to any cloud engine that supports REST. So my video below is showing scanning various document types with Base64.ai and showing th...
Microsoft Azure AI Document Intelligence Prebuilt AP invoice Parser with Advanced Mapping
Переглядів 1,7 тис.Рік тому
In this short video, see how to handle odd AP invoices where common label names for instance PO Number is not found and how to use the json output from Azure to string string for a key value pair like "Customer Ref" to populate the PO Number field in Info Input.
Microsoft Azure AI Document Intelligence-What's New in v3.1
Переглядів 1,4 тис.Рік тому
Learn what's new in Microsoft Azure Document Intelligence framework version 3.1, how to auto label tables and testing with new models.
How to call REST web service from Info Input through custom script
Переглядів 71Рік тому
Learn how to take manual user data entry of a zip code and pass the zip code to a REST web service and return the USA State that the zip code belongs to.
Basic Zone Capture with Advanced Indexing Forms and Auto Classification
Переглядів 40Рік тому
See how Info Input can use computer vision technology to automatically separate, extract and display custom end user data entry UI screens using internal classification engine
RegEx OCR Mapping Consent Form - Info Input Intelligent Document Processing
Переглядів 70Рік тому
Learn how to leverage ICR and OCR data extraction using RegEx mapping to look for label key value pairs to extract information from structured documents without the need for zones.
Kodak Alaris Infuse Smart Connected Scanning Solution
Переглядів 79Рік тому
Learn about 6 different use cases of leveraging the Kodak Alaris Infuse, headless, computerless scanning with bi-directional, direct channel responses back to the scanner touch panel. See how Infuse works with partners such as UiPath, Papercut, ABBYY Vantage, ID.now as well as direct integration with Info Input which supports Microsoft Azure Document AI Intelligence, AWS Textract, Google Docume...
How to build custom Document Intelligence models in Microsoft Azure
Переглядів 14 тис.Рік тому
Learn the skills on how to build custom document intelligent processing using Azure cognitive services, applied ai and Document Intelligence framework
Microsoft Azure Document Intelligence Custom Classification Models
Переглядів 9 тис.Рік тому
Learn how to build custom document separation and classification within Microsoft Azure Document Intelligence studio
Info Input Accounts Payable Invoice Automation
Переглядів 110Рік тому
Info Input Accounts Payable Invoice Automation
Info Input Intelligent Classification using Google Document AI
Переглядів 149Рік тому
Learn how to use the new Google Document AI procurement and lending splitter to separate and classify documents without the need to add training images.
Custom Cloud Document AI Comparisons: ABBYY Vantage, Google and Microsoft Azure
Переглядів 1,4 тис.Рік тому
Custom Cloud Document AI Comparisons: ABBYY Vantage, Google and Microsoft Azure
Info Input Transactional Module: Embed Document Capture in your Line of Business System
Переглядів 91Рік тому
Info Input Transactional Module: Embed Document Capture in your Line of Business System
Kodak Alaris Infuse Smart Connected Scanning with Base64.ai
Переглядів 121Рік тому
Kodak Alaris Infuse Smart Connected Scanning with Base64.ai
Kodak Alaris Google document AI Prebuilt Document Processing
Переглядів 484Рік тому
Kodak Alaris Google document AI Prebuilt Document Processing
I label my documents using a table field in which I specify several selectionMark columns. The model training for template models works for some columns to be selectionMark, but for others I have to keep them formatted as String, otherwise the training fails with a super generic message. Do you know by chance how to dig deeper into those error messages and whats behind the model failing with selectionMark columns? Thx
I have never attempted selection mark within columns. You can click the three vertical dots next to the field and define the sub data type for the field. Tables are tricky and very strict. I would need to see the sample documents and what youre trying to achieve to better direct you
@BrentWesler it's a very narrow table layout. table basically stretches over the whole length and width of the page. 20 or so columns, the rightmost 10 are columns meant to be selectionMark columns. maybe the squeezed layout is a reason the model keeps failing. I had a custom template model trained and in production so far on that same document layout - only without the selection marks. but now the customer wants all the information, also selection mark columns. I tried a minimal working solution with just a table field and one selectionMark column. Trained successfully. Then I added another one right next to it. Trained successfully. But then I completed some unlabeled fields in the second column with that "draw field" method and whoops. training failed again. Document intelligence gives great resukts when it works. When training fails you hardly know why
Hi Brent, that was helpful, thank you. What is that application you are using to add the advanced match type?
Kodak Info Input
you can buy it from us at piftech.com, its not an off the shelf product you need to purchase through a dealer like piftech.com
If azure ai maps a field incorrectly. Is there a feedback mechanism to train it back programmatically or do we need to upload the document to intelligence studio and train back?
No there is no human in the loop but thats why we use a IDP product like Kodak Info Input which provides a UI to interrogate the data
What's the use case of it. How to leverage it in real-time when you have mix of documents type, how it can detect from which Model it will run the OCR. I want to run beyond classification. Example I put 10 different types of document, firt it classifies and give me output from my trained model accordingly
Classification and separation happens mutually exclusive from extraction. To perform both you need to call both a prebuilt or custom extraction model plus a classification model.
Hi I have 10 document type. Each type has 5 samples. How I manage to train them. Do I need to train all them in single model or I can have different set of models for each type. How a runtime identify in which model I has to put Run Analysis, when actual document submitted?
at 8:14, there is a jump to localhost:8000/client-html it isn't clear if this is a proprietary site, or something that is bundled with azure document intelligence.
We have some Formulae in the document with a heading. Is it possible to extract just the formulae as a image using Azure Document AI?
yes, azure has a new Add-on/premuim option called formulas which will extract formulas as extracted. This is available in the general Documents prebuilt parser in azure studio and using the "Formulas"
Q1.How can i pull the results of custom model into a excel sheet or csv format. I can only see JSON format. Q2.Does it support .docx and .xlsx format? Thanks.
I had to use a programming language to do it but it does provide a sample code to go off by.
You can then take the document and export it to csv
@@KEVINSURIEL Hi Kevin, I am working on a similar problem to export the extracted data to csv format. Do you mind sharing the steps you performed to export it onto csv?
@@Shivank-op6yy do you have python coding skills by any chance if not download chatgpt and it ask you to make a python script that can pull data from document ai intelligence and export it to csv
@@Shivank-op6yy do you have any experience in python if not you can use chatgpt to help you create a python script.
How can we utilize in our code.
I saw your video on parsing the JSON fields and thought that it was very interesting. However I have a question for you. Where would I get the utility Info Input from? I googled and cannot seem to find the utility. Any help would be appreciated.
Kodak Alaris Info Input at alarisworld.com or you can purchase through me at piftech.com
Hi Brent
first of all that thank you for your content , andi choose a blob container that contain my data and a json file with this format { "location": "005.txt", "language": "en-us", "class": { "category": "Computer_science" } }, and i created a document intelligence custom classification model project with the same blob container but the data didn't appear , i can understand why , i used the documentation of microsoft learn with the same databse , can you help me with this
you need to use the same blob storage account and same container name. Happy to consult with you. bwesler@piftech.com
Hi Brent, I'm trying to get serial numbers from apple invoices but because they're structured differently from invoice to invoice, the document intelligence doesn't seem to be recognizing each serial number accordingly. Do you have any experience with this or help you could provide?
yes I can help. you need to use the Key-Value pair checkbox on the invoice prebuilt model and then find the serial number using its label or key. Happy to consult with you. bwesler@piftech.com
May I ask what application that is you are running on localhost? I would like to embed such UI (pdf viewer and labeling properties) in my application as well.
ImageTrust by Image Access or rebranded by Kodak Alaris Info Input. You can buy through piftech.com
Hi Brent, thank you for this preview! Do you know if there is a way to also process emails in *.msg format?
You would have to use Power Automate and AI Builder which can monitor an email and call the data model of choice. This way you are not consuming Azure AI Document Intelligence models but rather AI Builde rusing AI credits through your M365 tenant. Its more expensive to use AI builder than the Azure version. You can use a 3rd party IDP software like Kodak Alaris Info Input to read eml or msg emails and rasterize them to PDF
What's the best way to go about extracting table data that does run across pages or extracting table data that for some reason gets split into many different tables (essentially the same table but headers repeated for each instance of X).
We would have to look at th document and use case. The Azure Invoice prebuilt parser is the only model that will paginate through pages of following line items. We then get the JSON payload back and can use Javascript to find column labels that have repeating controls using Kodak Alaris Info Input
template model is disabled in my build mode section, what could be the possible reason?
Hi, this is a great feature. May I know if I can have Azure DI to analyze the document's type as well as do the extraction in one go?
Yeah, that's my question too. Can it tell which training model that file belongs to? So that I do not need to specifically say which training model it should be used to extract the content of the document.
@@rolandyaulingsiang7911 Exactly I am trying to find out answer. Do you have figured it out?
😞 "Promo sm"
👏🏻👏🏻👏🏻👏🏻🙏🏻
How would you analyse multiple tables across a multi page pdf file, do you create a new table for every page?
Hi Keith, I am stuck with the same problem. When I create my custom Table with the columns which I need. The Table does not sort the values. I am not sure how will it work.