“Automation 2.0 coming…No more boring data entry job”

  • Published on 1 Jun 2024
  • The real AI automation is coming: let GPT read invoices and enter data into Xero. A step-by-step guide, from extracting structured data from documents to sending data to Xero, HubSpot, and more.
    🤘 Get 1 month Pro plan on make.com free: www.make.com/en/register?pc=j...
    🔗 Links
    - Follow me on twitter: / jasonzhou1993
    - Join my AI email list: www.ai-jason.com/
    - My discord: / discord
    - Github link: github.com/JayZeeDesign/gpt-d...
    - Zoum’s video on extracting data from PDF: • How to Extract Text fr...
    - No code alternative: relevanceai.com/
    ⏱️ Timestamps
    0:00 Intro
    1:35 Quick demo
    2:05 Step1: PDF to Text
    6:05 Step2: LLM extract structured data
    7:55 Step3: Streamlit GUI
    10:48 Step4: Xero integration
    16:00 No code alternative
    👋🏻 About Me
    My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com
    #gpt #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #largelanguagemodels #largelanguagemodel #langchain #nocode #langflow #flowise #chatgpt #automation #aiautomation #aiautomationagency
  • Science & Technology

COMMENTS • 129

  • @USBEN.
    @USBEN. 10 months ago +26

    I think yours is the only channel that shows practical usage for GPT and automation with existing tools.
    I learn a lot here, thank you man.

    • @CodingAfterThirty
      @CodingAfterThirty 9 months ago +2

      That's a fact; this is my go-to channel for learning.

  • @Cygx
    @Cygx 10 months ago +43

    This is an incredible SaaS product on its own. Now you just need an easy-to-use frontend for the user to take pictures and export a well-defined Excel spreadsheet. Incredible work!

    • @AIJasonZ
      @AIJasonZ 10 months ago +5

      Thanks! Good idea to turn this into a micro SaaS with a simple scanning function

    • @amandamate9117
      @amandamate9117 9 months ago +7

      The bottleneck is: no company wants to send private, highly sensitive data as cleartext to OpenAI's ChatGPT to process. Not in the USA, not in Europe.

    • @jesuelm5482
      @jesuelm5482 9 months ago

      @@amandamate9117 maybe some encrypted solution

    • @antoninleroy3863
      @antoninleroy3863 9 months ago

      @@amandamate9117 Any large company could afford to run an open source LLM internally on a private network.
      EDIT: or even private Microsoft OpenAI endpoints

    • @sw4rmify
      @sw4rmify 8 months ago

      @@amandamate9117 the OpenAI API data is never used for training etc…

  • @KarlJuhl
    @KarlJuhl 10 months ago +4

    Great video Jason, you are awesome at explaining these things. I personally support doing more of these guides in core coding format like here, it is super helpful for understanding.

  • @RyckmanApps
    @RyckmanApps 9 months ago

    Your videos are pretty helpful. The way you logically explain each tool is helpful.

  • @ChefDomein
    @ChefDomein 10 months ago +23

    Hey Jason, you are the greatest teacher I have encountered! This is exactly how people need to learn to build AI apps. You're going to be very successful if you keep teaching us like this. Thank you for all the great work, man!

  • @oktyler4k
    @oktyler4k 9 months ago +2

    Thank you for all of these videos bro, please keep making them!

  • @AngusLou
    @AngusLou 9 months ago +1

    Jason is always giving amazing and practical use cases

  • @jasonfinance
    @jasonfinance 10 months ago +2

    Thank you Jason. Great work as always. Very practical use case

  • @jayhu6075
    @jayhu6075 10 months ago

    As a beginner in ML I am very glad to have found your channel. I learn a lot, and you make everything in each topic understandable. Many thanks

  • @VRDivision
    @VRDivision 10 months ago +4

    dude you're on fire! keep it up, I can't wait to apply knowledge from your videos

    • @AIJasonZ
      @AIJasonZ 10 months ago

      Thank you!!

  • @asithakoralage628
    @asithakoralage628 10 months ago +1

    Hi Jason, fantastic video, I learned a lot from your content. Please keep up the good work. Cheers

  • @Scooterboy_and_freinds109
    @Scooterboy_and_freinds109 9 months ago +1

    Fantastic simple walk-thru of e2e Business Scenario

  • @MuslimFriend23
    @MuslimFriend23 6 months ago

    man, your content is brilliant, by the way the thumbnails ROCK :)

  • @korywilson3005
    @korywilson3005 10 months ago +1

    This content is so GREAT. Thank you. Very transparent.

  • @mikepetersen5662
    @mikepetersen5662 10 months ago +1

    That is amazing. Thank you so much for this great code and tutorial!

  • @KundanKumar-xu4kd
    @KundanKumar-xu4kd 10 months ago +1

    Thank you for exposing me to Make, just signed up. Great tool, I will use it in a lot of my projects, and it will make my life a lot easier.

  • @mlg4035
    @mlg4035 10 months ago

    Freaking awesome video, Jason! So much info! Keep these videos coming!

  • @ShawnCady
    @ShawnCady 10 months ago +1

    Another great video, Jason!

  • @kevon217
    @kevon217 9 months ago +1

    Another banger tutorial, thanks!

  • @user-vk5fj5mj8s
    @user-vk5fj5mj8s 10 months ago +1

    Good Job Jason. Top content🔥

  • @epireve
    @epireve 10 months ago +1

    Incredible work as always Jason!
    P.S.: I just realised Jin Yang and you have over 90% resemblance. What a doppelgänger! Minus the hair of course

  • @lucyn.7501
    @lucyn.7501 9 months ago

    Another wonderful tutorial, thank you Jason so much ❤. In a perfect world, there should be no manual intervention: the POS machine should just talk to the bank, with AI in the middle transforming the semi/un-structured data into structured data, which then gets fed into your online banking and accounting software. Scanning is a serious pain when the number of transactions gets large, and digitalised receipts save a lot of trees and ink too 😂

  • @miltondavilaharjula
    @miltondavilaharjula 10 months ago

    Awesome tutorial !! 🎉

  • @nathan_leo
    @nathan_leo 10 months ago

    This is amazing, love all your content, thank you! Would you be able to make this video’s git public? Also, love the thumbnails 😂

  • @chetans1557
    @chetans1557 10 months ago +1

    I was here before every AI enthusiast subscribed to him
    Incredible video as always, thank you!

  • @micbab-vg2mu
    @micbab-vg2mu 10 months ago +1

    Thank you for the video.

  • @harrisongovan7623
    @harrisongovan7623 10 months ago +1

    Brother, you’re amazing

  • @aliphian
    @aliphian 10 months ago

    Great channel!

  • @_derive
    @_derive 9 months ago +1

    Thanks!

  • @AkulSamartha
    @AkulSamartha 9 months ago

    You are a genius bro. 👏

  • @enceladus96
    @enceladus96 9 months ago +1

    Exactly what I’m looking for 😭

  • @dhaw
    @dhaw 10 months ago +1

    This is Amazing !

  • @rafael_tg
    @rafael_tg 9 months ago

    Very nice video. Have you tried using function calling in GPT instead of asking it to return a JSON string?

  • @faridmohdismail31
    @faridmohdismail31 10 months ago

    I was thinking of using this just to extract text from PDFs, to see whether it's better than LangChain for embedding. I guess your example is good for forms and invoices, but for an instructional document or a PDF of a Wikipedia page, Tesseract doesn't handle some data that well.
    But it's still a very good guide. Thanks for sharing.

  • @readmarketings9061
    @readmarketings9061 10 months ago +2

    waiting for this

  • @alessandroceccarelli6889
    @alessandroceccarelli6889 7 months ago

    Best LLM content on the web!
    Why OCR instead of native PDF text retrieval though? Don’t you risk introducing OCR-related mistakes?
    I mean, you already have the “real” text! Thank you

  • @user-jt3jc7fk2x
    @user-jt3jc7fk2x 10 months ago

    Seriously, this is a great video for educational purposes, and I have two specific questions: 1) Have you got access to the GPT-4 API? 2) This is great educational content; have you ever thought about productizing an idea such as this one? Filing tax returns seems to be in high demand for a lot of people.

  • @DePhpBug
    @DePhpBug 7 months ago

    I like the approach here, as I need to do a lot of admin work as well.
    I was wondering, is there a way to protect your data? A bit concerned about data privacy!! T.T

  • @autoboto
    @autoboto 10 months ago

    Surprised you could not access the PDF object model to get the text from the pages. But yes, Tesseract does work well

  • @umeshtiwari9249
    @umeshtiwari9249 9 months ago

    Believe me, you show fantastic AI use cases for handling business processes, which anyone can use to get a job in AI. It would be great if you could do more AI use cases. It would be really helpful to me and many others. Thanks a lot. 😃

  • @learningstuff5679
    @learningstuff5679 2 months ago

    Awesome. Jason do you offer 1-on-1 consulting?

  • @HarshVerma-xs6ux
    @HarshVerma-xs6ux 9 months ago +1

    Hey Jason, your content is really amazing. Thanks for creating AI related content. I wanted to ask if there's any advantage of saving the image in jpeg format before extracting text because if there's no actual advantage the same can be done with just 3 lines of code which also makes the process faster.
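    import pypdfium2 as pdfium
    from pytesseract import image_to_string

    def parse_pdf(file_path, scale=300/72):
        # Render each page straight to a PIL image (no JPEG round trip), then OCR it.
        # The render() call is kept as the commenter wrote it; it depends on the pypdfium2 version.
        pdf_file = pdfium.PdfDocument(file_path)
        renderer = pdf_file.render(
            pdfium.PdfBitmap.to_pil,
            scale=scale,
        )
        return "\n".join(image_to_string(img) for img in renderer)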

    • @gonorrex_571
      @gonorrex_571 8 months ago

      Hey, you seem to understand the field. Looking to launch this idea into the market? Sales guy here looking for a tech cofounder. Cheers!

  • @user-iy9fr5td2f
    @user-iy9fr5td2f 9 months ago

    Why did you use a simple LangChain prompt template instead of using OpenAI’s function API to get the structured data?

  • @temirzhanyussupov6997
    @temirzhanyussupov6997 10 months ago

    Would not function calling be more appropriate for formatting invoice data into a JSON format you need?

  • @avi7278
    @avi7278 3 months ago

    when Jason drops a video I can't click fast enough

  • @khirtah
    @khirtah 9 months ago +1

    This is as great as you.

  • @ryancoble-neal6186
    @ryancoble-neal6186 10 months ago +5

    Hi Jason, when I try to run your code I encounter the following error: PdfiumError: Failed to load document (PDFium: File access error). Do you know what might be causing this and how to rectify it? Thanks

    • @krasimirivanov6627
      @krasimirivanov6627 10 months ago +1

      +1 I am facing the same error. Appreciate if someone has an advice on how to solve it

    • @albertalbert5785
      @albertalbert5785 10 months ago +1

      i also have the same error :/, someone help pls

    • @Abubakr844
      @Abubakr844 9 months ago +1

      facing same error

    • @bibinbalakrishnan
      @bibinbalakrishnan 9 months ago +1

      The NamedTemporaryFile is getting deleted. You can change it like this: with NamedTemporaryFile(suffix='.pdf', delete=False) as f:

    • @kenhtinhthuc
      @kenhtinhthuc 9 months ago

      @@bibinbalakrishnan Thanks. It worked for me.
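For anyone hitting the same PdfiumError, the fix suggested in this thread can be sketched roughly as follows. This is a minimal illustration using only the standard library; `save_upload` and the placeholder bytes are hypothetical names and data, not taken from the video's repository.

```python
import os
from tempfile import NamedTemporaryFile


def save_upload(data: bytes, suffix: str = ".pdf") -> str:
    """Write uploaded bytes to a temp file that survives the `with` block."""
    # delete=False keeps the file on disk after it is closed, so a PDF
    # library can reopen it by path afterwards.
    with NamedTemporaryFile(suffix=suffix, delete=False) as f:
        f.write(data)
        return f.name


# Placeholder bytes standing in for a real uploaded PDF.
path = save_upload(b"%PDF-1.4 placeholder bytes")
try:
    # The file is closed but still present, so it can be opened by path here.
    with open(path, "rb") as fh:
        header = fh.read(4)
finally:
    os.remove(path)  # with delete=False, cleanup is the caller's job
```

With `delete=False` nothing removes the file automatically, so the caller has to delete it once parsing is done.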

  • @lukaszl9542
    @lukaszl9542 9 months ago

    And are those language model libraries available in Python? You said you would explain it later in the video, but I think you didn't

  • @adolphododo
    @adolphododo 10 months ago

    If the PDF has many pages (for example, a contract), do I need to go through the process of splitting it into smaller chunks, or can I simply insert any PDF, regardless of the text size?

    • @AIJasonZ
      @AIJasonZ 10 months ago

      The function automatically splits them into pages!

  • @senxo.visuals
    @senxo.visuals 10 months ago +4

    I'm curious how's the accuracy of pytesseract. I did the exact same project a long time ago (it's in production up to this date) and we used Google Vision API to perform OCR. The biggest issue is that although the accuracy is at idk like 99.9% it's still at least one wrong character recognized in each invoice! And since there's a lot of numeric data (prices, VAT values, amounts, different units of measures) writing validation for this all took more time than the rest of the project. You never actually knew what the OCR will return and you REALLY don't want to put the wrong data for accounting.

    • @senxo.visuals
      @senxo.visuals 10 months ago

      And actually here's the thing, in the video the Transaction ID wasn't recognized 100% correctly

    • @TheParagamer
      @TheParagamer 10 months ago

      @@senxo.visuals You're right, it's missing an extra W @5:46, eagle eyes🦅! I suppose you could feed this output to another LLM that checks whether the number sequences match those from another run, repeating until you reach whatever accuracy you want. It wouldn't ever be perfect tho, and the cost would add up quickly💸

    • @andrewxzvxcud2
      @andrewxzvxcud2 10 months ago

      yh thats one of the problems w all these ai apps, problems where u need to be 100% accurate or there could be big consequences is hard to actually solve with ai

    • @AIJasonZ
      @AIJasonZ 10 months ago

      @@TheParagamer ohh, having 2 OCR services do the text extraction and an LLM validate the result, this is 🧠

    • @AIJasonZ
      @AIJasonZ 10 months ago +1

      @@senxo.visuals ahh good catch! I really like @TheParagamer's idea of having 2 services to validate the result, will give it a try
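The two-service idea from this thread could look something like the sketch below: run OCR twice (with different engines or settings), then flag every digit-bearing token on which the runs disagree so a human or a second check can review it. The function names and the sample strings are made up for illustration, and no real OCR service is called.

```python
import re


def numeric_tokens(text: str) -> list[str]:
    """Return tokens that contain at least one digit (IDs, prices, dates)."""
    return [tok for tok in re.findall(r"\S+", text) if any(c.isdigit() for c in tok)]


def disagreements(ocr_a: str, ocr_b: str) -> list[tuple[str, str]]:
    """Pair up digit-bearing tokens from two OCR runs and return the mismatches."""
    a, b = numeric_tokens(ocr_a), numeric_tokens(ocr_b)
    if len(a) != len(b):
        # Token counts differ, so positional pairing is unreliable; flag everything.
        return [(" ".join(a), " ".join(b))]
    return [(x, y) for x, y in zip(a, b) if x != y]


# Invented sample outputs from two hypothetical OCR runs over the same page.
run_a = "Transaction ID 7WW4921 Total 118.00"
run_b = "Transaction ID 7W4921 Total 118.00"
flagged = disagreements(run_a, run_b)
```

Agreement between two runs does not prove the value is right, since both engines can make the same mistake; it only narrows down where to look.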

  • @staceyjo1752
    @staceyjo1752 6 months ago

    When the invoice has a subtotal with an indented item, it gets read as a duplicate item (as pytesseract doesn't recognize the indent), and therefore the total doesn't match the invoice total... Do you have any suggestions for this kind of error?
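One possible guard for this kind of error, sketched under the assumption that the extraction step yields (description, amount) pairs plus the stated invoice total: reconcile the sum of the line items against the total and hold back any invoice that does not add up. The names and sample data are hypothetical, and dropping exact repeats is only a heuristic, because a genuine invoice can contain two identical rows.

```python
from decimal import Decimal


def totals_match(line_amounts, stated_total, tolerance="0.01"):
    """True when the line items add up to the stated total within a tolerance."""
    computed = sum((Decimal(a) for a in line_amounts), Decimal("0"))
    return abs(computed - Decimal(stated_total)) <= Decimal(tolerance)


# Invented extraction result in which one row was read twice.
items = [("Design work", "100.00"), ("Design work", "100.00"), ("Hosting", "20.00")]
stated_total = "120.00"

needs_review = not totals_match([amt for _, amt in items], stated_total)

# Heuristic retry: drop exact repeats (order preserved) and reconcile again.
deduped = list(dict.fromkeys(items))
reconciled = totals_match([amt for _, amt in deduped], stated_total)
```

Using `Decimal` rather than floats avoids rounding noise when comparing currency amounts.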

  • @digital4smallbusiness
    @digital4smallbusiness 7 months ago

    Hey Jason, this is great! But can you use Llama 2 to achieve the same?

  • @salamina_
    @salamina_ 2 months ago

    great content! thank you for taking the time to put together and share!

  • @JalanTrips
    @JalanTrips 3 months ago

    With function calling, is it more convenient for LLMs to extract structured data?

  • @jamesxprosper
    @jamesxprosper 8 months ago

    I'm getting an error that says Import "dotenv" could not be resolved Pylance (reportMissingImports) [Ln 4, Col 6]. What am I doing wrong?

  • @tapos999
    @tapos999 10 months ago

    Not clear yet: what is the difference in output between pypdf/LangChain PDF loading and pdf->img->text? Does the latter keep some of the structure of the info in a certain way, or what's good/bad about these 2 approaches?

    • @AIJasonZ
      @AIJasonZ 10 months ago

      When I tried pypdf/LangChain unstructured file upload, it only extracted about 10~20% of the text from the image, so it was almost unusable
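A common middle ground between the two approaches discussed here is to try the PDF's embedded text layer first and fall back to OCR only when little or nothing comes back, as happens with scanned or image-only files. The sketch below shows just that decision logic with stub extractors; in practice `native` might wrap something like pypdf's `extract_text()` and `ocr` the pdf->img->text route from the video. The function names and the 200-character threshold are assumptions for illustration.

```python
from typing import Callable, Tuple


def extract_text(
    path: str,
    native: Callable[[str], str],
    ocr: Callable[[str], str],
    min_chars: int = 200,
) -> Tuple[str, str]:
    """Return (text, source), preferring the PDF's embedded text layer."""
    text = native(path)
    if len(text.strip()) >= min_chars:
        return text, "native"
    # Too little embedded text: likely a scanned or image-only PDF.
    return ocr(path), "ocr"


# Stubs standing in for real extractors so the sketch runs without extra packages.
def fake_native(path: str) -> str:
    return "  "  # what an image-only PDF tends to yield


def fake_ocr(path: str) -> str:
    return "Invoice 1001 Total 118.00"


text, source = extract_text("scan.pdf", fake_native, fake_ocr)
```

When the embedded text is present it is exact, so this ordering avoids OCR misreads on digitally generated invoices while still handling photographed receipts.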

  • @ivanlee7450
    @ivanlee7450 9 months ago

    Can you do an assist filling form using langchain tutorial?

  • @ayusharora2019
    @ayusharora2019 10 months ago

    tons of companies have been doing this with OCR. I don't know what are you saying!!

  • @sanatanidharmaraj
    @sanatanidharmaraj 9 months ago

    But ocr is built already right? why cant we directly use that

  • @hazema.6150
    @hazema.6150 10 months ago +2

    One of the key takeaways from this amazing tutorial is: AI by itself will not replace you; rather, the one who uses AI effectively is the one who will, insha’Allah (God willing). So go learn how to use AI in your day-to-day job now and impress your employers with your ideas.
    Great tutorial Jason.

  • @Ascended23
    @Ascended23 10 months ago

    Given the thumbnail I have to ask... when do we get the Hot Dog or Not Hot Dog App?

  • @photon2724
    @photon2724 10 months ago

    Hi Jason! thanks for the great video. looks like your github link is broken. would love an updated link to access the code!

    • @AIJasonZ
      @AIJasonZ 10 months ago +2

      Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

  • @jreamer0
    @jreamer0 7 months ago

    how do I get the file_url to be passed from make to relevance?

  • @user-jt3jc7fk2x
    @user-jt3jc7fk2x 10 months ago +1

    AI Jason is a must watch, now I wanna make a copycat of him on Chinese web, what about NewAI Jason for my channel 👨🏿‍🔧👨🏿‍🔧👨🏿‍🔧

  • @MichaelHoughton_
    @MichaelHoughton_ 10 months ago +1

    AWS has a really good system to extract data from a document and it costs $1.50 per 1000 pages... so it's super efficient

    • @AIJasonZ
      @AIJasonZ 10 months ago

      Oh nice, didn't know that, will give it a try! What's the name of the service?

  • @DIY_Foodie
    @DIY_Foodie 9 months ago

    please attach link to medium article

  • @Supasweet95
    @Supasweet95 9 months ago

    What about safety concerns regarding data? Any way to overcome this? Good video.

  • @markdin2988
    @markdin2988 3 months ago

    How does GPT4 vision affect this ? better or worse?

  • @MK-jn9uu
    @MK-jn9uu 9 months ago +1

    🤬 why am I having so much trouble importing? What am I missing?

  • @quantixxx
    @quantixxx 5 months ago

    Nice one, man!

  • @carkawalakhatulistiwa
    @carkawalakhatulistiwa 10 months ago +2

    all repetitive work using computers can be automated within 2 years by ai.

  • @niharikasingh2541
    @niharikasingh2541 9 months ago

    Why are we converting the PDF to an image? Instead you can use any Python library to get text from a PDF

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 10 months ago

    would using OpenAI's function calling be useful here?

    • @AIJasonZ
      @AIJasonZ 10 months ago

      You can try function calling for data extraction for sure! But you still need a way to turn the PDF into clean text first
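Several commenters ask about function calling, so here is a rough sketch of what that route involves, assuming the shape OpenAI documents for it: a function description whose `parameters` field is a JSON Schema, and a model reply that carries the arguments as a JSON string. The function name and field names below are invented for illustration, and the example makes no API call; it only shows the schema and the parsing step on a simulated reply.

```python
import json

# JSON Schema describing the fields we want back. The field names are
# hypothetical; they are not taken from the video's code.
INVOICE_FUNCTION = {
    "name": "record_invoice",
    "description": "Record the key fields of one invoice.",
    "parameters": {
        "type": "object",
        "properties": {
            "invoice_number": {"type": "string"},
            "invoice_date": {"type": "string"},
            "total": {"type": "number"},
        },
        "required": ["invoice_number", "total"],
    },
}


def parse_arguments(arguments_json: str, function_spec: dict) -> dict:
    """Parse the model's JSON-string arguments and check required keys."""
    data = json.loads(arguments_json)
    required = function_spec["parameters"]["required"]
    missing = [key for key in required if key not in data]
    if missing:
        raise ValueError(f"missing required fields: {missing}")
    return data


# Stand-in for the arguments string a model would return; no API call is made.
simulated = '{"invoice_number": "INV-001", "total": 120.5}'
invoice = parse_arguments(simulated, INVOICE_FUNCTION)
```

Checking required keys after parsing is still worthwhile, since the returned arguments are model output and may not always conform to the schema.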

  • @howtowithtt
    @howtowithtt 10 months ago

    Hey everyone, I'm pretty new to all of this. I'm the type to just dive in and do. I keep getting this error after I pip install anything: "is not recognized as the name of a cmdlet, function, script file, or operable program." Any help?

  • @cjbobby
    @cjbobby 10 months ago +2

    The GitHub link seems to be broken. Could you repost the link pls? :)

    • @jmanhype1
      @jmanhype1 10 months ago

      He took it down; looks like he will be turning it into a microservice

    • @AIJasonZ
      @AIJasonZ 10 months ago

      Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

    • @AIJasonZ
      @AIJasonZ 10 months ago

      @@jmanhype1 Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

  • @Nurof3n_
    @Nurof3n_ 10 months ago +1

    hey, github link doesn't work :(

    • @AIJasonZ
      @AIJasonZ 10 months ago +1

      Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

    • @Nurof3n_
      @Nurof3n_ 10 months ago

      @@AIJasonZ Thanks!

  • @user-tc9bo7zq1b
    @user-tc9bo7zq1b 10 months ago

    Vid content aside, your voice really sounds a lot like Jimmy O. Yang's, hahahaha

  • @ammadali5799
    @ammadali5799 10 months ago +1

    This is nice. maybe deploying these models on MS Azure so we can have their API?
    and for the next video try making a simple streamlit app with that API
    Really appreciate the work you are doing. Thank you very much

  • @rverm1000
    @rverm1000 9 months ago +1

    wow the python coding tutorials keep getting more and more complicated lately thats good.

  • @juancasas5532
    @juancasas5532 9 months ago

    Jason for president 2024

  • @saadkassim9729
    @saadkassim9729 1 month ago

    Can you do the all SLOWLY.. Again I COULDN'T FOLLOW YOU 😮😮😮😮

  • @iamseth5253
    @iamseth5253 9 months ago

    Each time he says pdffiles 👀

  • @napent
    @napent 9 months ago

    Use new Microsoft office features xs

  • @MrBou.
    @MrBou. 8 months ago

    im a marketer, i just don't understand the whole coding part, it's like chinese for me.

  • @gonorrex_571
    @gonorrex_571 8 months ago +1

    Anyone with tech background wanna work on this? I'm looking to launch a SaaS company and I have more than 10 years in Sales working on B2B Finance. Reply here and I will get in touch!

  • @amandamate9117
    @amandamate9117 9 months ago

    The bottleneck is: no company wants to send private, highly sensitive data as cleartext to OpenAI's ChatGPT to process. Not in the USA, not in Europe.

    • @AIJasonZ
      @AIJasonZ 9 months ago +4

      Yea, you are right; I'm making a new video soon about how companies can handle data privacy, so hopefully it can address that :) But in general, hosting on a private cloud or using an open-source LLM should solve that

    • @krasimirivanov6627
      @krasimirivanov6627 9 months ago

      Looking forward to this video!

  • @udaynj
    @udaynj 7 months ago

    What you call a boring data entry job feeds millions of families around the world where the bread "earner" has no better skills. I find the attitude of CS and esp. AI folks distasteful. You guys are so flippant about the destruction of families and communities caused by AI taking over jobs. There will be a day of reckoning, I am afraid, when the world turns against CS folks. Please watch your language and leave the commentary out....

  • @kickingnscreaming
    @kickingnscreaming 9 months ago +2

    Thanks!