Querying 100 Billion Rows using SQL, 7 TB in a single table

  • Published 11 Jan 2025

COMMENTS • 38

  • @TheElementFive 1 year ago +8

    The first question you should always ask when working with a 100 billion row database: “Why do I have a 100 billion row database?”

    • @davidlean8674 1 year ago +6

      And the answer would be "because I work with a multinational enterprise customer". If you have a large market share in China (1 billion people), India (1 billion people), Europe (0.75 billion) and the USA (350 million people), it doesn't take long to get to 100 billion transactions.
      If you want to do financial year-on-year comparisons, you need to keep at least 24 months of data, usually 36 months.

    • @leksetengah 21 days ago

      eBay? Amazon store?
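A rough back-of-the-envelope check of the estimate in this thread, as a minimal sketch. It uses only the population figures quoted in the reply, the 36-month retention window it mentions, and the 100 billion rows and 7 TB from the title. Treating every person in those markets as a customer and reading 7 TB as decimal terabytes are assumptions made here for illustration, and the function name is hypothetical.

```python
# Population figures as quoted in the reply (rough counts of people).
MARKETS = {
    "China": 1_000_000_000,
    "India": 1_000_000_000,
    "Europe": 750_000_000,
    "USA": 350_000_000,
}
TOTAL_ROWS = 100_000_000_000   # 100 billion rows, from the title
TABLE_BYTES = 7 * 10**12       # 7 TB, assuming decimal terabytes
RETENTION_MONTHS = 36          # upper retention figure from the reply


def rows_per_person_per_month(total_rows, people, months):
    """Average number of rows each person must generate per month."""
    return total_rows / people / months


# Assumption: every person in these markets is a customer, which
# overstates the customer base and so understates the per-customer rate.
people = sum(MARKETS.values())  # 3.1 billion people
rate = rows_per_person_per_month(TOTAL_ROWS, people, RETENTION_MONTHS)
bytes_per_row = TABLE_BYTES / TOTAL_ROWS

print(f"people: {people:,}")
print(f"rows per person per month: {rate:.2f}")
print(f"average bytes per row: {bytes_per_row:.0f}")
```

Under these assumptions, fewer than one transaction per person per month is enough to fill the table within the retention window, and the average row is quite narrow.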

  • @alok5253 3 years ago +9

    Simple and concise, thank you!

  • @mathteacher5670 1 year ago

    Excellent, sir. Thank you so much. Highly motivational for a passionate person.

  • @vaibhavis1 3 years ago +4

    Thanks for the explanation. I am curious: is it just the scaling of the systems, or does BigQuery also do query optimization to reduce the latency?

  • @WanderWisdom731 2 years ago

    Wow, this experiment was a really amazing way to benchmark BigQuery.

  • @JunaidKhan-gq8nw 3 months ago +1

    Great, Thanks a lot, sir.

  • @Hrzzz1 1 year ago

    Can we download this database to do some tests?
    A nice idea for the next video would be to compare this same situation with a NoSQL database.

  • @vipulkumar7938 3 years ago +1

    Well Explained, Thanks a lot

  • @ashitoshthakur9402 3 years ago

    Wow, what a great video, sir ji. Please, sir, also make a video on SQL with ML.

  • @rajakumarkeelu9449 18 days ago

    Hi Bro, what if I apply a FULLTEXT INDEX (view) before running the query?

  • @houssem25000 7 months ago

    So I don't have to care about performance when I make projects?!

  • @merhaiakshay9625 3 years ago

    Please organize the videos into playlists. Great video, very informative and helpful, which led me to subscribe. Thanks 😊

  • @PradeepMishra-qs2hz 2 years ago

    Awesome. Keep it up.

  • @skill-learning 3 years ago

    I appreciate your effort. Could you share the link to the Google Cloud project you used?

  • @prathivenkatasaipavan9909 3 years ago

    Great explanation

  • @MDDM03 1 year ago +1

    Marketing for Google Cloud. Nothing here says what to improve.

  • @AamirKhan-vu2om 3 years ago

    Hey, very informative. I came here while searching for big data processing in seconds. I have a question: I would like to build a system where I import terabytes of data into a single table with keys, and I want to perform all the DML operations in such a way that they take very little execution time, as shown. Please help me out with how I can achieve this. I'm stuck.

    • @SK-rl3wu 19 days ago

      Hi,
      I have a similar requirement. Could you please share your analysis or solution if you find one? Thank you.

  • @ungeedh 3 years ago

    Nicely explained.

  • @Rpskmr 10 months ago

    Nice video, but while narrating it would be better to expand the screen than to show side-by-side videos.

  • @visva2005 3 years ago

    @Arpit Agrawal, good. Can you tell me what database is behind this console?

    • @elastiqai 3 years ago

      Google Cloud BigQuery 😁

  • @Mju98 10 months ago

    Hello sir. I tried to import 400k records into the BigQuery sandbox, but it ended with many errors. Is it possible to import this data? Please, can anyone help me? It's urgent (interview assignment).

  • @toxiclife 1 year ago

    What should I do when I want to overwrite 100 million rows into a new table within minutes?
    df.write.mode("overwrite").saveAsTable("FINAL") Could you please help with this?

  • @nfacundot 1 year ago

    Hello, can I connect to it from PHP?

  • @aminremiiii 2 years ago

    Please, I have been looking for this for 50 days. I want to create 2000 users in MySQL and set the phone number as both the user name and the password. Can you tell me how I can create many users with a default password?

  • @arthurrodrigues5382 2 years ago

    Amazing!

  • @Helloimtheshiieet 2 years ago

    I'm confused. Were these indexes?

    • @elastiqai 2 years ago

      BigQuery doesn't have indexes. It has partitions and clustering.
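To make the idea of partitions and clustering concrete, here is a toy in-memory model in Python. It is only a sketch of the general technique (grouping rows by a partition key and keeping each partition sorted by a cluster key), not BigQuery's actual storage engine; the class name, method names and sample rows are all made up for illustration.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


class ToyPartitionedTable:
    """Rows grouped by partition key; each partition sorted by cluster key."""

    def __init__(self):
        self._staged = defaultdict(list)  # partition key -> [(cluster key, payload)]
        self.keys = {}                    # partition key -> sorted cluster keys
        self.payloads = {}                # partition key -> payloads in the same order

    def load(self, rows):
        """Add (partition_key, cluster_key, payload) rows and re-sort each partition."""
        for part_key, cluster_key, payload in rows:
            self._staged[part_key].append((cluster_key, payload))
        for part_key, part in self._staged.items():
            part.sort(key=lambda row: row[0])  # stable sort on the cluster key
            self.keys[part_key] = [k for k, _ in part]
            self.payloads[part_key] = [p for _, p in part]

    def query(self, part_key, cluster_key):
        """Return payloads for one partition and one cluster key value."""
        # Partition pruning: every other partition is never touched.
        keys = self.keys.get(part_key, [])
        # Clustering: binary search finds the matching run of sorted keys.
        lo = bisect_left(keys, cluster_key)
        hi = bisect_right(keys, cluster_key)
        return self.payloads.get(part_key, [])[lo:hi]


table = ToyPartitionedTable()
table.load([
    ("2024-01-01", "alice", 10),
    ("2024-01-01", "bob", 20),
    ("2024-01-02", "alice", 30),
    ("2024-01-01", "alice", 40),
])
result = table.query("2024-01-01", "alice")
print(result)
```

A query that filters on both keys touches one partition and a short sorted run inside it, which is the effect an index would otherwise provide.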

  • @abhijayrajvansh 5 months ago +1

    it's always an Indian guy!

  • @himanish2006 2 years ago

    This is good...

  • @MdRakib-rc6ub 2 years ago

    I need your help

  • @muhamadridwan4766 2 years ago

    wow!

  • @sconnell194 3 years ago

    👍

  • @davidlean8674 1 year ago

    This is nice but not that impressive. Obviously, the table is being stored using columnstore compression techniques, so you only need to read the columns in the select list. Values are typically grouped in blocks of 1 million rows or more, and the block header pages keep row-count values. So you are not reading every row, just the block headers of a single column.
    If your query forced a scan of all rows in a block, requiring them to be combined with other fields in the same row or in other tables before you could filter, you would no longer be in the columnstore sweet spot, and the difference in query speed would be more striking.
    Still good though, as that is a common use case.
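The block-header argument in the comment above can be sketched with a toy column store in Python. This is a simplified model of the mechanism the comment describes, not BigQuery's actual storage format: the `Block` class, the function names and the block size of 100 (instead of the 1 million or more mentioned) are assumptions chosen to keep the example small.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """One block of a single column, plus the header metadata kept with it."""
    values: list
    row_count: int
    min_value: int
    max_value: int


def make_blocks(values, block_size):
    """Split a column into fixed-size blocks and compute each block's header."""
    blocks = []
    for start in range(0, len(values), block_size):
        chunk = values[start:start + block_size]
        blocks.append(Block(chunk, len(chunk), min(chunk), max(chunk)))
    return blocks


def count_rows(blocks):
    """COUNT(*) answered from the headers alone; no column value is read."""
    return sum(block.row_count for block in blocks)


def count_where_greater(blocks, threshold):
    """Count values > threshold. Returns (matches, values actually scanned)."""
    matched = scanned = 0
    for block in blocks:
        if block.max_value <= threshold:
            continue                    # header proves nothing matches: skip
        if block.min_value > threshold:
            matched += block.row_count  # header proves everything matches
            continue
        scanned += block.row_count      # only now are the values read
        matched += sum(1 for v in block.values if v > threshold)
    return matched, scanned


amounts = list(range(1, 1001))          # hypothetical column with 1,000 values
blocks = make_blocks(amounts, block_size=100)
total = count_rows(blocks)
matched, scanned = count_where_greater(blocks, threshold=950)
print(total, matched, scanned)
```

The row count comes entirely from the headers, and the filtered count reads only the one block whose min/max range straddles the threshold. A query that has to combine values with other fields of the same row before filtering cannot take these shortcuts, which is the point the comment makes.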