Querying 100 Billion Rows using SQL, 7 TB in a single table

Поділитися
Вставка
  • Опубліковано 11 гру 2024

КОМЕНТАРІ • 35

  • @vaibhavis1
    @vaibhavis1 2 роки тому +3

    Thanks for the explaination. I am curious that is it just scaling of the systems, or BigQuery does query optimization to reduce the latency as well?

  • @TheElementFive
    @TheElementFive Рік тому +5

    The first question you should always ask when working with a 100 billion row database: “Why do I have a 100 billion row database?”

    • @davidlean8674
      @davidlean8674 Рік тому +6

      And the answer would be "because I work with a multinational enterprise customer". If you have a large market share in China (1 bill people) , India (1 Bill people), Europe 0.75 Bill, USA (350M people) it doesn't take long to get to 100 BIllion transactions.
      If you want to do Financial Year on Year comparisons, you need to keep at least 24 months of data, usually 36 months. .

  • @alok5253
    @alok5253 3 роки тому +9

    Simple and concise, thank you!

  • @JunaidKhan-gq8nw
    @JunaidKhan-gq8nw 2 місяці тому +1

    Great, Thanks a lot, sir.

  • @Hrzzz1
    @Hrzzz1 Рік тому

    we can download this database to do some testes ?
    I nice ideal for next video is compare this same situation with noSQL database.

  • @mathteacher5670
    @mathteacher5670 Рік тому

    excellent sir thank you so much highly motivational for passionate person

  • @WanderWisdom731
    @WanderWisdom731 2 роки тому

    Wow.. this experiment was really amazing to benchmark the bigquery .

  • @AamirKhan-vu2om
    @AamirKhan-vu2om 2 роки тому

    Heyy, very informative. I came here around searching for big data processing in seconds. Ive a question, I would like to build a system where I import terabytes of data into single table with keys and I want to perform all the DML operatiom in such a way it should take very less execution time as shown. Please help me out, how I can acheive. Im stuck.

  • @houssem25000
    @houssem25000 6 місяців тому

    So I don't have to carry about performance when I make projects ?!

  • @ashitoshthakur9402
    @ashitoshthakur9402 3 роки тому

    Wow what a gr8 video sir ji..pls sir make video on sql with ml and sql also..

  • @skill-learning
    @skill-learning 3 роки тому

    I appreciate your effort. Could you put the used link for the google cloud project?

  • @vipulkumar7938
    @vipulkumar7938 3 роки тому +1

    Well Explained, Thanks a lot

  • @PradeepMishra-qs2hz
    @PradeepMishra-qs2hz 2 роки тому

    Awesome . Keep it up.

  • @abhijayrajvansh
    @abhijayrajvansh 4 місяці тому +1

    it's always an Indian guy!

  • @merhaiakshay9625
    @merhaiakshay9625 3 роки тому

    Please organize the videos and make playlists , great video , very informative and helpful, which led me to subscribe , thanks 😊

  • @Mju98
    @Mju98 9 місяців тому

    Hello sir. I tried to import 400k data into big query sandbox. But ended with more errors. Is this possible to import those data. Pls anyone help me it's urgent ( interview assignment)

  • @MDDM03
    @MDDM03 Рік тому +1

    marketer of google cloud.. nothing states what to improve

  • @visva2005
    @visva2005 3 роки тому

    @Arpit Agrawal, Good. Let me know what database is behind this Console?

    • @elastiqai
      @elastiqai  3 роки тому

      Google Cloud Bigquery 😁

  • @Rpskmr
    @Rpskmr 9 місяців тому

    Nice video but while voicing better to expand the screen than side by side videos

  • @toxiclife
    @toxiclife Рік тому

    what to do when I want to overwrite 100 millions of rows into new table, in minutes?
    df.write.mode("overwrite").saveAsTable("FINAL"), if you could please help with this?

  • @prathivenkatasaipavan9909
    @prathivenkatasaipavan9909 3 роки тому

    Great explanation

  • @nfacundot
    @nfacundot Рік тому

    Hello, can I connect it on php?

  • @aminremiiii
    @aminremiiii 2 роки тому

    Please for 50 days I am looking for this i wanna to create 2000 users in mysql and set the phone number as user name and password my be say me how can i create most users with default password? That's

  • @ungeedh
    @ungeedh 3 роки тому

    Nicely explained.

  • @Helloimtheshiieet
    @Helloimtheshiieet 2 роки тому

    Im confused were these indexes?

    • @elastiqai
      @elastiqai  Рік тому

      BigQuery doesn't have indexes. It has partitions and clustering.

  • @arthurrodrigues5382
    @arthurrodrigues5382 2 роки тому

    Amazing!

  • @MdRakib-rc6ub
    @MdRakib-rc6ub 2 роки тому

    I need your help

  • @himanish2006
    @himanish2006 2 роки тому

    This is good...

  • @davidlean8674
    @davidlean8674 Рік тому

    This is nice but not that impressive. Obviously, the table is being stored using Columnstore Compression techniques. So you only need to query the columns in the select list. And they are typically grouped in blocks of 1 M or more. These header pages keep rowcount values. So you are not reading every row. Just the block headers of a single column.
    If your query forced the scan of all rows in the "block" asking it to be combined with other fields in the same row or in other tables before you could filter it. You will no longer be in the columnstore sweet spot. and the difference in query speed would be more striking.
    Still good thou, as that is a common use case.

  • @muhamadridwan4766
    @muhamadridwan4766 2 роки тому

    wow!

  • @sconnell194
    @sconnell194 3 роки тому

    👍