I didn't like the comparison between Hive and RDBMS. Hive is for processing data and an RDBMS is for storing data. You could say Hive+HDFS to avoid confusion. Anyway, thanks for the introduction!
Yo! Thanks for the video, really insightful and concrete. 5:23 minutes of my life well spent.
Very useful
Fantastic, helped me a lot.
ty
Sir, could you please answer this? What approach should we take to load thousands of small 1 KB files using Hive: do we load them one by one, or should we merge them together and load them all at once? And how do we do this?
You can use a SequenceFile.
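If the small files are plain text with one record per line, another option is to merge them into one bigger file first and load that once. Below is only a rough sketch, not a definitive way to do it: it uses plain Python instead of the SequenceFile writer, and the function name `merge_small_files` and the file names in the demo are made up for illustration.

```python
import tempfile
from pathlib import Path


def merge_small_files(src_dir, dest_path):
    """Concatenate every regular file in src_dir into dest_path.

    Files are appended in sorted name order. A newline is added after any
    non-empty file that does not already end with one, so records from
    different files never run together. Returns the number of files merged.
    """
    dest = Path(dest_path).resolve()
    merged = 0
    with open(dest, "wb") as out:
        for path in sorted(Path(src_dir).iterdir()):
            # Skip sub-directories and the output file itself.
            if not path.is_file() or path.resolve() == dest:
                continue
            data = path.read_bytes()
            out.write(data)
            if data and not data.endswith(b"\n"):
                out.write(b"\n")
            merged += 1
    return merged


if __name__ == "__main__":
    # Hypothetical demo: three tiny files merged into one.
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "small"
        src.mkdir()
        for i in range(3):
            (src / f"part_{i}.txt").write_text(f"row{i}")
        count = merge_small_files(src, Path(tmp) / "merged.txt")
        print(count, "files merged")
```

The merged file could then be loaded in one go with something like `LOAD DATA LOCAL INPATH '/path/to/merged.txt' INTO TABLE my_table;` (path and table name are placeholders). This assumes the table is stored as a text file and every small file has the same layout.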
A commercial RDBMS machine can have tens of terabytes of RAM alone. An RDBMS can manage datasets much larger than 10 terabytes.
What does HDFS stand for?
explanation please...
Hadoop Distributed File System
But then why HBase? 😰😥