Understanding Data Lakehouse
Ғылым және технология
To use a Data Lakehouse you need to understand what it is and the core concepts or you will only be fumbling your way through. This video is the most important in the Data Lakehouse series because it lays the foundation for the others.
Join my Patreon Community and Watch this Video without Ads!
www.patreon.com/bePatron?u=63...
Slides at:
github.com/bcafferky/shared/b...
See my Pre Data Lakehouse training series at:
• Master Databricks and ...
Пікірлер: 19
superb presentation !! anyone can master technical tools but few can master the vision - which is the foundation. thank you !!
Stayed till the end - I’ve got a new dream now. 😁👍 Thx for the clear instruction and breakdowns - priceless.
This is a great introduction video!
@BryanCafferky
Жыл бұрын
Thank You!
This is great, thank you!
@BryanCafferky
Жыл бұрын
You're welcome!
Hello, thank you for these videos. I’m try to learn azure and databricks for a possible job. I‘be been interested in data analytics so I’m also taking a crash course in SQL , Python and Excel. Which of you videos should I start with?
Finally understand
Fantastic
@BryanCafferky
Жыл бұрын
Thank You!
Hey Bryan. Would you say that a Lakehouse needs to have storage separated from processing?
@BryanCafferky
Жыл бұрын
Yes. In general, at least. With cloud, that's implicit. On Prem, different story I guess.
Thank you for your greate videos. I have a question. Why is it better than data lake? Is it because of acid and schema management?
@BryanCafferky
3 ай бұрын
Believe it or not, Data Lakes are read only. They are for flat files so table like functionality was not possible. Lakehouse modifies parquet file support to enable add/change/delete. My video on Delta Logs explains how this works. kzread.info/dash/bejne/ond8wdOHodHTo5M.html
Can we say that data lakehouse is kind of a data storage architecture focused on catering data hungry AI systems as well as business analytics?
@BryanCafferky
9 ай бұрын
I thinks that's a bit too specific. I would say Lakehouse is trying to add relational database data warehouse type functionality (CRUD operations) so tables can be maintained. Prior to this, parquet files only support append so very limited in functionality.
How can i contact you?
"Im not talking about the 60s and the things u did in college" 🤣
"Show me the code! I don't want to understand anything! ' Lol!