NovelVista logo

DENG 251 Building an Open Data Lakehouse Using Apache Iceberg

  • Duration: 32 hours
  • Exam Voucher: Yes
  • Language: English
  • Course Delivery : E - Learning Access
Google

4.9 Ratings on

Reviews

9000+ Professionals Enrolled

Enquire Now

Phone

Course Overview

The DENG-251: Building an Open Data Lakehouse Using Apache Iceberg course is designed to provide data engineers, architects, and analytics professionals with the skills needed to build and manage an open data lakehouse architecture leveraging Apache Iceberg. This course focuses on the core principles and best practices for implementing scalable, performant, and governable data lakehouse solutions that unify data warehousing and data lake capabilities. Participants will gain hands-on experience with Iceberg table formats, metadata management, query optimization, and integration with popular processing engines such as Apache Spark and Hive.

Course Details

  • Understand the principles of open data lakehouse architecture
  • Hands-on experience with Apache Iceberg table format and metadata handling
  • Ability to design and implement performant and scalable data platforms
  • Expertise in integrating Iceberg with processing engines like Spark and Hive
  • Skills to optimize queries, manage schema evolution, and ensure data governance
  • Preparation for real-world data engineering roles using modern lakehouse technologies
  • Open to data engineers, data architects, analytics professionals, and IT practitioners
  • Ideal for professionals involved in building or scaling data platforms and analytics solutions
  • Basic understanding of data processing, SQL, and distributed systems is recommended
  • Familiarity with Apache Spark or Hadoop ecosystem components is beneficial
  • Corporate sponsorship or group participation is encouraged
  • Master the fundamentals of open data lakehouse design and implementation
  • Learn how to implement Apache Iceberg table structures and metadata management
  • Gain expertise in integrating Iceberg with Spark, Hive, and other processing engines
  • Understand schema evolution, partitioning strategies, and performance optimization
  • Develop skills in query tuning, data governance, and operational practices
  • Achieve readiness to implement modern data lakehouse solutions in enterprise environments
  • Introduction to Data Lakehouse Architecture
  • Apache Iceberg Fundamentals and Table Format
  • Metadata Management and Table Operations
  • Integrating Iceberg with Spark & Hive
  • Schema Evolution and Partitioning Techniques
  • Query Optimization and Performance Tuning
  • Data Governance and Compliance Features
  • Handling Large-Scale Data Workloads
  • Best Practices for Lakehouse Deployment
  • Hands-On Labs and Real-World Scenarios

Looking for the best training fit for your team?

Our advisors are here to assist you.

Schedule a free consultation with our training experts to discuss your organization's needs, customize your training program, and get answers to all your questions.

What Our Corporate Clients Say

Trusted by leading organizations worldwide

James Abot

★★★★★

Much obliged to you for this course. I get know understanding and information in utilizing various types of online apparatuses which are helpful and viable. I'll utilize some of them during my exercises. Also, heaps of much obliged.

Sayali Patil

★★★★★

This was a very immersive and interesting course from NovelVista a lot of self-learning to be done on your own to really understand and put together into practice the technology into your own course and workflow.

Amit Shrivastav

★★★★★

It was truly an amazing learning session. I did have my apprehensions before signing up, but trainer made me feel so comfortable from the time we started the session till the very end of it.Thanks for this amazing experience.

Frequently Asked Questions