Get in Touch

Course Outline

  1. Big Data fundamentals
    • The role of Big Data in the corporate world
    • Development phases of a Big Data strategy within an organization
    • Rationale for a holistic approach to Big Data
    • Essential components of a Big Data Platform
    • Big Data storage solutions
    • Limits of traditional technologies
    • Overview of database types
    • The four dimensions of Big Data
  2. Impact of Big Data on business
    • Business significance of Big Data
    • Challenges in extracting valuable data
    • Integrating Big Data with traditional data sources
  3. Big Data storage technologies
    • Overview of Big Data technologies
      • Data storage models
      • Hadoop
      • Hive
      • Cassandra
      • MongoDB
    • Selecting the appropriate Big Data technology
  4. Processing Big Data
    • Connecting to and extracting data from databases
    • Transforming and preparing data for processing
    • Using Hadoop MapReduce for distributed data processing
    • Monitoring and executing Hadoop MapReduce jobs
    • Building blocks of the Hadoop Distributed File System
    • MapReduce and YARN
    • Handling streaming data with Spark
  5. Big Data analysis tools and technologies
    • Programming Hadoop with Pig Latin
    • Querying Big Data with Hive
    • Mining data with Mahout
    • Visualization and reporting tools
  6. Big Data in business
    • Managing and establishing Big Data requirements
    • Business importance of Big Data
    • Selecting the right Big Data tools for specific problems

Data Warehousing Concepts

  • What is a Data Warehouse?
  • Difference between OLTP and Data Warehousing
  • Data Acquisition
  • Data Extraction
  • Data Transformation
  • Data Loading
  • Data Marts
  • Dependent vs Independent Data Mart
  • Database design

ETL Testing Concepts:

  • Introduction
  • Software Development Life Cycle
  • Testing methodologies
  • ETL Testing Workflow Process
  • ETL Testing Responsibilities in Data Stage

Big Data Fundamentals

  • The role of Big Data in the corporate world
  • Development phases of a Big Data strategy within an organization
  • Rationale for a holistic approach to Big Data
  • Essential components of a Big Data Platform
  • Big Data storage solutions
  • Limits of traditional technologies
  • Overview of database types

NoSQL Databases

Hadoop

Map Reduce

Apache Spark

Requirements

Participants should possess an understanding of storage tools and have some experience handling large data sets.

 14 Hours

Custom Corporate Training

Training solutions designed exclusively for businesses.

  • Customized Content: We adapt the syllabus and practical exercises to the real goals and needs of your project.
  • Flexible Schedule: Dates and times adapted to your team's agenda.
  • Format: Online (live), In-company (at your offices), or Hybrid.
Investment

Price per private group, online live training, starting from 2600 € + VAT*

Contact us for an exact quote and to hear our latest promotions

Testimonials (1)

Provisional Upcoming Courses (Contact Us For More Information)

Related Categories