Skill Up Card - Course Bundles

Save up to €4,145 per delegate.

skill up card logo - Nexus Human

Designing and Building Big Data Applications

4.6 out of 5 rating Last updated 23/07/2024   English

Jump to outline

Find out more about this course

Interested in alternative dates? Would like to book a private session of this course for your company? Or for any other queries please simply fill out the form below.

Duration

4 Days

24 CPD hours

Overview

Skills learned in this course include:Creating a data set with Kite SDKDeveloping custom Flume components for data ingestionManaging a multi-stage workflow with OozieAnalyzing data with CrunchWriting user-defined functions for Hive and ImpalaWriting user-defined functions for Hive and ImpalaIndexing data with Cloudera Search

Description

Cloudera University's four-day course for designing and building Big Data applications prepares you to analyze and solve real-world problems using Apache Hadoop and associated tools in the enterprise data hub (EDH).

IntroductionApplication Architecture
  • Scenario Explanation
  • Understanding the Development Environment
  • Identifying and Collecting Input Data
  • Selecting Tools for Data Processing and Analysis
  • Presenting Results to the Use
Defining & Using Datasets
  • Metadata Management
  • What is Apache Avro
  • Avro Schemas
  • Avro Schema Evolution
  • Selecting a File Format
  • Performance Considerations
Using the Kite SDK Data Module
  • What is the Kite SDK
  • Fundamental Data Module Concepts
  • Creating New Data Sets Using the Kite SDK
  • Loading, Accessing, and Deleting a Data Set
Importing Relational Data with Apache Sqoop
  • What is Apache Sqoop
  • Basic Imports
  • Limiting Results
  • Improving Sqoops Performance
  • Sqoop 2
Capturing Data with Apache Flume
  • What is Apache Flume
  • Basic Flume Architecture
  • Flume Sources
  • Flume Sinks
  • Flume Configuration
  • Logging Application Events to Hadoop
Developing Custom Flume Components
  • Flume Data Flow and Common Extension Points
  • Custom Flume Sources
  • Developing a Flume Pollable Source
  • Developing a Flume Event-Driven Source
  • Custom Flume Interceptors
  • Developing a Header-Modifying Flume Interceptor
  • Developing a Filtering Flume Interceptor
  • Writing Avro Objects with a Custom Flume Interceptor
Managing Workflows with Apache Oozie
  • The Need for Workflow Management
  • What is Apache Oozie
  • Defining an Oozie Workflow
  • Validation, Packaging, and Deployment
  • Running and Tracking Workflows Using the CLI
  • Hue UI for Oozie
Processing Data Pipelines with Apache Crunch
  • What is Apache Crunch
  • Understanding the Crunch Pipeline
  • Comparing Crunch to Java MapReduce
  • Working with Crunch Projects
  • Reading and Writing Data in Crunch
  • Data Collection API Functions
  • Utility Classes in the Crunch API
Working with Tables in Apache Hive
  • What is Apache Hive
  • Accessing Hive
  • Basic Query Syntax
  • Creating and Populating Hive Tables
  • How Hive Reads Data
  • Using the RegexSerDe in Hive
Developing User-Defined Functions
  • What are User-Defined Functions
  • Implementing a User-Defined Function
  • Deploying Custom Libraries in Hive
  • Registering a User-Defined Function in Hive
Executing Interactive Queries with Impala
  • What is Impala
  • Comparing Hive to Impala
  • Running Queries in Impala
  • Support for User-Defined Functions
  • Data and Metadata Management
Understanding Cloudera Search
  • What is Cloudera Search
  • Search Architecture
  • Supported Document Formats
Indexing Data with Cloudera Search
  • Collection and Schema Management
  • Morphlines
  • Indexing Data in Batch Mode
  • Indexing Data in Near Real Time
Presenting Results to Users
  • Solr Query Syntax
  • Building a Search UI with Hue
  • Accessing Impala through JDBC
  • Powering a Custom Web Application with Impala and Search
Additional course details:

Nexus Humans Designing and Building Big Data Applications training program is a workshop that presents an invigorating mix of sessions, lessons, and masterclasses meticulously crafted to propel your learning expedition forward.

This immersive bootcamp-style experience boasts interactive lectures, hands-on labs, and collaborative hackathons, all strategically designed to fortify fundamental concepts.

Guided by seasoned coaches, each session offers priceless insights and practical skills crucial for honing your expertise. Whether you're stepping into the realm of professional skills or a seasoned professional, this comprehensive course ensures you're equipped with the knowledge and prowess necessary for success.

While we feel this is the best course for the Designing and Building Big Data Applications course and one of our Top 10 we encourage you to read the course outline to make sure it is the right content for you.

Additionally, private sessions, closed classes or dedicated events are available both live online and at our training centres in Dublin and London, as well as at your offices anywhere in the UK, Ireland or across EMEA.

FAQ for the Designing and Building Big Data Applications Course

Available Delivery Options for the Designing and Building Big Data Applications training.
  • Live Instructor Led Classroom Online (Live Online)
  • Traditional Instructor Led Classroom (TILT/ILT)
  • Delivery at your offices in London or anywhere in the UK
  • Private dedicated course as works for your staff.
How many CPD hours does the Designing and Building Big Data Applications training provide?

The 4 day. Designing and Building Big Data Applications training course give you up to 24 CPD hours/structured learning hours. If you need a letter or certificate in a particular format for your association, organisation or professional body please just ask.

What is the correct audience for the Designing and Building Big Data Applications training?

This course is best suited to developers, engineers, and architects who want to use use Hadoop and related tools to solve real-world problems.

Do you provide training for the Designing and Building Big Data Applications.

Yes we provide corporate training, dedicated training and closed classes for the Designing and Building Big Data Applications. This can take place anywhere in Ireland including, Dublin, Cork, Galway, Northern Ireland or live online allowing you to have your teams from across Ireland or further afield to attend a single training event saving travel and delivery expenses.

What is the duration of the Designing and Building Big Data Applications program.

The Designing and Building Big Data Applications training takes place over 4 day(s), with each day lasting approximately 8 hours including small and lunch breaks to ensure that the delegates get the most out of the day.

Why are Nexus Human the best provider for the Designing and Building Big Data Applications?
Nexus Human are recognised as one of the best training companies as they and their trainers have won and hold many awards and titles including having previously won the Small Firms Best Trainer award, national training partner of the year for Ireland on multiple occasions, having trainers in the global top 30 instructor awards in 2012, 2019 and 2021. Nexus Human has also been nominated for the Tech Excellence awards multiple times. Learning Performance institute (LPI) external training provider sponsor 2024.
Is there a discount code for the Designing and Building Big Data Applications training.

Yes, the discount code PENPAL5 is currently available for the Designing and Building Big Data Applications training. Other discount codes may also be available but only one discount code or special offer can be used for each booking. This discount code is available for companies and individuals.

Jump to dates

Training Insurance Included!

When you organise training, we understand that there is a risk that some people may fall ill, become unavailable. To mitigate the risk we include training insurance for each delegate enrolled on our public schedule, they are welcome to sit on the same Public class within 6 months at no charge, if the case arises.

What people say about us


Top