Talend Data Integration Basics & Advanced

Duration: 3 Days

Talend Data Integration provides an extensible, highly scalable set of tools to access, transform, and integrate data from any business system. This course will teach you from basics to advanced features of Talend Data Integration as quickly as possible. Participants can work in teams on projects shared on a remote repository to monitor Jobs and database changes. This course is designed for anyone who wants to perform data integration and management tasks.

  • Create a project
  • Ceate and run a Job that reads, converts, and writes data
  • Merge data from several sources within a Job
  • Save a schema for repeated use
  • Create and use metadata and context variables within Jobs
  • Connect to, read from, and write to a database from a Job
  • Access a web service from a Job
  • Work with master Jobs and subJobs
  • Build, export, and test run Jobs outside Studio
  • Invoke basic error-handling techniques,
  • Use best practices for Job and component naming, hints, and documentation
  • Start and connect Talend Studio to a remote repository
  • Use SVN branches in Studio
  • Run a Job in Studio on a remote Job server
  • Monitor host CPU and JVM memory in real time during Job execution
  • Use debugging features in Studio
  • Configure a Talend project to capture statistics and logs, and monitor them from Activity Monitoring Console (AMC)
  • Implement several methods of parallel execution in a Talend Job
  • Create Joblets
  • Create a unit test from a working Job
  • Configure a database to monitor and log changes in a separate change data capture (CDC) database, and use it to perform incremental updates between the source and target
  1. Getting started
    • Starting Talend Studio
    • Creating and running a Job
  2. Working with files
    • Reading an input file
    • Transforming data
    • Combining columns
    • Duplicating a Job
  3. Joining data sources
    • Creating metadata
    • Joining data sources
    • Capturing rejects
    • Correcting a lookup
  4. Filtering data
    • Filtering output data
    • Using multiple filters
  5. Using context variables
    • Understanding context variables
    • Using context variables
    • Using repository context variables
  6. Error handling
    • Detecting and handling basic errors
    • Raising a warning
  7. Generic schemas
    • Setting up sales data files
    • Creating metadata
  8. Working with databases
    • Creating database metadata
    • Creating and setting up a table
    • Finalizing a Job
  9. Creating master Jobs
    • Controlling Job execution with master Job
  10. Working with web services
  11. Running Jobs standalone
    • Building a Job
    • Modifying a Job
  12. Documenting a Job
    • Best practices for documenting a Job
  13. Remote Repository and Execution
    • Create remote connection
    • Creating and executing a Job remotely
  14. SVN in Studio
    • Copying a Job to a branch
    • Comparing Jobs
    • Resetting a branch
  15. Resource usage and basic debugging
    • Using Memory Run to view real-time resource usage
    • Debugging Jobs using Debug Run
  16. Activity Monitoring Console (AMC)
    • Configuring statistics and logging
    • Using Activity Monitoring Console (AMC)
  17. Parellel Execution
    • Writing large files
    • Writing to databases
    • Parellelizing automatically
    • Partitioning
  18. Joblets
    • Creating a Joblet
    • Triggering Joblets
  19. Creating a Unit Test
  20. Change Data Capture
    • Examining databases
    • Configuring the CDC database
    • Monitoring changes
    • Updating a warehouse
    • Resetting the databases

Register Now

Drop us your entry if you are interested to join this course.

This field is for validation purposes and should be left unchanged.