Talend Big Data Advanced – MapReduce

Who Should Attend?

  • Anyone who has completed Talend Big Data Basics
  • Anyone using Talend Studio to interact with Big Data systems

Duration: 1 Day

Course Objectives

  • Connect to a Hadoop cluster from a Talend Job
  • Use context variables and metadata
  • Read and write files in HDFS in a Big Data batch Job
  • Use the Twitter API with Talend components
  • Schedule Big Data Job execution from Talend Administration Center (TAC)
  • Tune memory requests to YARN

1. Clickstream Use Case

  • Monitoring the Hadoop cluster
  • Setting up a development environment
  • Loading data into HDFS
  • Enriching logs
  • Computing statistics
  • Converting a standard Job to a Big Data batch Job
  • Understanding MapReduce jobs
  • Using Studio to configure resource requests to YARN

2. Sentiment Analysis Use Case

  • Loading dictionary and time zone data into HDFS
  • Loading tweets into HDFS
  • Processing tweets with MapReduce
  • Scheduling Job execution

Register Now

Send us your details if you are interested in joining this course.