Talend Big Data Advanced – MapReduce

Who Should Attend?

Complete TALEND BIG DATA BASICS 

Using TALEND STUDIO to interact with BIG DATA SYSTEMS

}
Duration: 1 Day
  • Connect to a Hadoop cluster from a Talend Job

  • Use context variables and metadata

  • Read and write files in HDFS in a Big Data batch Job

  • Use the Twitter API with Talend components

  • Schedule Big Data Job execution from Talend Administration Center (TAC)

  • Tune memory requests to YARN

     

     

     

     

     

1. Clickstream Use Case

  • Monitoring the Hadoop cluster
  • Setting up a development environment
  • Loading data into HDFS
  • Enriching logs
  • Computing statistics
  • Converting a standard Job to a Big Data batch Job
  • Understanding MapReduce jobs
  • Using Studio to configure resource requests to YARN

2. Sentiment Analysis Use Case

  • Loading dictionary and time zone data into HDFS
  • Loading tweets into HDFS
  • Processing tweets with MapReduce
  • Scheduling Job Execution

Register Now

Drop us your entry if you are interested to join this course.

 


You may like

Using Pig, Hive, and Impala with Hadoop

Using Pig, Hive, and Impala with Hadoop

Using Pig, Hive, and Impala with HadoopDuration: 3 DaysThrough instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as: The features that Pig, Hive, and Impala offer for data acquisition,...

Visual Analytics

Visual Analytics

Visual AnalyticsDATA PROFESSIONALS involve in DATA STORYTELLINGDuration: 3 DaysIn this course, you will learn to design visualizations that effectively share information and insights with others. This course will strengthen your understanding of visual best practices,...

Time Series Analytics with RapidMiner

Time Series Analytics with RapidMiner

Time Series Analytics with RapidMinerDATA ANALYST and DATA SCIENTIST involved in Time Series DataDuration: 1 DayTime Series Analysis with RapidMiner is a course regarding the analysis and handling of time series data science techniques. It introduces basic concepts in...