Duration
3 Days
Level
Intermediate Level
Talend is a leader in cloud and big data integration software. It helps organizations become data-driven by making data more accessible, improving its quality, and moving it quickly to where it is needed for real-time decision-making.
It is the first and only open source company in the Leaders quadrant of the Gartner report, with a unified platform covering big data and cloud integration as well as batch and real-time data integration.
Talend Open Studio for Data Integration is an open source ETL tool: businesses can use it to extract, transform, and load their data into various data sinks, such as relational databases.
The program focuses on enhancing data handling and integration capabilities. Participants learn to create ETL jobs that connect to almost any data source; filter, modify, and combine data; build standalone jobs that run on a schedule or in response to an event; and make jobs more user-friendly for non-technical users.
Upon completion of this course, you should be able to:
- Explain what Talend offers as an ETL tool
- Compare Talend with other key ETL tools
- Find your way around Talend Studio
- Work with Talend Open Studio
- Perform data integration across a wide range of data sources
- Apply best practices and troubleshoot jobs
- Perform CI/CD with Talend
This course is intended for:
- Developers
- Big Data Developers
- Data Engineers
- Integration Engineers
- Architects
- Data Stewards
- Introduction To Talend
- Why Talend?
- Talend vs. Other Tools
- Logical Architecture
- More On Data Integration Aspects
- Talend Open Studio Walkthrough
- Key Components In Palette
- Installation
- Conclusion
- Introduction
- Working with Built-In Schema
- What are Generic Schemas?
- Advantages of Generic Schemas
- Generating Generic Schema
- More on Working with Schemas
- Introduction
- Enabling And Disabling Reject Flows
- Validating Against The Schema
- Rejecting Rows Using tMap
- Checking A Column Against A List Of Allowed Values
- Checking A Column Against A Lookup
- Creating Validation Rules For More Complex Requirements
- Introduction
- Mapping Data
- Working with tMap Expressions
- Filtering Input Data
- Joining Data Using tMap
- Introduction
- Working with A Context Group
- Linking Context Group with Job
- Adding Contexts To A Context Group
- Integrating Context with OS Variables
- Introduction
- Setup Connection
- Importing Schemas
- Reading Tables
- Passing Dynamic Parameters to a Query
- Debugging and Printing Your Input Query
- Storing Results in a Database Table
- Dealing with Database Sessions
- Working with Child Job
- Error Handling
- Introduction
- Consuming CSV data
- Consuming XML data
- Writing XML data
- Working with Nested data
- Invoking Web Services
- Reading And Writing To A Queue
- Introduction
- How to Troubleshoot
- Debugging errors
- Logging
- Building Test Cases
- Generating Test Data
- Working With Git And Talend
- How to Perform CI/CD with Jenkins and Talend
- Job Monitoring Using Resource Manager UI
- Unit Testing
- Best Practices
- Joblets
- Parallelization
- Reusing Jobs (Child Jobs)
- Context Variables
- Using Repository
Prerequisites:
- Basic understanding of databases and data structures.
- Familiarity with ETL concepts (Extract, Transform, Load).
- Knowledge of basic programming concepts.