Infosys Pentaho Recently Asked Interview Questions

What Is Data Staging?

Data staging is actually a group of procedures used to prepare source system data for loading a data warehouse.

Data Staging Is Actually A Group Of Procedures Used To Prepare Source System Data For Loading A Data Warehouse.?

Full Load means completely erasing the insides of one or more tables and filling with fresh data.
Incremental Load means applying ongoing changes to one or more tables based on a predefined schedule.

Define Mapping?

Dataflow from source to target is called as mapping.

Explain Session?

It is a set of instruction which tell when and how to move data from respective source to target.
Infosys Pentaho Recently Asked Interview Questions Answers
Infosys Pentaho Recently Asked Interview Questions Answers

What Is Workflow?

It is a set of instruction which tell the infomatica server how to execute the task.

Define Mapplet?

It creates and configure the set of transformation.

What Do You Understand By The Term Pentaho Dashboard?

Pentaho Dashboards give business users the critical information they need to understand and improve organizational performance.

What Is The Use Of Pentaho Reporting?

Pentaho Reporting allows organizations to easily access, format and deliver information to employees, customers and partners.

Define Pentaho Schema Workbench?

Pentaho Schema Workbench offers a graphical edge for designing OLAP cubes for Pentaho Analysis.

Define Pentho Data Mining?

Pentaho Data Mining used the Waikato Environment for Information Analysis to search data for patterns. It have functions for data processing, regression analysis, classification methods, etc.

Brief About Pentaho Report Designer?

It is a visual, banded report writer. It has various features lilke using subreports, charts and graphs etc.

What Do You Un Derstand By The Term Etl?

It is an entri level tool for data manipulation.

What Is The Importance Of Metadata In Pentaho?

A metadata model in Pentaho formulates the physical structure of your database into a logical business model. These mappings are stored in a central repository and allow developers and administrators to build business-logical DB tables that are cost effective and optimized. It further simplifies the working of business users allowing them to create formatted reports and dashboards ensuring security to data access.

All in all, metadata model provides an encapsulation around the physical definitions of your database and the logical representation and define relationships between them.

Define Pentaho Reporting Evaluation.?

Pentaho Reporting Evaluation is a particular package of a subset of the Pentaho Reporting capabilities, designed for typical first-phase evaluation activities such as accessing sample data, creating and editing reports, and viewing and interacting with reports.

How To Perform Database Join With Pdi (pentaho Data Integration)?

PDI supports joining of two tables form the same databse using a ‘Table Input’ method, performing the join in SQL only.

On the other hand, for joining two tables in different databases, users implement ‘Database Join’ step. However, in database join, each input row query executes on the target system from the main stream, resulting in lower performance as the number of queries implement on the B increases.

To avoid the above situation, there is yet another option to join rows form two different Table Input steps. You can use ‘Merge Join ‘step, using the SQL query having ‘ORDER BY’ clause. Remember, the rows must be perfectly sorted before implementing merge join.

Explain How To Sequentialize Transformations?

Since PDI transformations support parallel execution of all the steps/operations, it is impossible to sequentialize transformations in Pentaho. Moreover, to make this happen, users need to change the core architecture, which will actually result in slow processing.

Explain Pentaho Report Designer (prd).?

PRD is a graphic tool to execute report-editing functions and create simple and advanced reports and help users export them in PDF, Excel, HTML and CSV files. PRD consists of Java-based report engine offering data integration, portability and scalability. Thus, it can be embedded in Java web applications and also other application servers like Pentaho BAserver.

Explain The Benefits Of Data Integration.?

The biggest benefit is that integrating data improves consistency and reduces conflicting and erratic data from the DB. Integration of data allows users to fetch exactly what they look for, enabling them utilizeand work with what they collected.

Accurate data extraction, which further facilitates flexible reporting and monitoring of the available volumes of data.

Helps meet deadlines for effective business management.

Track customer’s information and buying behavior to improve traffic and conversions in the future, thus advancing your business performance.


Post a Comment

Previous Post Next Post