Mindtree Pentaho Recently Asked Interview Questions

Name Major Applications Comprising Pentaho Bi Project.?

Business Intelligence Platform.
Dashboards and Visualizations.
Reporting.
Data Mining.
Data Analysis.
Data Integration and ETL (also called Kettle).
Data Discovery and Analysis (OLAP).

What Is Mdx And Its Usage?

MDX is an acronym for ‘Multi-Dimensional Expressions,’ the standard query language introduced by Microsoft SQL OLAP Services. MDX is an imperative part of XML for analysis API, which has a different structure than SQL.

A basic MDX query is:

SELECT {[Quantity].[Unit Sales], [Quantity].[Store Sales]} ON COLUMNS,

{[Product].members} ON ROWS

FROM [Sales]

WHERE [Time].[1999].[Q2]
Mindtree Pentaho Recently Asked Interview Questions Answers
Mindtree Pentaho Recently Asked Interview Questions Answers

Define Three Major Types Of Data Integration Jobs.?

Transformation Jobs : Used for preparing data and used only when the there is no change in data until transforming of data job is finished.

Provisioning Jobs : Used for transmission/transfer of large volumes of data. Used only when no change is data is allowed unless job transformation and on large provisioning requirement.

Hybrid Jobs : Execute both transformation and provisioning jobs. No limitations for data changes; it can be updates regardless of success/failure. The transforming and provisioning requirements are not large in this case.

Illustrate The Difference Between Transformations And Jobs.?

While transformations refer to shifting and transforming rows from source system to target system, jobs perform high level operations like implementing transformations, file transfer via FTP, sending mails, etc.

Another significant difference is that the transformation allows parallel execution whereas jobs implement steps in order.

Define Pentaho Report Types.?

There are several categories of Pentaho reports :

Transactional Reports : Data to be used form transactions. Objective is to publish detailed and comprehensive data for day-to-day organization’s activities like purchase orders, sales reporting.

Tactical Reports : data comes from daily or weekly transactional data summary. Objective is to present short-term information for instant decision making like replacing merchandize.

Strategic Reports : data comes from stable and reliable sources to create long-term business information reports like season sales analysis.

Helper Reports : data comes from various resources and includes images, videos to present a variety of activities.

What Are Variables And Arguments In Transformations?

Transformations dialog box consists of two different tables: one of arguments and the other of variables. While arguments refer to command line specified during batch processing, PDI variables refer to objects that are set in a previous transformation/job in the OS.

How To Configure Jndi For Pentaho Di Server?

Pentaho offers JNDI connection configuration for local DI to avoid continuous running of application server during the development and testing of transformations.  Edit the properties in jdbc.propertiesfile located at…data-integration-serverpentaho-solutionssystemsimple-jndi.

Explain Pentaho Reporting Evaluation.?

Pentaho Reporting evaluation is a complete package of its reporting abilities, activities and tools, specifically designed for first-phase evaluation like accessing the sample, generating and updating reports, viewing them and performing various interactions. This evaluation consists of Pentaho platform components, Report Designer and ad hoc interface for reporting used for local installation.

Can Field Names In A Row Duplicates In Pentaho?

No, Pentaho doesn’t allow field duplication.

Does Transformation Allow Filed Duplication?

“Select Values” will rename a field as you select the original field also.  The original field will have a duplicate name of the other field now.

How To Use Database Connections From Repository?

You can either create a new transformation/job or close and reopen the ones already loaded in Spoon.

Explain In Brief The Concept Pentaho Dashboard.?

Dashboards are the collection of various information objects on single page including diagrams, tables and textual information. The Pentaho AJAX API is used to extract BI information while Pentaho Solution Repository contains the content definitions.

The steps involved in Dashboard creation include:

Adding dashboard to the solution.
Defining dashboard content.
Implementing filters.
Editing dashboards.

How To Use Logic From One Transformation/job In Other Process?

Transformation logic can be shared using subtransformations, which provides seamless loading and transformation of variables enhancing efficiency and productivity of the system. Subtransformations can be called and reconfigured when required.

Explain The Use Of Pentaho Reporting.?

Pentaho reporting enables businesses to create structured and informative reports to easily access, format and deliver meaningful and important information to clients and customers. They also help business users to analyze and track consumer behavior for the specific time and functionality, thereby directing them towards the right success path.

What Is Pentaho Data Mining?

Pentaho Data Mining refers to the Weka Project, which consists of a detailed tool set for machine learning and data mining. Weka is open source software for extracting large sers of information about users, clients and businesses. It is built on Java programming.


Is Data Integration And Etl Programming Same?

No. Data Integration refers to passing of data from one type of systems to other within the same application. On the contrary, ETL is used to extract and access data from different sources. And transform it into other objects and tables.

Explain Hierarchy Flattening.?

It is just the construction of parent child relationships in a database. Hierarchy Flattening uses both horizontal and vertical formats, which enables easy and trouble-free identification of sub elements. It further allows users to understand and read the main hierarchy of BI and includes Parent column, Child Column, Parent attributes and Child attributes.

Define Pentaho Bi Project?

The Pentaho BI Project is an current effort by the Open Source communal to provide groups with best-in-class solutions for their initiative Business Intelligence (BI) needs.

What Major Applications Comprises Of Pentaho Bi Project?

The Pentaho BI Project encompasses the following major application areas:

Business Intelligence Platform
Data Mining
Reporting
Dashboards
Business Intelligence Platform

Which Platform Benefits From The Pentaho Bi Project?

Java developers who generally use project components to rapidly assemble custom BI solutions
ISVs who can improve the value and ability of their solutions by embedding BI functionality
End-Users who can quickly deploy packaged BI solutions which are either modest or greater to traditional commercial offerings at a dramatically lower cost.

Explain Pentaho?

It addresses the blockades that block the organization’s ability to get value from all our data. Pentaho is discovered to ensure that each member of our team from developers to business users can easily convert data into value.

How Do You Duplicate A Field In A Row In A Transformation?

Several solutions exist:

Use a “Select Values” step renaming a field while selecting also the original one. The result will be that the original field will be duplicated to another name.

It will look as follows:

This will duplicate fieldA to fieldB and fieldC.

Use a calculator step and use e.g. The NLV(A,B) operation as follows:

This will have the same effect as the first solution: 3 fields in the output which are copies of each other: fieldA, fieldB, and fieldC.

Use a JavaScript step to copy the field:

This will have the same effect as the previous solutions: 3 fields in the output which are copies of each other: fieldA, fieldB, and fieldC.

Why Can’t I Duplicate Fieldnames In A Single Row?

You can’t. PDI will complain in most of the cases if you have duplicate fieldnames. Before PDI v2.5.0 you were able to force duplicate fields, but also only the first value of the duplicate fields could ever be used.


Post a Comment

Previous Post Next Post