Questions On A Enterprise And Community Edition

798 Words4 Pages
Pentaho Pentaho, an organization founded in 2004, provides open source Business Intelligence (BI) suite. They provide two kinds of editions, one is Enterprise edition and the other is Community edition. The difference between an Enterprise and Community edition is that the Enterprise edition has more features to offer to the users than the Community edition. Like all other ETL tools, Pentaho also runs on Java. Here, the storage and transformation are done separately and are stored in XML format. The Pentaho Business Analytics suite offers products which are helpful in data integration (ETL), OLAP, reporting, dashboards, and data mining. These products can be classified as Server applications, Desktop/client applications and Pentaho…show more content…
It is used to perform data flow functions like reading, validating, refining, transforming, writing data to data sources. • Pan – It is used to run data transformations. • Chef – It is used to create jobs which automates the database update process in a complex way. • Kitchen – It is used to execute jobs in a batch mode. • Carte – A web server which allows remote monitoring of the running ETL process through a web browser. • Encr – It is a command line tool for encrypting strings for storage. • Enterprise Edition (EE) Data Integration Server – Data Integration Engine, Security integration with LDAP/Active Directory, Monitor/Scheduler, Content Management. Other than being used in the data warehouses, Pentaho Data Integration tool can also be used for the following purposes: • Migrating data between applications or databases. • Exporting data from databases to flat files. • Loading data massively into databases. • Data cleansing. • Integrating applications. The latest Pentaho version available is v6.1, which brings the following mentioned enhancements: • Improved system performance monitoring. • Data profiling enhancements. • Updated Look and Feel for Pentaho Data Integration. • Easily add new plugins. • Deliver data from multiple data sources. • New Embedded Analytics APIs for Analyzer. • Data movement load balancing. •
Open Document