Hello,
we are evaluating Pentaho apps suite for reporting and analytics solution for one of our customers. After reading large amount of official/community documents we get some understanding of the solution being developed. We'd like to get some feedback/comments on our approach and answers to the questions provided below.
We think on the high level vision of solution as following:
- use BI server CE on development environment
- use BA enterprise version on production
- don't use DI components, instead we are going to develop custom service application with following functionality:
* work as silent background process on one or more nodes
* receive raw business data from customer's business applications and transfer it into cube's for further analysis
* provide automatic data scheme evolution - control and update physical DB schema and generate appropriate OLAP schema for Mondrian
* automatically update OLAP schema on BI server if it get changed
* later on possibly we'll also generate/update metadata (XMI files) for relational data analysis
Customer's business applications are in continuous development - they evolve intensively. Thus data structures for analysis will also change frequently - this will be also continuous development process. We think that our DI solution will grow and evolve within next few years (mainly by adding cubes and dimensions). Customer wants start using analysis immediately. That's why we decided to develop custom data integration solution.
Here some questions:
1. What do you think on such approach to data integration? We think that it is "developer friendly" and suites good for agile development process.
2. Is that possible to programmatically add/update Mondrian schema in BI server? Some remote service? Direct file access? We didn't found any info about that.
3. Is it available some API in Mondrian to programmatically construct schema and export it into XML format?
4. Is it OK using CE server on development and EE version on production for the same OLAP schema and physical data structures?
5. Are there available any performance metrics/comparisons for data analysis? How productive Mondrian in high load? Is it OK if cube's data will be updated frequently?
Thank you in advance.
// Dmitry
we are evaluating Pentaho apps suite for reporting and analytics solution for one of our customers. After reading large amount of official/community documents we get some understanding of the solution being developed. We'd like to get some feedback/comments on our approach and answers to the questions provided below.
We think on the high level vision of solution as following:
- use BI server CE on development environment
- use BA enterprise version on production
- don't use DI components, instead we are going to develop custom service application with following functionality:
* work as silent background process on one or more nodes
* receive raw business data from customer's business applications and transfer it into cube's for further analysis
* provide automatic data scheme evolution - control and update physical DB schema and generate appropriate OLAP schema for Mondrian
* automatically update OLAP schema on BI server if it get changed
* later on possibly we'll also generate/update metadata (XMI files) for relational data analysis
Customer's business applications are in continuous development - they evolve intensively. Thus data structures for analysis will also change frequently - this will be also continuous development process. We think that our DI solution will grow and evolve within next few years (mainly by adding cubes and dimensions). Customer wants start using analysis immediately. That's why we decided to develop custom data integration solution.
Here some questions:
1. What do you think on such approach to data integration? We think that it is "developer friendly" and suites good for agile development process.
2. Is that possible to programmatically add/update Mondrian schema in BI server? Some remote service? Direct file access? We didn't found any info about that.
3. Is it available some API in Mondrian to programmatically construct schema and export it into XML format?
4. Is it OK using CE server on development and EE version on production for the same OLAP schema and physical data structures?
5. Are there available any performance metrics/comparisons for data analysis? How productive Mondrian in high load? Is it OK if cube's data will be updated frequently?
Thank you in advance.
// Dmitry