Difference between revisions of "Cost estimate"
| Line 60: | Line 60: | ||
| Limit HDD || not in sight || 60 E-Shops with data for 2 months || 100 E-Shops with data for 2 months || not in sight || not in sight || 3000 rules / limit of users depends on user behaviour | | Limit HDD || not in sight || 60 E-Shops with data for 2 months || 100 E-Shops with data for 2 months || not in sight || not in sight || 3000 rules / limit of users depends on user behaviour | ||
|- | |- | ||
| − | | Limit Network if there is a limit at al it is the network, unless the communication is redesigned and all service modules write to a distributed database instead of communicating via REST || not in sight || not in sight || not in sight || not in sight || not in sight | + | | Limit Network || if there is a limit at al it is the network, unless the communication is redesigned and all service modules write to a distributed database instead of communicating via REST || not in sight || not in sight || not in sight || not in sight || not in sight |
|- | |- | ||
| Necessary Actions for Scaling || eventually changing the communication between ECC and service other modules || more Storage || more Storage || none || more CPU, RAM & Parallelization || more CPU, RAM & Parallelization | | Necessary Actions for Scaling || eventually changing the communication between ECC and service other modules || more Storage || more Storage || none || more CPU, RAM & Parallelization || more CPU, RAM & Parallelization | ||
| − | | | + | |} |
| − | |||
| − | |||
| − | |||
Revision as of 13:01, 20 January 2016
Cost estimate
For the future owner of the E-Compass Data Mining Services it is essential to be aware of the operational as well as the migration cost. As the migration of the services to a new data center essentially requires the deployment of a couple of virtual machines images which are available in the OVA (open virtualization archive) format and only minor adjustments by editing the service location URIs and running the available service configuation scripts the migrational cost can be neglected in comparison to the operational cost - as well in terms of finances as in terms of the time required. The complete migration of the services should be done within one or two days.
This leaves the us with the estimation of the operational cost. In order to estimate this we need the following:
Virtual Machine Specification for all service modules
- provision cost in data center (the cost estimate should include energy consumption and hardware maintenance, redundancy and backup as needed)
- by checking all inclusive public cloud offerings we can get a fairly good estimate of the provision cost, given that we know the requirements in terms of CPUs/Cores, RAM, Storage, Network capacities needed.
Maintenance efforts for each service module
- estimate of the overall effort to maintain the system
- staff required for service maintenance
Helpdesk / user support costs
- Experience from user testing: how much effort is needed to get the services set up for each E-Shop?
Limitations In order to get the dimension of a productive environment correctly - and therefore also the cost estimate - we need to derive the limitations of the current implementation of the different services. This includes the knowledge on all parameters that have influence on the scaling of the different services modules. These include:
- current load on VM
- number of E-Shop users
- number of products monitored for each user
This information the allows to derive
- scaling limits for these VM specifications
- VM cost depending on the parameters relevant for scaling
- general provision cost per user, per product or other parameters like e.g. the number of rules defined for the notification & actions engine
For a proper operation of the E-Compass Data Mining Services it is also helpful to be aware of some general scaling limits. Questions here are how far scaling can be reached by using more hardware capacity. Depending on the software implementation of the services larger scaling may also require the implementation of further parallelization of the software source code if higher scaling is needed. All these issues need to be considered in order to get an appropriate cost estimation for the productive operation of the services.
The following table gives an overview over the hardware characteristics used in the current implementations of the services and therefor over the virtual hardware used in total for all E-Compass Data Mining Services:
| Service Module | ECC | DCC & DA | Virtuoso | CDC VM1 | CDC VM2 | NAE | Total |
|---|---|---|---|---|---|---|---|
| Guest Operating System | CentOS 6.6 - x86_64 | CentOS 6.6 - x86_64 | CentOS 6.6 - x86_64 | Windows 8 Enterprise, 64 Bit | Windows 8 Enterprise, 64 Bit | Windows 8 Enterprise, 64 Bit | 64 bit Linux and Windows |
| Processor | 2 Processors, 4 Cores 2.20GHz | Dual 2 core Intel processor | i7 intel 5 cores | 2 Processors, 2.27GHz, 4 cores | 2 Processors, 2.27GHz, 8 cores | 4 Cores | 10 Processors / 27 Cores |
| RAM | 8 GB | 16 GB | 16 GB | 8 GB | 32 GB | 3 GB | 83 GB |
| Hard Drive Space VM | 68,36 GB (SAN/NAS configuration) | 100 GB internal | 100 GB internal | 50 GB | 533 GB | 44 GB | ca. 900 GB |
| Network Connection | 1 Gbit/s | 1 Gbit/s Ethernet | 1 Gbit/s Ethernet | 10 Gbit/s 10 Gbit/s | 10 Gbit/s | 1 Gbit/s |
The parameters relevant for scaling are described in the following table:
| Service Module | ECC | DCC & DA | Virtuoso | CDC VM1 | CDC VM2 | NAE |
|---|---|---|---|---|---|---|
| Scales with | E-Shops | Users | Users | Competitors monitored | Competitors monitored | Number of rules and total number of users |
| Limit Processors / Cores | not in sight | not in sight | not in sight | not in sight | 10 Competitors | 3000 rules / limit of users depends on user behaviour |
| Limit RAM | not in sight | ? | ? | not in sight | 40 Competitors | 3000 rules / limit of users depends on user behaviour |
| Limit HDD | not in sight | 60 E-Shops with data for 2 months | 100 E-Shops with data for 2 months | not in sight | not in sight | 3000 rules / limit of users depends on user behaviour |
| Limit Network | if there is a limit at al it is the network, unless the communication is redesigned and all service modules write to a distributed database instead of communicating via REST | not in sight | not in sight | not in sight | not in sight | not in sight |
| Necessary Actions for Scaling | eventually changing the communication between ECC and service other modules | more Storage | more Storage | none | more CPU, RAM & Parallelization | more CPU, RAM & Parallelization |