Availability of IT services as a key business indicator, and what does a watermelon have to do with it?

Using metrics in management is a progressive, modern practice, especially in an environment as digitalized as the IT business. And what business has not become an IT business over the last decade? From the flower trade to the automotive industry, IT is a key success factor everywhere. Metrics help you make competent management and engineering decisions, allocate the budget correctly, increase transparency, and achieve fairness and objectivity.










Two metrics are considered: 1) Service Availability; 2) Health Map.










, . ( ), :





  • . , , ;





  • RTO (recovery time objective) - , ;





  • .










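SA (Service Availability) is calculated from the function fProblem(t), which at each moment in time takes one of four values for a configuration unit (CU):

  • (0): the CU is in working time and operating without an SLA violation;
  • (1): the CU is in working time and has a problem that violates the SLA;
  • (N): the CU is in non-working time;
  • (S): the CU is in service (maintenance) mode.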





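From this function, the following time intervals are determined:

  • timeNonWorking: non-working time of the CU, when fProblem(t) = "N";
  • timeWorkingProblem: working time during which the SLA is violated, when fProblem(t) = "1";
  • timeWorkingService: working time during which the CU is in service (maintenance) mode, when fProblem(t) = "S";
  • timeWorkingOK: working time during which the SLA is met, when fProblem(t) = "0".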





SA (Service Availability) is calculated by the formula:

SA = timeWorkingOK / (timeWorkingOK + timeWorkingProblem) * 100%
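As a minimal sketch of the formula above, the following Python snippet computes SA from a list of (state, duration) intervals. The interval representation, the function name `service_availability`, and the choice to return `None` when there is no accountable working time are assumptions made for illustration; only the four states and the formula itself come from the text.

```python
from typing import Iterable, Optional, Tuple

# One interval of the fProblem(t) timeline: (state, duration).
# The state is one of "0", "1", "N", "S"; the duration is in any
# consistent unit (hours here). This representation is hypothetical.
Interval = Tuple[str, float]


def service_availability(intervals: Iterable[Interval]) -> Optional[float]:
    """Return SA in percent, or None if timeWorkingOK + timeWorkingProblem is 0."""
    totals = {"0": 0.0, "1": 0.0, "N": 0.0, "S": 0.0}
    for state, duration in intervals:
        if state not in totals:
            raise ValueError(f"unknown fProblem state: {state!r}")
        totals[state] += duration

    time_working_ok = totals["0"]
    time_working_problem = totals["1"]
    denominator = time_working_ok + time_working_problem
    if denominator == 0:
        # Only non-working or service time was observed: SA is undefined.
        return None
    return 100.0 * time_working_ok / denominator


# Example day: 12 h non-working, 9 h OK, 1 h problem, 2 h service.
timeline = [("N", 8), ("0", 6), ("1", 1), ("S", 2), ("0", 3), ("N", 4)]
print(service_availability(timeline))  # 90.0
```

Note that non-working time ("N") and service time ("S") do not appear in the formula, so they neither raise nor lower the result.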





Fig. 1. An example of a possible distribution of time intervals when calculating SA (Service Availability) for one CU

Fig. 2. An example of the influence of RTO on the calculation of the function fProblem(t)

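SAG (Service Availability Group) is calculated from the fProblem(t) functions of the CUs in the group. The fProblem(t) functions are combined pairwise into a resulting function (see Table 1).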





Table 1. Combining fProblem(t) values (f1, f2) into fResult

f1    f2    fResult
0     0     0
0     1     1
0     N     0
0     S     0
1     1     1
1     S     1
1     N     1
N     N     N
N     S     S
S     S     S
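The f1/f2/fResult combinations listed above are consistent with a simple precedence rule: "1" overrides everything, "0" overrides "S" and "N", and "S" overrides "N". The Python sketch below encodes that reading and checks it against all ten listed combinations. The precedence formulation, the function names, and the assumption that the operation is symmetric in f1 and f2 are interpretations of the table, not something the text states explicitly.

```python
from functools import reduce
from typing import Iterable

# Higher rank wins when two states are combined (inferred from the table).
_RANK = {"N": 0, "S": 1, "0": 2, "1": 3}


def combine(f1: str, f2: str) -> str:
    """Combine two fProblem values into fResult."""
    return f1 if _RANK[f1] >= _RANK[f2] else f2


def combine_group(states: Iterable[str]) -> str:
    """Fold combine() over the states of all CUs at one moment (non-empty input)."""
    return reduce(combine, states)


# The ten (f1, f2, fResult) combinations from the table.
TABLE_ROWS = [
    ("0", "0", "0"), ("0", "1", "1"), ("0", "N", "0"), ("0", "S", "0"),
    ("1", "1", "1"), ("1", "S", "1"), ("1", "N", "1"),
    ("N", "N", "N"), ("N", "S", "S"), ("S", "S", "S"),
]

for f1, f2, expected in TABLE_ROWS:
    assert combine(f1, f2) == expected
    assert combine(f2, f1) == expected  # symmetry is an assumption

print(combine_group(["N", "S", "0"]))  # 0
```

Because the rule is a maximum over a fixed ordering, folding it over more than two CUs gives the same result regardless of the order in which the pairs are combined.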





























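The resulting function is fGroupProblem(t). From it, the following time intervals are determined:

  • timeGroupService: time during which fGroupProblem(t) = S;
  • timeGroupOK: time during which fGroupProblem(t) = 0;
  • timeGroupProblem: time during which fGroupProblem(t) = 1.

The group availability is then calculated as:

SAG = timeGroupOK / (timeGroupOK + timeGroupProblem) * 100%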
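To show how the pieces fit together, here is a hedged Python sketch that derives fGroupProblem(t) from per-CU state sequences and then applies the SAG formula. The equal-step sampling of each timeline, the function names, and the precedence rule used to merge states (1 over 0 over S over N, inferred from the f1/f2/fResult combinations given earlier) are assumptions made for illustration.

```python
from typing import List, Optional, Sequence

# Higher rank wins when states of several CUs are merged (an inferred rule).
_RANK = {"N": 0, "S": 1, "0": 2, "1": 3}


def group_problem(timelines: Sequence[Sequence[str]]) -> List[str]:
    """Merge per-CU fProblem samples into fGroupProblem, step by step."""
    if not timelines:
        raise ValueError("at least one CU timeline is required")
    length = len(timelines[0])
    if any(len(t) != length for t in timelines):
        raise ValueError("all timelines must cover the same time steps")
    return [max(states, key=_RANK.__getitem__) for states in zip(*timelines)]


def service_availability_group(timelines: Sequence[Sequence[str]]) -> Optional[float]:
    """Return SAG in percent, or None if timeGroupOK + timeGroupProblem is 0."""
    combined = group_problem(timelines)
    time_group_ok = combined.count("0")
    time_group_problem = combined.count("1")
    denominator = time_group_ok + time_group_problem
    if denominator == 0:
        return None
    return 100.0 * time_group_ok / denominator


# Two hypothetical CUs sampled at eight equal time steps.
cu_a = ["N", "0", "0", "1", "0", "S", "N", "0"]
cu_b = ["N", "0", "1", "1", "0", "0", "N", "S"]

print(group_problem([cu_a, cu_b]))
# ['N', '0', '1', '1', '0', '0', 'N', '0']
print(service_availability_group([cu_a, cu_b]))  # 4 OK steps, 2 problem steps
```

In this example the group is in a problem state whenever either CU is, so the group availability (about 66.7%) is lower than the availability of each CU taken separately (80% and 60% would be 4/5 and 3/5 only if computed per CU; here cu_a gives 80% and cu_b gives 60%).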





An example of a possible distribution of time intervals when calculating availability for a group of CUs






  • , . , .





  • , , 1/N, N - .





:





  1. fProblem(t), SA





  2. , fProblem(t) = 1, , . , .





  3. . . , , , 1,   1/N, N - .





  4. :





    1.   - .





    2. fProblem(t) = 1. , , , SLA. 





  5. , fProblem(t). SA.





  6. . timeWorkingProblem. 





  7. %. timeWorkingProblem 100%.





  8. , , . , : . 





(See Fig. 4.)





Fig. 4. Analysis and assessment of problems in calculating availability






In any case, this is not our last attempt to find the “Holy Grail”: the ideal metric, and a method of calculating it, that will help our clients keep their IT environment from turning into that same “watermelon”. Our next bet is on the “Health Map”. I hope to share the results with you in the future.





Finally, a few screenshots of the availability calculation in the MONQ product.


