Setting up the monitoring of a BI infrastructure based on the Google Cloud
Project duration: 1 year, 4 months
Brief description
The project aims to implement a powerful monitoring system in the Google Cloud for the business intelligence infrastructure. The main objective is to ensure stable operation of the entire infrastructure and important ETL processes. To this end, the current and historical status of the systems is continuously monitored so that potential problems can be identified and responded to at an early stage. A new dashboard provides users with an intuitive overview of the performance and availability of the systems in real time. In addition, the existing alarm system is supplemented by extended alerting functions to enable more targeted notifications and proactive measures in the event of faults.
Supplement
The implementation of monitoring in the Google Cloud Platform (GCP) is designed in such a way that all metrics are managed centrally. Standard metrics such as CPU utilization, memory consumption and network utilization are recorded and monitored directly by the GCP. In addition, user-defined metrics are written in Python and run as scheduled cloud functions that collect specific metrics for the BI infrastructure and are made available via the GCP's monitoring API. Alerts are triggered based on these metrics, with notifications being sent directly from the Google Cloud when predefined thresholds are reached. Grafana is used to visualize the data, which enables a user-friendly and flexible presentation of the metrics. The dashboard shows both current and historical data and thus supports the monitoring and analysis of the infrastructure.
Subject description
The project offers significant advantages thanks to the standardization and consolidation of all metrics in a central monitoring system. Clarity is significantly improved as all relevant data is bundled in one place and presented in a standardized manner. This not only facilitates monitoring, but also the maintenance of the BI infrastructure. In addition, the flexible scalability of the solution ensures that it can be easily adapted to growing requirements without compromising system stability. Overall, monitoring enables more efficient operational management and faster response times in the event of disruptions.