OCI Observability and Management (O&M) Platform consists of several cloud services that we can enable to monitor, analyze, and manage applications and infrastructure environments with full-stack visibility, prebuilt analytics, and automation capabilities.
In this blog, I will create an alarm that is triggered when the service metrics reach a designated threshold, and using OCI notification we will get a notification alert.
Steps:
1. Confirm compute instance monitoring Plug-in is Enabled
Navigate to Main Menu -> Compute -> Instances and select your instance
Click on the Oracle Cloud Agent tab.
data:image/s3,"s3://crabby-images/9946d/9946d1056f8ba0f7552e1705024e4f51008bc82a" alt="oci oci"
The compute instance monitoring plug-in should be running and enabled.
data:image/s3,"s3://crabby-images/c55de/c55de6fbfcbbbade60a0f5304c4aec7f2e9b4744" alt="oci2 oci2"
2. Create a Topic and Subscription Inside a Topic for Notification
To create an alarm, first I need to create a notification so that the alarm has a way to notify the relevant parties.
OCI Notification has two main elements: Topic (a communication channel for sending alarms) and Subscription (endpoints for receiving those messages)
Navigate to Main Menu –> Developer Services –> Application Integration, and select Notifications.
Click Create Topic and give any topic name ( for eg. CPU stress-topic) and description and click create.
After the topic state changes to Active, click the topic Name.
data:image/s3,"s3://crabby-images/11bef/11befaf538596cc7f72290ccd024f2b60b9f817a" alt="oci3 oci3"
Now Click Create Subscription, Provide the Protocol name and detail
data:image/s3,"s3://crabby-images/9dffd/9dffd39a326b8bf7982a1d75c6603a1e11783d87" alt="oci4 oci4"
Click create and you will see your subscription OCID.
data:image/s3,"s3://crabby-images/47e6a/47e6a47b58729f8efee4507a337a4dbe820c031f" alt="oci5 oci5"
The subscription details screen will be displayed with the subscription status pending.
data:image/s3,"s3://crabby-images/a0dad/a0dad9dc251d56379ca4415f50f7473309464002" alt="oci6 oci6"
I should receive an email to confirm the subscription with Confirm subscription verification link in it. Like below..
data:image/s3,"s3://crabby-images/2f983/2f98318cf31d10723c74d4da61db6f99ef0155b8" alt="oci7 oci7"
Verify the subscription by clicking on Confirm subscription and then you will see the subscription status changes to active in the OCI console.
data:image/s3,"s3://crabby-images/8fbea/8fbea4c7a859eed615b15facf3f64aef46de4a72" alt="oci8 oci8"
3. Now the final step is to create an alarm for CPU Utilization
Navigate to Main Menu -> Observability & Management -> Monitoring and click Alarm Definitions.
Click Create Alarm.
a. Define Alarm
data:image/s3,"s3://crabby-images/e6bae/e6baed04e7f5854d309172abcba94d17ec7a6ac8" alt="oci9 oci9"
b. Define Metric description like compartment, metric namespace (oci_computeagent), Metric name, interval, and statistics like below.
data:image/s3,"s3://crabby-images/5cbc8/5cbc8d9c4bfddc3cbd24e1e9100f229e05f50eb2" alt="oci10 oci10"
I leave the metric dimensions area blank as I have only one compute instance but if you have like 5 or more instances then you can use dimensions to filter based on dimensions.
c. Define trigger rule: you can use any value as per requirement.
data:image/s3,"s3://crabby-images/c947e/c947e250dbab7972e12dac590172f59d1c10df9b" alt="oci11 oci11"
d. Define Alarm Notifications: Here you have to mention the Topic that we created in step 2.
data:image/s3,"s3://crabby-images/b0678/b06785be49f1d942f0df48c6e86004b2f15a8e35" alt="oci12 oci12"
e. We can also select the message format and repeat notification if an alarm continues. Or you can suppress the notification.
data:image/s3,"s3://crabby-images/800a8/800a8d40a77ba08bef2a17c95ffc551c55679521" alt="oci13 oci13"
f. Check Enable this alarm and save it.
data:image/s3,"s3://crabby-images/80db1/80db1c5a976c868d5ee52891fd243a0183b90e17" alt="oci14 oci14"
You should now be able to see the alarm’s details when you click on Alarm like below.
data:image/s3,"s3://crabby-images/0945b/0945bef11790e9878c8223924ad4e51cc45c8ce0" alt="oci15 oci15"
And when the threshold reaches to >=70 as per the alarm setup, we will get an email notification like below.
data:image/s3,"s3://crabby-images/1b072/1b07299a7aa4672f1f7332aa7f3482730a32cc79" alt="oci16 oci16"
Email provides details about Alarm OCID, number of metrics breaching threshold, and dimensions.
data:image/s3,"s3://crabby-images/74002/740025f003d6d51a56ad32e0d80a83cab8e51c54" alt="oci17 oci17"
Stay Tuned for my next O&M Blog!