Managing a facility means dealing with issues at all hours, often when no one is sitting at their desks watching the controls. Building automation systems act as the smart backbone of today’s buildings by connecting HVAC, lighting, fire safety, security, electricity, and more into one seamless platform. Whether it’s a hospital that demands zero downtime … Continued
In a perfect world, log anomalies would speak clearly and never at 2 a.m. But in reality, log data is massive, alerts can be cryptic, and critical issues often get buried in the noise. That’s why choosing the right log management tool is crucial, it’s the first line of defense against downtime, breaches, and costly … Continued
We’re excited to announce the launch of our bi-directional integration between OnPage and Jira! This integration is designed to bridge the gap between ticket creation and incident response, ensuring that IT, DevOps and other tech teams who rely on Jira to manage their incidents can automatically identify and engage the right on-call staff—ensuring critical incidents … Continued
Top IT Conferences of 2025 IT conferences offer valuable opportunities to build lasting partnerships and explore growth-driven technologies. However, with so many options available, it’s crucial for teams to prioritize events that deliver the greatest impact. So, we have established a list of the top IT conferences to attend in 2025: IT Nation Connect When … Continued
Site Reliability Engineer’s Guide to Black Friday It’s gotten to the point where Black Friday reliability prep has to start on…well Black Friday. This year, 32% of consumers in the US claimed that they were going to start their holiday shopping in July-October. Plus, Black Friday isn’t the only day eCommerce businesses have to worry … Continued
How Effective Are Your Alerting Rules? Recently, I came across this Reddit post highlighting the challenges of having ineffective alerting rules: And, here at OnPage we have experience with various companies who have dealt with just that, so I felt I should share some of our top tips for creating effective alerting rules in this … Continued
What Are Large Language Models? Large language models are algorithms designed to understand, generate, and manipulate human language. State-of-the-art large language models include OpenAI’s GPT-4o, Anthropic Claude Sonnet 3.5, and Meta LLaMA 3.1. They are built using neural networks with billions or even trillions of parameters. They are trained on vast datasets that can include … Continued
When it comes to critical incident management, IT teams require a structured approach that will ensure that any cybersecurity event is swiftly remediated. And no incident management plan is complete without a clearly defined incident response team. Whether your team is looking to establish an incident response team from scratch or just improve existing response … Continued
Rethinking IT Management – Introduction We live in a time where immediate communication of critical incidents is vital for maintaining continuous service availability. As companies strive to enhance their IT service management practices, many integrate technologies like Interactive Voice Response (IVR) into their service delivery frameworks. However, this approach may not always be the most … Continued
Crisis Management for Oil and Gas Companies Oil and gas companies operate in a high-stakes environment where the potential for catastrophic incidents, such as oil spills, explosions, and natural disasters always exists. These risks necessitate the establishment of robust crisis management for oil and gas companies to ensure the safety of their personnel and minimize … Continued