Blackout in the data center – When it gets dark

4 September 2020

Real risk to the server room

With almost 99.9 percent reliability, the power supply in Austria is far above the European standard. However, this apparent security also harbors the greatest danger: we are used to our everyday processes working automatically. Heating, air conditioning, water, cooking, refuelling: all this and much more requires energy. From one second to the next, all of it can stop in a blackout. As power supply experts, we have looked at this scenario once again and researched the most important facts and figures.
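To put a reliability figure such as the one above into perspective, it can be converted into an average number of hours without power per year. The following is a minimal sketch of that arithmetic only; the function name is hypothetical, and the calculation says nothing about how the outage time is distributed (many short interruptions or one long blackout).

```python
HOURS_PER_YEAR = 365 * 24  # 8760 hours in a non-leap year


def expected_downtime_hours(availability_percent: float) -> float:
    """Return the average yearly downtime implied by an availability figure."""
    if not 0.0 <= availability_percent <= 100.0:
        raise ValueError("availability must be between 0 and 100 percent")
    return HOURS_PER_YEAR * (1.0 - availability_percent / 100.0)


# Compare a few availability levels (the values are illustrative inputs).
for availability in (99.0, 99.9, 99.99):
    hours = expected_downtime_hours(availability)
    print(f"{availability}% available: about {hours:.2f} h without power per year")
```

At 99.9 percent this works out to roughly 8.76 hours per year, which is why even a very reliable grid does not make preparation unnecessary.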

What does “blackout” mean exactly?

A “blackout” is a sudden power failure that lasts for a longer period of time and affects several parts of Europe at once. In the power outages Europe has experienced so far, help from other European countries has eased the situation and helped to restore the power grid. In a blackout, each country would be on its own, so help from outside would not be possible. The result would be a total supply collapse.

Power outage in the server room – underestimated danger

In principle, everyone is affected by a blackout: from private households to data centers to production plants. Especially in the production sector, Industry 4.0 and the Internet of Things (IoT) have increased dependence on information and communication technology systems in recent years. The consequences of such a power outage would be far-reaching and in some cases extremely costly. Because of the growing flood of data, the IT services of data centers today depend heavily on an uninterrupted power supply. Without adequate protective measures, servers crash within a few seconds of losing power. Data center operators must therefore take preventive measures to keep the consequences of a prolonged power outage as small as possible.

Data Center Emergency Management

The requirements for proper IT operations are rising as pressure from compliance and auditing grows. The goal of a fully developed emergency management system in the server room is a defined, proven and targeted response in the event of an emergency. An emergency manual covers the most important points:

  • Differentiation between disruption, emergency, crisis
  • Organizational framework and responsibilities
  • Escalation procedures
  • Communication
  • Emergency plans

The emergency manual should be continuously optimized and developed further based on experience and tests. After a complete server room has been installed, we recommend that operators have an expert carry out a full-load test.

“As part of the function and load test, we deliberately push the limits of our load capacity and can thus identify possible risks and avoid them preventively,” explains Jürgen Grubmüller, Technical Manager at EPS.

Data Center 1-Day Checks

For existing server rooms, we recommend a holistic review by independent data center experts. It covers the different aspects of data center operation, e.g. operating processes, physical protection, power and cooling supply, monitoring and energy efficiency. Here, too, risks and sources of error can be identified early and avoided in the future.

UPS systems as a preventive protective measure

When planning a server room, the focus is on the uninterruptible power supply (UPS). A UPS system is considered one of the most important protective measures for server rooms and industrial plants. It is installed in the power supply line of the system to be protected and guarantees continuity of supply during short failures. In most cases, each UPS has its own battery module. Depending on the required capacity and bridging time, the batteries can be located in the UPS module itself, in the same cabinet or in a separate battery cabinet.

In the event of prolonged power outages (blackouts), UPS systems provide additional time for a damage-free shutdown of IT components or industrial plants. The UPS software Intelligent Power Manager (IPM) developed by Eaton is helpful here. It manages all network-based power infrastructure devices, including UPS systems and rack-based power distribution units (ePDUs®), triggers virtual machine migration plans, and shuts down unneeded devices to maintain business operations during power and environmental events.
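The idea of using the battery time for an orderly shutdown can be illustrated with a small planning sketch. It is not the Eaton IPM API and does not talk to any device; the class, the function, the system names and all durations are hypothetical. It assumes systems are shut down one after another and that the sequence should start as late as possible, so that short outages pass without any shutdown at all.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class System:
    name: str
    shutdown_minutes: float  # time a clean shutdown takes (assumed; measure it in a test)
    priority: int            # lower number = shut down earlier


def plan_shutdown(systems, ups_runtime_minutes, safety_margin_minutes=2.0):
    """Return (name, start_minute) pairs for a sequential shutdown.

    Minute 0 is the moment mains power fails. The sequence is scheduled as
    late as possible while still finishing before the battery runtime,
    minus a safety margin, is used up.
    """
    ordered = sorted(systems, key=lambda s: s.priority)
    total = sum(s.shutdown_minutes for s in ordered)
    budget = ups_runtime_minutes - safety_margin_minutes
    if total > budget:
        raise ValueError("UPS runtime is too short for a complete clean shutdown")
    start = budget - total
    plan = []
    for system in ordered:
        plan.append((system.name, start))
        start += system.shutdown_minutes
    return plan


# Illustrative values only.
example_systems = [
    System("database", 4.0, 3),
    System("test VMs", 3.0, 1),
    System("storage", 2.0, 4),
    System("application servers", 5.0, 2),
]
for name, minute in plan_shutdown(example_systems, ups_runtime_minutes=20.0):
    print(f"minute {minute:4.1f}: start shutdown of {name}")
```

If the measured battery runtime drops, for example because the batteries have aged, the same inputs raise an error instead of producing a plan that cannot finish in time.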

Besides the constant availability of emergency power, high efficiency is becoming an increasingly important requirement for UPS systems. Especially for modular systems, UPS manufacturers have developed controllers that switch individual modules to standby mode when they are not needed. Figuratively speaking, these modules idle and switch on automatically as soon as the load increases. This reduces operating costs in the long term and improves energy efficiency.
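How a controller might decide which modules can go to standby can be sketched as follows. This is a simplified assumption, not a manufacturer's algorithm: the function name, the module rating and the N+1 redundancy default are all hypothetical.

```python
import math


def active_modules(load_kw, module_kw, installed_modules, redundant_modules=1):
    """Return how many modules must stay active; the rest may go to standby.

    Assumes identical modules and a fixed number of redundant modules
    (N+1 by default) that stay active on top of what the load needs.
    """
    if load_kw < 0 or module_kw <= 0:
        raise ValueError("load must be >= 0 and module rating must be > 0")
    needed = math.ceil(load_kw / module_kw) + redundant_modules
    if needed > installed_modules:
        raise ValueError("load exceeds installed capacity including redundancy")
    return needed


# Illustrative example: five hypothetical 8 kW modules carrying a 13 kW load.
installed = 5
active = active_modules(load_kw=13.0, module_kw=8.0, installed_modules=installed)
print(f"{active} modules active, {installed - active} in standby")
```

When the load rises, calling the function again with the new value shows how many standby modules have to be switched back on.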

The modular and scalable 93PS UPS system (8 kW to 40 kW) from Eaton offers maximum availability at over 96% efficiency in double-conversion operation. This UPS stands out for its very low total cost of ownership and guarantees maximum and efficient availability for consumers.
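What an efficiency figure means in practice can be estimated with a short calculation. The sketch below assumes a constant load and a constant efficiency over the whole year, which real systems do not have; the function names and the 92% comparison value are hypothetical and are not taken from any datasheet.

```python
HOURS_PER_YEAR = 8760


def ups_losses_kw(load_kw: float, efficiency: float) -> float:
    """Power lost inside the UPS for a given output load and efficiency (0 to 1)."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the range (0, 1]")
    input_kw = load_kw / efficiency
    return input_kw - load_kw


def yearly_loss_kwh(load_kw: float, efficiency: float) -> float:
    """Energy lost per year, assuming constant load and efficiency."""
    return ups_losses_kw(load_kw, efficiency) * HOURS_PER_YEAR


# Illustrative comparison at 40 kW load: 96% versus a hypothetical 92%.
for eff in (0.96, 0.92):
    print(f"efficiency {eff:.0%}: {ups_losses_kw(40.0, eff):.2f} kW losses, "
          f"{yearly_loss_kwh(40.0, eff):.0f} kWh per year")
```

Losses also end up as heat in the server room, so a more efficient UPS reduces the cooling demand as well.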

Regular UPS service by specialist staff

To ensure maximum availability and flawless function of the UPS in the long term, we recommend having the systems serviced regularly by specialist personnel and replacing older components preventively. Preventive maintenance minimizes operational interruptions and downtime costs and at the same time extends the service life of the UPS system. Arrange a non-binding initial consultation now.

Emergency power generators as a bridging measure

Emergency power systems (NEA, from the German “Netzersatzanlage”) or emergency power generators generate electricity when the normal power grid fails. They can run for a few hours or secure the power supply for days, supplying all safety-relevant consumers with electrical energy. In addition to the UPS systems, this can also include air conditioning and other systems.

External power generators are increasingly used to maintain availability in data centers and in production plants: powered by diesel, they provide the necessary replacement energy.

In a data center, interruptions of less than one second are referred to as “short interruptions”. Installed UPS systems absorb these reliably. During longer interruptions, emergency power generators take over the complete power supply of the system and recharge the UPS batteries at the same time. Depending on the required electrical power and the design of the tank system, the NEA supplies the connected load for the defined period of time. When grid power returns, the load can be switched back without interruption by synchronizing the generator with the grid.
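The interplay of grid, UPS and generator described above can be written down as a simple decision rule. This is a sketch under assumptions: the one-second threshold comes from the text, while the generator start-up time of 15 seconds and all names are hypothetical placeholders that depend on the actual installation.

```python
from enum import Enum
from typing import Optional


class Source(Enum):
    GRID = "grid"
    UPS_BATTERY = "UPS battery"
    GENERATOR = "emergency power generator"


SHORT_INTERRUPTION_S = 1.0  # from the text: shorter than one second


def is_short_interruption(duration_s: float) -> bool:
    """True if an interruption counts as a 'short interruption'."""
    return 0.0 < duration_s < SHORT_INTERRUPTION_S


def supply_source(outage_s: Optional[float], generator_start_s: float = 15.0) -> Source:
    """Return which source feeds the load.

    outage_s is the time since the grid failed, or None if the grid is present.
    generator_start_s is an assumed start-up time until the generator can carry load.
    """
    if outage_s is None:
        return Source.GRID
    if outage_s < generator_start_s:
        return Source.UPS_BATTERY  # battery bridges until the generator is ready
    return Source.GENERATOR        # generator carries the load and recharges the batteries


for t in (None, 0.4, 5.0, 60.0):
    print(f"outage {t} s: load fed from {supply_source(t).value}")
```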

For a reliable power supply, the following characteristics of a genset should be defined and dimensioned in advance with qualified specialists: e.g. quantity and quality of the fuel, storage location, storage type and storage period of the fuel, and the bridging period.
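The relationship between fuel quantity and bridging period can be estimated roughly as follows. The specific consumption of 0.3 litres of diesel per kWh and the 10% reserve are assumptions for illustration only; actual values depend on the genset and its load level and should be taken from the manufacturer's datasheet. The function names are hypothetical.

```python
def generator_runtime_hours(usable_fuel_l: float, load_kw: float,
                            litres_per_kwh: float = 0.3) -> float:
    """Estimate how long the usable tank content lasts at a constant load."""
    if load_kw <= 0 or litres_per_kwh <= 0:
        raise ValueError("load and specific consumption must be positive")
    return usable_fuel_l / (load_kw * litres_per_kwh)


def required_fuel_litres(load_kw: float, bridging_hours: float,
                         litres_per_kwh: float = 0.3,
                         reserve_fraction: float = 0.1) -> float:
    """Estimate the fuel needed for a bridging period, including a reserve."""
    return load_kw * bridging_hours * litres_per_kwh * (1.0 + reserve_fraction)


# Illustrative values: 50 kW load, 1500 l usable fuel, 72 h target bridging period.
print(f"runtime: {generator_runtime_hours(1500.0, 50.0):.1f} h")
print(f"fuel for 72 h: {required_fuel_litres(50.0, 72.0):.0f} l")
```

Because diesel ages in storage, the storage period and fuel quality mentioned above limit how much of the tank content can be counted as usable.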

“Our world is becoming more connected and digital, so the requirement for availability and quality of power supply is more important than ever. Preventive precautions help to maintain the necessary availability for mission-critical and strategic infrastructures in emergencies and thus mitigate negative social and economic consequences,” says Peter Reisinger, Sales and Data Center Project Manager at EPS.

Can I help you?

My name is Peter Reisinger and I am happy to help you plan, dimension and select your server room.

3 tips from the blackout expert

How hard a possible blackout will hit us depends heavily on how we prepare for it. Blackout expert Herbert Saurugg from the Austrian Society for Crisis Prevention gives 3 practical tips:

  1. Create a crisis plan within the family: organize supplies, get a battery-powered radio, take special needs into account (medication, small children, animals, etc.)
  2. Organize a contingency plan for employers and employees: draw up an emergency plan that covers emergency operation, create a staff transfer plan and keep it up to date, record in writing how many minutes the UPS will keep running after the power failure, provide offline plans and restart plans with a fixed order of the systems
  3. Involve self-help and neighbourhood help: raise the issue in the neighbourhood and in the community, motivate each other to prepare for a blackout, help people in need in the neighbourhood
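The restart plan with a fixed order of the systems, mentioned in the second tip, can be derived from the dependencies between systems. The sketch below uses `graphlib.TopologicalSorter` from the Python standard library (Python 3.9 or later); the dependency map and the system names are hypothetical examples, not a recommendation for any particular environment.

```python
from graphlib import TopologicalSorter

# Hypothetical dependency map: each system lists what must be running before it.
DEPENDS_ON = {
    "network switches": [],
    "storage": ["network switches"],
    "directory service": ["network switches"],
    "database": ["storage", "directory service"],
    "application servers": ["database"],
}


def restart_order(depends_on):
    """Return a start order in which every system comes after its dependencies.

    Raises graphlib.CycleError if the dependencies contain a cycle.
    """
    return list(TopologicalSorter(depends_on).static_order())


def shutdown_order(depends_on):
    """Return the reverse order, so dependants stop before what they rely on."""
    return list(reversed(restart_order(depends_on)))


print("Restart: ", " -> ".join(restart_order(DEPENDS_ON)))
print("Shutdown:", " -> ".join(shutdown_order(DEPENDS_ON)))
```

Printing the result and filing it with the emergency manual keeps the order available offline when the systems themselves are down.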

The current Covid-19 crisis has shown how quickly our everyday and professional lives can be turned upside down. However, exceptional situations can also be of a different nature: floods, heat waves, forest fires, technical defects, short circuits, switching errors or cyber attacks can trigger a prolonged power outage. Now at the latest, alternative power supply solutions should become part of our crisis management. A good and secure network infrastructure enables us to continue living and working at the usual level of quality.

Sources:

https://www.funkschau.de/datacenter-netzwerke/blackout.125579.html
https://www.datacenter-insider.de/eine-usv-alleine-macht-noch-keine-sicherheit-a-416696/?p=2
https://www.handelszeitung.ch/unternehmen/ein-blackout-kann-teuer-werden-618092
https://www.computerwoche.de/a/it-manager-unterschaetzen-die-gefahr,2515810
https://www.prior1.com/
https://www.saurugg.net

For IT, it’s not just about emergency supply – MP2 IT-Solutions takes precautions

With professional IT emergency planning and the right crisis preparedness, risks can be assessed, measures can be taken and possible damage can be reduced and, in some cases, prevented. Therefore, take precautions in good time – the IT company MP2 IT-Solutions is at your side.

  • Creation of an IT security concept that is suitable for your company
  • Professional emergency planning for IT incl. a concrete catalogue of measures
  • Risk analysis & impact assessment – in accordance with ISO 27001:2013
  • Awareness training and further training for your employees

“IT emergency manuals set out instructions for action in an emergency – how to react and decide. They are a guideline where it is important to take the right measures quickly. It is also important to organize test runs, such as shutting down and restarting systems. In this way, weak points can be analyzed and processes optimized.” – Ing. Christoph Kitzler, Managing Director and Technical Director MP2 IT-Solutions.

We will be happy to advise you on your blackout IT prevention.

By Maximilian Aass
