Skip to content

Latest commit

 

History

History
173 lines (104 loc) · 9.21 KB

readme.md

File metadata and controls

173 lines (104 loc) · 9.21 KB

Loading the data

The data are collected from the Johns Hopkins University Center for Systems Science and Engineering and cleaned by datahub (see This repo). In order to normalize the time axis, I filtered the data to retain the data since the 100th case has been reported. I filter out small countries (population less than 10000) and countries which are not in the world SF file, for convenience.

As the number of cases, deaths and healed are not measured/recorded each day, I need to expand the data set for each possible combination (country, time). If I don't do so, some countries will have missing records for some dates and the resulting charts and GIF will be "blinking". Hopefully, tidyr makes that easy with the function complete (equivalent to expand and join) and with fill (filling NA with the previous non-NA value). I'll use log scales, so better to avoid zero values.

## although coordinates are longitude/latitude, st_intersection assumes that they are planar

How large is the pandemic?

First of all, a sensible question to ask is what is the magnitude order of the pandemic. Because, there is a very sensitive trade-off between impacting the whole population of a country, at the risk that some households would go bankrupt and all the consequences that it would have, and save lives. Even if life has no value, induced poverty could affect a much more important part of the population and have long term consequences which could be even worst than the immediate effect of the virus.

Moreover, by digging deeper, the consequences of the virus could have been mitigated. Taking Belgium as an example, where 70% of the deaths are Patients from rest houses, whereas the death rate per capita in Sweden is much lower while they didn't lock-down the country. Of course, there are a lot of factors, but we might ask: did the politics act rationally, and did they protect the population correctly while preserving the well being of the rest of the population, especially those struggling for the day to day living? Obviously the answer is no, for Belgium at least. Is the forthcoming boom of the unemployment rate worth the price? What would be the benefit of the actual virus management? All those questions are delicate and deserve a profound reflection. I'll not dare to answer but I'll just provide the magnitude order so far of the virus compared to the overall yearly mortality for some countries.

Raw ranking

The issue is that there is no common definition of what is a "covid death". Therefore the absolute numbers cannot really be related (excepted if two countries share the same definition) however, the resulting ranking can still make sense.

Excess mortality

One can remove the bias of the definition by using the excess mortality however, we cannot rule out other causes (or deaths indirectly caused by covid such as the saturation of hospital capacity). The zero baseline represents the average of deaths, all causes.

Absolute numbers per million inhabitants

Excess in percentage of expected number of deaths

Monthly

A better way to quantify is to express the excess in the percentage of the expected number of deaths, to quantify the increase relative to the historical expectation.

Weekly

## `geom_smooth()` using method = 'loess' and formula 'y ~ x'

Are excess and covid deaths matching?

The excess mortality is not necessarily due only to the covid, some other causes might exist (indirect deaths due to the lack of hospital capacity or any other cause, which can be specific to a country). However, we might expect the excess and covid deaths to match if the definition of "covid death" set by the authorities is consistent.

In the following illustrations, the area stands for the reported covid deaths, the line for the excess death and the zero value for the average of deaths (all causes).

Top 5 countries, aggregate figures - Worldwide and in EU

Top 5 countries, absolute and per million of inhabitants figures Absolute numbers illustrate that the larger the country the more cases (not necessarily following the population ranking). The relative numbers (per million of inhabitants) illustrate how the country is globally dealing with the crisis (of course, density and other parameters make the fight harder, but mostly the policy, how quick we react and analytics usage are involved). BE is part of the 5 worst countries, using relative numbers. Especially for the number of deaths per million. This is mainly due to inefficient policies, especially regarding the rest houses (accounting for 50% of the deaths in BE)

Worldwide

## label_key: country
## label_key: country
## label_key: country
## label_key: country
## label_key: country
## label_key: country

Europe

Highlighting the most impacted countries in EU.

## label_key: country
## label_key: country
## label_key: country
## label_key: country
## label_key: country
## label_key: country

Some countries without national lockdown

The case of Sweden, for instance, is a bit particular since they didn't apply a national lockdown while most of the other EU countries did. Did it work? Apparently, this decision was pretty armful for the population.

## label_key: country
## label_key: country
## label_key: country
## label_key: country
## label_key: country
## label_key: country

Weekly new cases

Worldwide 16 top countries

Europe - 16 countries randomly chosen

Geomapping

EU

Observations of the last full week

Animation by week since the 100th case

or if you don't like GIFs, using facet:

World

## although coordinates are longitude/latitude, st_intersection assumes that they are planar

Observations of the last full week

Animation by week since the 100th case

USA - regional data

The regional US data are fetch from the NY Times github NY Times github

Last observations

Evolution over time

Using GIF

or facet plot

Evolution per State

## label_key: state
## label_key: state