Our technological developments and commitment in the COVID-19 outbreak
THE SPREAD OF A VIRUS
At the beginning of 2020, reports emerged of an outbreak in China of a previously unknown respiratory tract disease caused by a virus. This event is fundamentally changing life as we know it in almost all parts of the world: a pandemic with still no foreseeable end. SARS-CoV-2 is the causative agent of the pandemic. It is a newly encountered member of the coronavirus family, which belongs to the RNA viruses, and its behaviour is comparable to that of influenza viruses or of SARS-CoV, the causative agent of the 2002/03 outbreak. As soon as virus particles get into a (human) host, they start invading cells, in this case predominantly cells of the respiratory tract, and the host's cells replicate the virus's genome. Virus particles get into the host's saliva, and people infect one another when talking with infected individuals, through hand contact and through close face-to-face interaction.
Governments, among others, need to learn when and how the virus is spreading in order to decide on appropriate measures. So-called COVID-19 contact tracing apps are highly important for limiting the spread of the disease.
ON THE IMPORTANCE OF CONTACT TRACING APPS
Disease models of the spread of COVID-19 show that high-precision isolation is the most effective measure in a virus pandemic. The spread of the deadly coronavirus needs to be slowed down as quickly as possible, while minimizing its economic impact.
COVID-19 mobile contact tracing apps are an important success factor in achieving that. Several countries have shown that monitoring and tracking the collective movement of millions of people is necessary to cope with a pandemic. Multiple institutions and companies have developed COVID-19 smartphone apps that serve the needs of the individual, including symptom analysis and exposure alerts for COVID-19 hotspots. The collected data can be used for statistical insights into symptoms, and it also enables high-precision self-isolation as well as rapid identification of those exposed to COVID-positive people. These applications are necessary to minimize the impact on public health and the economy.
Quite a few COVID-19 apps are available on the market, and many of them are fully open source. These apps differ in functionality, but their core workflow is similar and can be described like this: Every user collects tracing data on their mobile phone. The generated contact IDs are stored on the user's phone only. If the health authorities diagnose a user as positive for the coronavirus, the user can (but usually doesn't have to!) share their data and transfer it to a server. Any other user of the app can then learn whether they have potentially been in contact with a user who tested positive by comparing their own data with the updated data on the server.
This workflow raises two privacy issues that need to be considered when data is shared and compared: the privacy of the diagnosed patient and the privacy of every other user's tracing data.
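The core workflow described above can be sketched in a few lines of Python. This is only an illustrative model, not the code of any particular app: the class names `Phone` and `TracingServer`, the use of random hex strings as contact IDs, and the choice that a diagnosed user uploads the IDs their own phone broadcast are all assumptions made for the example. The final comparison is deliberately naive: it reads the server's list in the clear, which is exactly the step where the privacy issues arise.

```python
import secrets


class TracingServer:
    """Hypothetical server: stores only IDs voluntarily uploaded by diagnosed users."""

    def __init__(self):
        self.reported_ids = set()

    def upload(self, contact_ids):
        self.reported_ids.update(contact_ids)


class Phone:
    """Hypothetical phone: all tracing data stays local until the user shares it."""

    def __init__(self):
        self.broadcast_ids = []    # random IDs this phone has sent out
        self.observed_ids = set()  # IDs received from nearby phones

    def new_broadcast_id(self):
        contact_id = secrets.token_hex(16)  # unlinkable random identifier
        self.broadcast_ids.append(contact_id)
        return contact_id

    def meet(self, other):
        # Both phones exchange a fresh random ID during an encounter.
        self.observed_ids.add(other.new_broadcast_id())
        other.observed_ids.add(self.new_broadcast_id())

    def report_positive(self, server):
        # Optional step after a positive diagnosis.
        server.upload(self.broadcast_ids)

    def check_exposure(self, server):
        # Naive plaintext comparison with the server's list.
        return bool(self.observed_ids & server.reported_ids)
```

For example, if one phone meets a second phone and later reports a positive diagnosis, `check_exposure` returns `True` on the second phone and `False` on a third phone that never met either of them.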
COVID-19 CONTACT TRACING APP PRIVACY CONCERNS
To fulfil their purpose, COVID-19 contact tracing apps must be adopted by most people. Because the apps capture sensitive and private data such as contact traces and infection status, they must be trusted, and data privacy is crucial for that: privacy drives trust and trust drives adoption. Cryptographic technologies allow computations in a privacy-preserving manner and are therefore one of the key technologies to help end the pandemic:
Private set intersection (PSI) is a powerful cryptographic technique which allows two parties to compare data with one another without exposing their raw data to the other party.
For COVID-19 apps, PSI allows a user to check whether the tracing data they collected matches the traces of diagnosed patients, without revealing their private tracing data to the server. Depending on the type of PSI protocol, the client then learns only the matching traces themselves, or only the count of matching traces. This prevents data from becoming publicly available and being exploited or abused. Additionally, the central server does not have to collect all users' data but only the contact traces of infected users.
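To make the idea more concrete, here is a minimal sketch of one classic way to build PSI, based on commutative (Diffie-Hellman-style) blinding. It is not the protocol or API of our library, and it is not secure as written: the modulus `P` is a toy value, both parties run inside one function, and the names `hash_to_group`, `new_secret`, `blind` and `psi` are hypothetical. It only illustrates why the client can find matches without either side seeing the other's raw identifiers.

```python
import hashlib
import math
import secrets

# Toy modulus: the Mersenne prime 2**127 - 1. It is far too small for real
# use and was chosen only to keep the sketch short (assumption of this example).
P = 2**127 - 1


def hash_to_group(item: str) -> int:
    """Map an identifier to a number in [2, P - 1]."""
    digest = hashlib.sha256(item.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (P - 2) + 2


def new_secret() -> int:
    """Random exponent coprime to P - 1, so blinding never merges distinct values."""
    while True:
        key = secrets.randbelow(P - 3) + 2
        if math.gcd(key, P - 1) == 1:
            return key


def blind(values, key):
    """Raise every value to the secret exponent modulo P."""
    return [pow(v, key, P) for v in values]


def psi(client_items, server_items):
    """Return the client items that also appear in the server's set."""
    a = new_secret()  # known only to the client
    b = new_secret()  # known only to the server

    # Client -> server: client items blinded with a.
    client_blinded = blind([hash_to_group(x) for x in client_items], a)

    # Server -> client: the client's values blinded again with b (same order),
    # plus the server's own items blinded with b.
    client_double = blind(client_blinded, b)
    server_blinded = blind([hash_to_group(y) for y in server_items], b)

    # Client: blind the server's values with a. Since (h^a)^b == (h^b)^a,
    # equal items produce equal double-blinded values.
    server_double = set(blind(server_blinded, a))
    return {x for x, d in zip(client_items, client_double) if d in server_double}
```

Calling `psi(["id-a", "id-b", "id-c"], ["id-c", "id-x", "id-a"])` yields the two shared identifiers. A variant that reveals only the count of matches would have the server shuffle the double-blinded client values before returning them, so the client can count matches but no longer tell which of its items matched.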
Differential Privacy is another privacy-preserving technique, which enables private data analysis. It can be used by organizations to learn statistical information about a dataset while ensuring that the statistical results do not allow any individual's data to be reverse engineered and identified. Differential privacy is relevant for governments that want to make use of the app data without exposing the individual user. For more information on how Differential Privacy works, check out our blog post here.
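As a rough illustration of how such statistics can be released, the sketch below applies the Laplace mechanism, a standard differential privacy building block, to a simple count. The function names `laplace_noise` and `private_count`, and the example of counting users, are assumptions made for this illustration and are not taken from any specific app.

```python
import random


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponential draws."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(true_count, epsilon, rng=None):
    """Release a count with epsilon-differential privacy (Laplace mechanism).

    Adding or removing one person changes a count by at most 1, so the
    sensitivity is 1 and the noise scale is 1 / epsilon.
    """
    rng = rng or random.SystemRandom()
    sensitivity = 1.0
    return true_count + laplace_noise(sensitivity / epsilon, rng)
```

A smaller `epsilon` adds more noise and therefore gives stronger privacy at the cost of accuracy. Averaged over many releases the noise cancels out, which is why real deployments also have to keep track of how often the same data is queried.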
OUR CONTRIBUTION TO FIGHT COVID-19
To help reduce the spread of the coronavirus, we have developed a private set intersection library that contact tracing initiatives can incorporate into their COVID-19 apps. The need for privacy preservation in COVID-19 contact tracing apps, and how it is achieved, is also outlined in this article, which we published in collaboration with OpenMined. Our code is open source and you can clone the repository from GitHub here.
You can get more details on the importance of a COVID-19 app from our whitepaper, which we published early in the crisis. Its executive summary: epidemiological modelling of the spread of the disease shows that high-precision self-isolation is the best approach to stop the pandemic and thus to minimize its economic damage. Read more about this and our call to action for the development of a COVID-19 contact tracing app at https://www.covid-app.io/.
We partner with OpenMined, the largest open source community around privacy-preserving artificial intelligence. We are also actively collaborating with the TCN Coalition, a global coalition for privacy-first digital contact tracing protocols to fight COVID-19. We have offered our algorithmic privacy core to support COVID-19 contact tracing apps and are actively supporting several initiatives in Europe and the US with our technology. Furthermore, together with Microsoft, Amazon, Facebook, IBM, HP and Intel, we are one of the ten founding adopters of the Open Covid Pledge. You can read more about our commitment during the COVID-19 outbreak on our website.
PSI is versatile and not bound to the mobile-phone-to-central-server scenario of our COVID-19 app development efforts. It is applicable whenever two or more parties have an interest in comparing their data without disclosing the full data itself. Common use cases where a PSI protocol is useful include:
Private Contact Discovery: Users can find which of their private contacts also have a certain communication app (server),
DNA testing and pattern matching: A user who got her DNA sequenced can find out about sequences linked to genetic diseases which are stored on a database (server),
Remote diagnostics: A medical diagnostic program assigns a status (sick or not sick with a certain disease) to the vectorized electronic health record of a patient (client). While the client learns about her sickness, the program itself remains secret and the program owner (server) does not learn anything about the client's data,
Private record linkage: Two data owners hold different types of information for the same customer. To make data mining possible, both records must be linked together and made available without giving away any other private data stored,
Chemical compound comparison: In a federated learning setup where two companies jointly train a model (e.g. a QSAR model) on their data, the data can be pre-processed with PSI to find and eliminate duplicates and thereby streamline the model training.
Get more info!
If you are interested in our open source technology in the fight against COVID-19, check out the GitHub repository we co-developed and read the accompanying blog post we co-authored with OpenMined.