Dataset: 9.3K articles from Wikipedia (CC BY-SA).

Made by DATEXIS (Data Science and Text-based Information Systems) at Beuth University of Applied Sciences Berlin

Deep Learning Technology: Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A. Gers and Alexander Löser. Learning Contextualized Document Representations for Healthcare Answer Retrieval. The Web Conference 2020 (WWW'20)

Funded by The Federal Ministry for Economic Affairs and Energy; Grant: 01MD19013D, Smart-MD Project, Digital Technologies

Severe acute respiratory syndrome coronavirus 2


Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the virus strain that causes coronavirus disease 2019 (COVID-19), a respiratory illness. It is colloquially known as the coronavirus, and was previously referred to by its provisional name 2019 novel coronavirus (2019-nCoV). SARS-CoV-2 is a positive-sense single-stranded RNA virus. It is contagious in humans, and the World Health Organization (WHO) has designated the ongoing pandemic of COVID-19 a Public Health Emergency of International Concern. Because the strain was first discovered in Wuhan, China, it is sometimes referred to as "Wuhan virus" or "Wuhan coronavirus". Because the WHO discourages location-based names such as MERS, and to avoid confusion with the disease SARS, the WHO sometimes refers to SARS-CoV-2 as "the COVID-19 virus" in public health communications. The general public frequently calls both SARS-CoV-2 and the disease it causes "coronavirus", but scientists typically use more precise terminology.

Taxonomically, SARS-CoV-2 is a strain of Severe acute respiratory syndrome-related coronavirus (SARSr-CoV). It is believed to have zoonotic origins and has close genetic similarity to bat coronaviruses, suggesting it emerged from a bat-borne virus. An intermediate animal reservoir such as a pangolin is also thought to be involved in its introduction to humans. The virus shows little genetic diversity, indicating that the spillover event introducing SARS-CoV-2 to humans is likely to have occurred in late 2019.

Epidemiological studies estimate that each infection results in 1.4 to 3.9 new ones when no members of the community are immune and no preventive measures are taken. The virus is primarily spread between people through close contact and via respiratory droplets produced from coughs or sneezes. It mainly enters human cells by binding to the receptor angiotensin-converting enzyme 2 (ACE2).

Virology | Infection

Human-to-human transmission of SARS-CoV-2 has been confirmed during the 2019–20 coronavirus pandemic. Transmission occurs primarily via respiratory droplets from coughs and sneezes within a range of about 1.8 metres (6 ft). Indirect contact via contaminated surfaces is another possible cause of infection. Preliminary research indicates that the virus may remain viable on plastic and steel for up to three days, but does not survive on cardboard for more than one day or on copper for more than four hours; the virus is inactivated by soap, which destabilises its lipid bilayer. Viral RNA has also been found in stool samples from infected individuals.

The degree to which the virus is infectious during the incubation period is uncertain, but research has indicated that the pharynx reaches peak viral load approximately four days after infection. On 1 February 2020, the World Health Organization (WHO) indicated that "transmission from asymptomatic cases is likely not a major driver of transmission". However, an epidemiological model of the beginning of the outbreak in China suggested that "pre-symptomatic shedding may be typical among documented infections" and that subclinical infections may have been the source of a majority of infections.

There is some evidence of human-to-animal transmission of SARS-CoV-2, including examples in felids. Some institutions have advised those infected with SARS-CoV-2 to restrict contact with animals.

Virology | Reservoir

The first known infections from the SARS-CoV-2 strain were discovered in Wuhan, China. The original source of viral transmission to humans remains unclear, as does whether the strain became pathogenic before or after the spillover event. Because many of the first individuals found to be infected by the virus were workers at the Huanan Seafood Market, it has been suggested that the strain might have originated from the market. However, other research indicates that visitors may have introduced the virus to the market, which then facilitated rapid expansion of the infections. A phylogenetic network analysis of 160 early coronavirus genomes sampled from December 2019 to February 2020, using a bat coronavirus as an outgroup, found that type "B", which predominated among early samples from Wuhan, is more distantly related to the bat coronavirus than the ancestral type "A".

Research into the natural reservoir of the virus strain that caused the 2002–2004 SARS outbreak has resulted in the discovery of many SARS-like bat coronaviruses, most originating in the Rhinolophus genus of horseshoe bats. Two viral nucleic acid sequences found in samples taken from Rhinolophus sinicus show a resemblance of 80% to SARS-CoV-2. A third viral nucleic acid sequence from Rhinolophus affinis, collected in Yunnan province and designated RaTG13, has a 96% resemblance to SARS-CoV-2. Nearly half of SARS-CoV-2's genome has a phylogenetic lineage distinct from known relatives.
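Resemblance figures such as the 80% and 96% above are typically derived from how many positions two aligned sequences share. As a rough illustration only, the sketch below computes percent identity between two already-aligned nucleotide sequences of equal length. The function name `percent_identity` and the 25-base toy sequences are hypothetical and invented for this example; real comparisons work on whole-genome alignments of roughly 30,000 bases produced by dedicated alignment tools.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two aligned sequences carry the same base."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where both sequences have the same character.
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Toy sequences, invented for illustration (not real viral data).
reference = "ATGGCTAGCTTAGGCTAACGTTAGC"
candidate = "ATGGCTAGCTTAGGCTAACGTTAGT"  # differs from reference at the last base only

print(f"{percent_identity(reference, candidate):.1f}%")
```

With 24 of 25 positions matching, the toy pair comes out at 96% identity, the same order of resemblance reported for RaTG13.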

One phylogenetic analysis has proposed that pangolins are the reservoir host rather than the intermediate host. While there is scientific consensus that bats are the ultimate source of coronaviruses, this analysis suggests that SARS-CoV-2 originated from a pangolin, jumped back to bats, and then jumped to humans. Based on whole genome sequence similarity, pangolin coronaviruses are less similar than RaTG13 to SARS-CoV-2 but are more similar than other bat coronaviruses to SARS-CoV-2. On this hypothesis, a specific population of bats would be the intermediate host for SARS-CoV-2, while an evolutionary ancestor to bats was the source for general coronaviruses. Microbiologists and geneticists in Texas have independently found evidence of reassortment in coronaviruses suggesting involvement of pangolins in the origin of SARS-CoV-2.

A metagenomic study published in 2019 revealed that SARS-CoV, the strain of the virus that causes SARS, was the most widely distributed coronavirus among a sample of Sunda pangolins. On 7 February 2020, South China Agricultural University in Guangzhou announced that researchers had discovered a pangolin sample with a viral nucleic acid sequence "99% similar" to SARS-CoV-2. When released, the results clarified that "the receptor-binding domain of the S protein of the newly discovered Pangolin-CoV is virtually identical to that of 2019-nCoV, with one amino acid difference." Pangolins are protected under Chinese law, but their poaching and trading for use in traditional Chinese medicine remain common.

Virology | Phylogenetics and taxonomy

SARS-CoV-2 belongs to the broad family of viruses known as coronaviruses. It is a positive-sense single-stranded RNA (+ssRNA) virus, with a single linear RNA segment. Other coronaviruses are capable of causing illnesses ranging from the common cold to more severe diseases such as Middle East respiratory syndrome (MERS). It is the seventh known coronavirus to infect people, after 229E, NL63, OC43, HKU1, MERS-CoV, and the original SARS-CoV.

Like the SARS-related coronavirus strain implicated in the 2003 SARS outbreak, SARS-CoV-2 is a member of the subgenus Sarbecovirus (beta-CoV lineage B). Its RNA sequence is approximately 30,000 bases in length. SARS-CoV-2 is unique among known betacoronaviruses in its incorporation of a polybasic cleavage site, a characteristic known to increase pathogenicity and transmissibility in other viruses.

With a sufficient number of sequenced genomes, it is possible to reconstruct a phylogenetic tree of the mutation history of a family of viruses. By 12 January 2020, five genomes of SARS-CoV-2 had been isolated from Wuhan and reported by the Chinese Center for Disease Control and Prevention (CCDC) and other institutions; the number of genomes increased to 42 by 30 January 2020. A phylogenetic analysis of those samples showed they were "highly related with at most seven mutations relative to a common ancestor", implying that the first human infection occurred in November or December 2019. As of 27 March 2020, 1,495 SARS-CoV-2 genomes sampled on six continents were publicly available.

On 11 February 2020, the International Committee on Taxonomy of Viruses (ICTV) announced that according to existing rules that compute hierarchical relationships among coronaviruses on the basis of five conserved sequences of nucleic acids, the differences between what was then called 2019-nCoV and the virus strain from the 2003 SARS outbreak were insufficient to make them separate viral species. Therefore, the committee identified 2019-nCoV as a strain of Severe acute respiratory syndrome-related coronavirus.

Virology | Structural biology

Each SARS-CoV-2 virion is approximately 50–200 nanometres in diameter. Like other coronaviruses, SARS-CoV-2 has four structural proteins, known as the S (spike), E (envelope), M (membrane), and N (nucleocapsid) proteins; the N protein holds the RNA genome, and the S, E, and M proteins together create the viral envelope. The spike protein, which has been imaged at the atomic level using cryogenic electron microscopy, is the protein responsible for allowing the virus to attach to and fuse with the membrane of a host cell.

Protein modeling experiments on the spike protein of the virus soon suggested that SARS-CoV-2 has sufficient affinity to the receptor angiotensin-converting enzyme 2 (ACE2) on human cells to use it as a mechanism of cell entry. By 22 January 2020, a group in China working with the full virus genome and a group in the United States using reverse genetics methods independently and experimentally demonstrated that ACE2 could act as the receptor for SARS-CoV-2. Studies have shown that SARS-CoV-2 has a higher affinity to human ACE2 than the original SARS virus strain. SARS-CoV-2 may also use basigin to assist in cell entry.


Based on the low variability exhibited among known SARS-CoV-2 genomic sequences, the strain is thought to have been detected by health authorities within weeks of its emergence among the human population in late 2019. The earliest case of infection currently known is thought to have been found on 17 November 2019. The virus subsequently spread to all provinces of China and to more than 150 other countries in Asia, Europe, North America, South America, Africa, and Oceania. Human-to-human transmission of the virus has been confirmed in all of these regions. On 30 January 2020, the WHO designated the outbreak a Public Health Emergency of International Concern, and on 11 March 2020 it declared the outbreak a pandemic.

The basic reproduction number (R0) of the virus has been estimated to be between 1.4 and 3.9. This means that each infection from the virus is expected to result in 1.4 to 3.9 new infections when no members of the community are immune and no preventive measures are taken. The reproduction number may be higher in densely populated conditions such as those found on cruise ships. Many forms of preventive efforts may be employed in specific circumstances in order to reduce the propagation of the virus.
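To make the range concrete, the following minimal sketch shows how the expected number of new infections per generation would grow if every case caused exactly R0 further cases. This is a deliberately idealized assumption (no immunity, no preventive measures, no variation between cases, an unlimited susceptible population), and the function name `expected_cases_by_generation` is hypothetical.

```python
from typing import List


def expected_cases_by_generation(
    r0: float, generations: int, initial_cases: float = 1.0
) -> List[float]:
    """Expected new infections in generations 0..generations.

    Idealized assumption: every case causes exactly r0 new cases,
    so generation g contains initial_cases * r0**g infections.
    """
    return [initial_cases * r0 ** g for g in range(generations + 1)]


# Compare the lower and upper ends of the estimated R0 range.
for r0 in (1.4, 3.9):
    cases = expected_cases_by_generation(r0, generations=5)
    print(r0, [round(c, 1) for c in cases])
```

Under these assumptions a single case leads to roughly 5 new infections in the fifth generation at R0 = 1.4, but roughly 900 at R0 = 3.9, which is why the width of the estimated range matters.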

There have been about 82,000 confirmed cases of infection in mainland China. While the proportion of infections that result in confirmed cases or progress to diagnosable disease remains unclear, one mathematical model estimated that 75,815 people were infected in Wuhan alone on 25 January 2020, at a time when the number of confirmed cases worldwide was only 2,015. Before 24 February 2020, over 95% of all deaths from COVID-19 worldwide had occurred in Hubei province, where Wuhan is located. As of 17 April 2020, the percentage had decreased to 2.1%.
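The gap between the modelled and confirmed figures for 25 January 2020 can be expressed as a simple ratio. The snippet below is only a worked calculation on the two numbers quoted above; the variable names are illustrative, and the comparison is approximate because the model estimate covers Wuhan alone while the confirmed count is worldwide.

```python
# Figures quoted in the text for 25 January 2020.
estimated_infected_wuhan = 75_815  # model estimate, Wuhan only
confirmed_worldwide = 2_015        # confirmed cases, worldwide

# How many estimated infections there were per confirmed case.
ratio = estimated_infected_wuhan / confirmed_worldwide
print(f"Estimated infections per confirmed case: {ratio:.1f}")
```

The result is a ratio of roughly 38 estimated infections for every confirmed case at that date.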

As of 17 April 2020, there have been 2,234,109 total confirmed cases of SARS-CoV-2 infection in the ongoing pandemic. The total number of deaths attributed to the virus is 153,379. Many recoveries from confirmed infections go unreported, but at least 567,695 people have recovered from confirmed infections.
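For orientation, the totals above can be turned into crude shares of confirmed cases. The sketch below is a worked calculation on the quoted figures only, with illustrative variable names. These shares are not fatality or recovery rates: many infections are never confirmed, recoveries are under-reported, and a large number of confirmed cases had no recorded outcome at that date.

```python
# Totals quoted in the text as of 17 April 2020.
confirmed = 2_234_109
deaths = 153_379
recovered = 567_695  # lower bound, since many recoveries go unreported

# Crude shares of confirmed cases, in percent.
death_share = 100 * deaths / confirmed
recovered_share = 100 * recovered / confirmed

# Confirmed cases with neither a recorded death nor a recorded recovery.
unresolved = confirmed - deaths - recovered

print(f"Deaths:     {death_share:.1f}% of confirmed cases")
print(f"Recoveries: {recovered_share:.1f}% of confirmed cases (at least)")
print(f"No recorded outcome: {unresolved:,} confirmed cases")
```

About two thirds of confirmed cases fall into the last group, which is the main reason the crude shares should not be read as final outcomes.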