This was originally written as an Answer to a Question posted to Scientific American Online; however, as what they published was considerably shorter and simpler than what I wrote, I shall post the original here.
The answer to this question is not simple, because, while viruses all share the characteristics of being obligate intracellular parasites which use host cell machinery to make their components which then self-assemble to make particles which contain their genomes, they most definitely do not have a single origin, and indeed their origins may be spread out over a considerable period of geological and evolutionary time. Viruses infect all types of cellular organisms, from Bacteria through Archaea to Eukarya; from E. coli to mushrooms; from amoebae to human beings – and virus particles may even be the single most abundant and varied organisms on the planet, given their abundance in all the waters of all the seas of planet Earth. Given this diversity and abundance, and the propensity of viruses to swap and share successful modules between very different lineages and to pick up bits of genome from their hosts, it is very difficult to speculate sensibly on their deep origins – but I shall outline some of the probable evolutionary scenarios.
It is generally accepted that many viruses have their origins as “escapees” from cells; rogue bits of nucleic acid that have taken the autonomy already characteristic of certain cellular genome components to a new level. Simple RNA viruses are a good example of these: their genetic structure is far too simple for them to be degenerate cells; indeed, many resemble renegade messenger RNAs in their simplicity. What they have in common is a strategy which involves use of a virus-encoded RNA-dependent RNA polymerase (RdRp) or replicase to replicate RNA genomes – a process which does not occur in cells, although most eukaryotes so far investigated do have RdRp-like enzymes involved in regulation of gene expression and resistance to viruses. The surmise is that in some instances, an RdRp-encoding element could have became autonomous – or independent of DNA – by encoding its own replicase, and then acquired structural protein-encoding sequences by recombination, to become wholly autonomous and potentially infectious. Given the very significant diversity in these sorts of viruses, it is quite possible that this has happened a number of times in the evolution of cellular organisms on this planet – and that while some RNA viruses like bacterial RNA viruses or bacteriophages and some plant viruses may be very ancient indeed, others – such as mononegaviruses, including Ebola and rabies viruses – may be evolutionarily much younger.
The possibility that certain non-retro RNA viruses can actually insert bits of themselves by obscure mechanisms into host cell genomes – and afford them protection against future infection – complicates the issue rather, by reversing the probable flow of genetic material!
The retroviruses are another good example of possible cell-derived viruses, as many of these have a very similar genetic structure to elements which appear to be integral parts of cell genomes – termed retrotransposons - and share the peculiar property of replicating their genomes via a pathway which goes from single-stranded RNA through double-stranded DNA (reverse transcription) and back again, and yet have become infectious. They can go full circle, incidentally, by permanently becoming part of the cell genome by insertion into germ-line cells – so that they are then inherited as “endogenous retroviruses“, which can be used as evolutionary markers for species divergence.
Indeed, there is a whole extended family of reverse-transcribing mobile genetic elements in organisms ranging from bacteria all the way through to plants, insects and vertebrates – and which includes two completely different groups of double-standed DNA viruses, the vertebrate-infecting hepadnaviruses or hepatitis B virus-like group, and the plant-infecting badna- and caulimoviruses. All of these cellular elements and viruses have in common a “reverse transcriptase” or RNA-dependent DNA polymerase, which may in fact be an evolutionary link back to the postulated “RNA world” at the dawn of evolutionary history, when the only extant genomes were composed of RNA. Thus, a part of what could be a very primitive machinery indeed has survived into very different nucleic acid lineages, some viral and many wholly cellular in nature, from bacteria through to higher eukaryotes.
There are also obvious similarities in mode of replication between a family of elements which include bacterial plasmids, bacterial single-strand DNA viruses, and viruses of eukaryotes which include geminiviruses and nanoviruses of plants, parvoviruses of insects and vertebrates, and circoviruses and anelloviruses of vertebrates. These agents all share a “rolling circle” DNA replication mechanism, with replication-associated proteins and DNA sequence motifs that appear similar enough to be evolutionarily related – and again demonstrate a continuum from the cell-associated and cell-dependent plasmids through to the completely autonomous agents such as relatively simple bacterial and eukaryote viruses.
However, there are a significant number of viruses with large DNA genomes for which an origin as cell-derived subcomponents is not as obvious. In fact, the largest viruses yet discovered – mimiviruses, with a genome size of greater than 1 million base pairs of DNA – have genomes which are larger and more complex than those of obligately parasitic bacteria such as Mycoplasma genitalium (around 0.5 million), despite their sharing the life habits of tiny viruses like canine parvovirus (0.005 million, or 5000 bases). In fact, it is a striking fact that the largest viral DNA genomes so far characterised seem to infect primitive eukaryotes such as amoebae and simple marine algae – and they and other large DNA viruses like pox- and herpesviruses seem to be related to cellular DNA sequences only at a level close to the base of the “tree of life”. This indicates a very ancient origin or set of origins for these viruses, which may conceivably have been as obligately parasitic cellular lifeforms which then made the final adaptation to the “virus lifestyle”. However, their actual origin could be in an even more complex interaction with early cellular lifeforms, given that viruses may well be responsible for very significant episodes of evolutionary change in cellular life, all the way from the origin of eukaryotes through to the much more recent evolution of placental mammals. In fact, there is informed speculation as to the possibility of viruses having significantly influenced the evolution of eukaryotes as a cognate group of organisms.
In summary, viruses are as much a concept as a unitary entity: all viruses have in common is a base-level strategy for replicating their genomes; their origins are possibly as varied as their genomes, and may remain forever obscure.
Tags: DNA, Evolution, geminivirus, mammals, mimivirus, retrovirus, RNA, rolling circle, virus
2 October, 2008 at 2:18 pm |
[...] samples, or even blood of pandemic survivors to speculate on the origins of specific viruses, of viruses generally, or on the nature of old pandemic strains. Now HIV falls under the spotlight – again – as the [...]
20 November, 2008 at 4:30 pm |
[...] large DNA viruses – pox-, irido-, herpes-, asfar-, mimi- and baculoviruses and their ilk – have a deep and possibly complex evolutionary history, and there is considerable evidence to suggest long histories of co-speciation [...]
21 July, 2009 at 3:55 pm |
Why do virus originate before eukaryotes and prokaryotes? Are there any reasons or theories for this?
21 July, 2009 at 4:20 pm |
Hi Joleen: well, there is no evidence that they did…but almost certainly viruses were there when the first cells were getting going, meaning before eukaryotes – and they may even have helped give rise to eukaryotes, if you believe the “captured virus” scenario for the origin of the nucleus.
1 August, 2009 at 6:36 pm |
Hi Ed: Thank you for the reply. I have a better understanding of the virus now. Thank you.