Short reminder:
The human genome, with all its magic, is about 3,117,275,501 base pairs long. Source: Wikipedia
If you would encode that data digitally, and store it on a SSD drive, it would take up < 1 GB.
So, if we can do so much magic with 1 GB, that should be an inspiration to all software to do more, with less space.
Thank you for coming to my talk.
Yeah, but nobody’s gonna encode all of humanity’s genes at once. It’s like taking the storage of all game sava data of all users combined. It doesn’t make sense.
Normally, you look at the storage space for one individual at a time.
There is an entire research field about looking at sets of genomes. It’s called pangenomics. I think they are at hundreds of thousands of human genomes of available data right now. Ten thousand from a few years ago I know for sure.
Considering multiple genomes is one of the keys to understanding the effects of the different genomic variants on the individual. One can for example look at various chronic diseases and see if there is anything special about the genomes of the sick individuals compared to the healthy ones.
This requires a lot of samples, because just comparing one sick and one healthy individual will bring up a lot of false positive variants, that differ between the individuals, but are not related to the disease.
thanks, I hadn’t thought of that.