Illumina Sequences the First African Human Genome

7 Feb 2008

Product news

Illumina, Inc. today announced that scientists at the Company have sequenced the genome of an anonymous African male (Yoruba from Ibadan, Nigeria), using the Genome Analyzer. Sequencing of this HapMap sample was conducted internally and marks the first human genome sequence generated exclusively with paired reads of 35 to 50 bases in length. Leveraging recent system improvements that increase the throughput and improve the accuracy of the Genome Analyzer, Illumina scientists were able to complete this project in a matter of weeks. More than 95 percent of production runs generated high-quality data with an average of over three billion bases (three Gb) per run. This achievement establishes the direct utility of Illumina’s sequencing technology for accurately sequencing large and complex genomes.

“This landmark project demonstrates that scientists can use the Genome Analyzer today to economically and rapidly complete large-scale sequencing projects including human genome sequencing,” said Jay Flatley, President and Chief Executive Officer of Illumina. “With this project we have established both the efficacy of our technology to consistently produce large volumes of high-quality data, and the utility of our unique short-insert paired read approach for large-scale sequencing.”

Using paired reads primarily from 200 base pair (bp) insert size libraries, supplemented with reads from 2000 bp insert size libraries, Illumina scientists initially conducted 27 runs to generate over 75 Gb of DNA sequence and achieve more than 90 percent coverage of the genome. An initial analysis yielded over 3.7 million single nucleotide polymorphisms (SNPs), including more than one million novel SNPs. As expected, the remaining SNPs correspond to those found in public databases. The data generated by the Genome Analyzer shows excellent concordance with those generated using other technologies: of previously known SNPs, Illumina’s Genome Analyzer shows both 98 percent concordance with data generated using Illumina’s HumanHap550 genotyping BeadChip, as well as with data generated by the HapMap project. The high concordance rate of SNPs detected by the Genome Analyzer with SNPs detected by independent technologies affirms the accuracy of Illumina’s sequencing technology for discovery and validation of polymorphisms. Additional analysis work is under way using both long- and short-insert read pairs to characterize known and novel structural variation in this genome.

“This study is a testament to the robustness of the system and the speed and economic benefits of Illumina’s sequencing technology,” said David Bentley, DPhil, Vice President and Chief Scientist at Illumina. “Making whole human genome sequencing routine will enable the study of natural human variation all over the world, and enable the use of medical sequencing at the whole genome level to unravel the full spectrum of mutations, which give rise to cancer.”

Data and analyses from this project will be presented by Dr. Bentley on February 8th at the 2008 Advances in Genome Biology and Technology (AGBT) conference in Marco Island, Florida.

Links

Tags

Genome AnalysisGenomics, the study of genomes, includes functional genomics, evolutionary genomics and comparative genomics. There are many genomic technologies such as DNA sequencing of whole genomes, computational biology and bioinformatics. DNA and nucleic acids must be isolated and concentrated from cells for analysis with kits, automated analyzers and software. Other useful technologies for studying genomics include PCR, microarrays and electrophoresis.Compound LibrariesCompound libraries, or chemical libraries, are used in drug discovery for the identification of potential therapeutics compounds. Used in conjunction with high-throughput screening, the libraries of stored compounds are often generated for specific purposes as a drug target or disease model. Cheminformatics are commonly used when designing a compound library and software can be used to analyze the screening process.  Microarray AnalysisMicroarrays, also known as biochips, are used for the detection and analysis of multiple genes, proteins, antibodies, or biomarkers on a single microchip. This can reveal information on protein or gene expression, single nucleotide polymorphism (SNP), copy number variation (CNV), epigenetics and patient health in clinical diagnostic tests. Discover a range of microarray scanners and prefabricated antibody, protein, RNA and DNA microarrays for your analysis or consider creating your own custom microarrays with a microarray printer. Find the best microarray products in our peer-reviewed product directory: compare products, check customer reviews and receive pricing direct from manufacturers.SNPs TechnologySingle nucleotide polymorphisms (SNPs) are individual base variations in a DNA sequence. SNPs are used in research to study predispositions to disease and drug discovery. Products for studying single nucleotide polymorphisms include SNP arrays and detection systems to detect polymorphisms, SNP typing systems for genotyping DNA, and PCR to amplify specific SNPs.DNA SequencingDNA sequencing, such as sanger sequencing, is a biological technique that determines the precise order of nucleotide bases in a fragment or template of DNA. DNA sequencers and genetic analyzers are based on capillary electrophoresis, where labeled DNA fragments are electrophoretically separated by size as they migrate through a polymer. Find the best DNA sequencing products, including DNA sequencing kits, genomic libraries and genetic identity kits in our peer-reviewed product directory: compare products, check customer reviews and receive pricing direct from manufacturers.
Illumina Sequences the First African Human Genome