Korean tiger genome analysis
A whole-genome sequence of a tiger is obtained through large-scale parallel sequencing. The genetic information stored in the genome is then analyzed using annotation and a number of analysis techniques, which completes the Tiger Genome Project.
De novo sequencing, the technique used in the Tiger Genome Project, requires the most complicated and delicate analysis methods because it has to manage and process a very large amount of data. The Genome Research Foundation is equipped with mass-storage computer systems, and cloud computing capacity is secured through a partnership with KT, which makes these large-scale genome analyses possible.
de novo Analysis
De novo sequencing uses a next-generation genome analysis technique and the construction of special libraries. To account for repeated sequences in the genome, diverse genome libraries are built and up to 80x genome coverage of sequencing data is produced. Successful analysis of a de novo genome sequence requires data analysis with many algorithms and sequence assembly from multiple genome libraries (200 bp, 500 bp, 2 kb, 5 kb, and 10 kb).
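The "80x" figure is a sequencing depth: the total number of sequenced bases divided by the genome size. The short sketch below shows that arithmetic. The function names and the genome size of 2.4 billion bases are assumptions chosen for illustration; they are not values stated by the project.

```python
def sequencing_depth(total_bases: int, genome_size: int) -> float:
    """Average depth (coverage) = sequenced bases / genome size."""
    return total_bases / genome_size


def bases_required(depth: float, genome_size: int) -> int:
    """Number of sequenced bases needed to reach a target average depth."""
    return round(depth * genome_size)


# Assumed, illustrative genome size (not taken from the text above).
ASSUMED_GENOME_SIZE = 2_400_000_000

needed = bases_required(80, ASSUMED_GENOME_SIZE)
print(f"Bases needed for 80x: {needed:,}")
print(f"Depth check: {sequencing_depth(needed, ASSUMED_GENOME_SIZE):.1f}x")
```

Under that assumed genome size, 80x depth corresponds to roughly 192 billion sequenced bases, which gives a sense of why mass storage and cloud computing are needed.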
<Figure. Schematic overview of the SOAPdenovo algorithm.>
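SOAPdenovo, the assembler named in the figure, is based on a de Bruijn graph: reads are cut into overlapping k-mers, and contigs are read off unambiguous paths through the graph. The toy sketch below illustrates only that core idea. It is not SOAPdenovo's implementation, and the reads, the value of k, and the function names are hypothetical. Error correction, repeat resolution, and scaffolding with the paired libraries are omitted.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])  # edge: prefix -> suffix
    return graph


def walk_contig(graph, start):
    """Extend a contig from `start` while the path is unambiguous.

    Stops at a dead end, at a branch (more than one successor),
    or when a node would be revisited (a cycle).
    """
    contig, node, seen = start, start, {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in seen:
            break
        contig += nxt[-1]  # each step adds one new base
        seen.add(nxt)
        node = nxt
    return contig


# Hypothetical overlapping reads from one short sequence.
reads = ["ATGGCGT", "GGCGTGC", "GTGCAAT"]
graph = build_de_bruijn(reads, k=4)
contig = walk_contig(graph, "ATG")
print(contig)
```

In a real genome, repeated sequences create branches in the graph where a simple walk like this one stops. That is where libraries with longer insert sizes (2 kb, 5 kb, 10 kb) help, by linking contigs across repeats.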