| Wednesday, 14 May 2008 |
|
|
|
Researchers Complete the Papaya Genome Sequencing Project |
|
Boston, US OBBeC - Researchers from the University of Hawaii (UH), leading an International Papaya Genome Consortium, have completed the Papaya Genome Sequencing Project. The team has relied on technology from SGI to provide the processing power and speed required to run the assemblies. The papaya genome was sequenced using a whole genome shotgun approach running on the combination of an SGI Altix system, SGI InfiniteStorage and Qube! scheduling software from PipelineFX. According to the press report, it has taken two years to construct the physical map and to sequence the 372 million base pairs. The disease-resistant "SunUp" papaya is now the first fruit, as well as the first transgenic crop, to be sequenced, helping to pave the way for international export of genetically modified fruits and vegetables."Sequencing the papaya genome is the first large scale genome project done in Hawaii with international collaboration from 22 research institutes and will benefit the papaya research community, papaya farmers, and consumers for years to come," said Maqsudul Alam, Director of the Centre for Advanced Studies in Genomics, Proteomics, and Bioinformatics, College of Natural Sciences, lead institute of the International Papaya Genome Consortium. Papaya is the second largest crop in Hawaii, which was almost wiped out by the papaya ringspot virus a decade ago. The researchers claim that genetically modifying the papaya to create a ringspot-resistant transgenic or "hermaphrodite" variety such as the SunUp saved the papaya from extinction on the islands. The Papaya Genome Project mapped papaya genes to improve the efficiency of agricultural cultivation, and to discover new applications for one of the most important edible fruit crops of tropical and subtropical regions worldwide. Papaya is also used in a wide range of medical, biotechnological and cosmetic applications that can be further developed now that a complete picture of the genomic sequence has been drawn. The Hawaii Papaya Genome Project is a multi-institutional bioinformatics project. In Hawaii, participants include the UH Maui High Performance Computing Centre (MHPCC), Hawaii Agricultural Research Centre (HARC), Hawaii Papaya Industry Association, USDA Pacific Basin Agricultural Research Centre, Pacific Telehealth and Technology Hui, and Hawaii Biotech. Nankai University in China is a major collaborator, as is University of Illinois and Maryland Biotechnology Institute. When the project began over two years ago, Dr. Alam, who heads the multi-institution project housed at the UH MHPCC, looked for a powerful high performance computer that their budget would allow, which is capable of running for weeks without stopping during the 2-year-plus project. He also looked for a scalable backend-distributed processing solution with open API's to develop Web-based interfaces, which allowed the development of custom processes, such as pre-loading standard databases into memory for faster searches. Another major consideration was the project's potential generation of tremendous amount of data. A typical papaya base pairs assembly run can use as much as 40GB of memory and 150GB of disk space. Future plans of processing even larger genome sequencing projects will also require very large, highly scalable disk storage. The research team used an SGI Altix 350 computer system, SGI InfiniteStorage TP9300 Fibre Channel storage, which has been upgraded over the course of the project to over 16TB, and PipelineFX Qube! render management and batch-queuing software for the Project. The complete system, purchased over two years ago, was integrated and sold through SGI Channel Partner CORESystems Hawaii. The SGI Altix system, with 32 CPUs of processing power, gave the team high throughput to their computing needs, allowing them to run highly multi-threaded applications concurrently. PipelineFX Qube! (used widely in digital media for game build engines and animation rendering) was uniquely set up as the Centre's backend software environment. In addition, with its capability of handling distribution tasks, PipelineFX Qube! also managed the interactive Web site, Gbrowse, for national and international collaborators on the project. "Because the SGI system architecture allows very large memory to be shared across all the processors, the Centre personnel were able to write their own software that, every night, downloaded all the versions of the genomic databases, and the most common were uploaded into the Altix system's shared memory for immediate access," said Deepak Thakkar, Higher Education and Research Solutions Manager, SGI. "SGI Altix systems and scalable storage solutions, with PipelineFX Qube!, made a unique genome analysis tool that got the job done. This is an exciting combination that has many other applications in biosciences." |