Français
Home
Products and services
Decrypthon
Our company
Our team
Employment
Links
Contacts
Genomining vcard

Décrypthon

FAQ

Proteome mapping
The scientific challenge of the Décrypthon project consists of exhaustive comparison of all known proteins in the realm of living organisms (proteomes). The database created from the results obtained has be made available to the international scientific community and constitutes a unique tool to accelerate the understanding of many diseases, including genetic diseases, and the development of treatments derived from our knowledge of the genome. This project is part of the new field of proteomics that has been opened by the sequencing of the human genome.
A scientific challenge which requires the mobilization of human resources and a technological exploit. This research is designed to ensure "global" comparison of proteins: each protein has been compared with all other proteins. These comparisons have been performed by using the Smith-Waterman algorithm on a network architecture representing a virtual calculation power of more than 40 Teraflops. Genomining has developed specific software for this purpose: SWC.
or this research, scientists have taken advantage of the power of 75,000 computers each contributing about 200 hours of calculation. If only a single standard personal computer had been used, the calculation would take about 625,000 days, i.e. about 1,710 years.

550,000 proteins compared and classified
This project was designed to compare 550,000 known proteins derived from various species, from unicellular organisms, such as bacteria, to vertebrates (including mice and man), while also including multicellular organisms, such as Drosophila (fruit fly), nematode (a worm) or a plant (Arabidospis thaliana). This research used the protein sequences listed in several databases, including Swiss-prot, IPI, TrEMBL, Refseq, Ensembl.
Calculations have compared the sequences of these proteins in order to classify them into families of proteins homologous from one species to another. The database obtained can be used to determine correlations between this classification and the known three-dimensional structures of some of these proteins. Following mapping of the genome, this new step corresponds to the mapping of the proteome. The unprecedented dimensions of this new database allow scientists to work more easily and more rapidly on these complex proteins that are essential components of our body.
Up until now, due to lack of the necessary technological infrastructure to conduct such an exhaustive analysis, scientists have only been able to compare the proteins of a limited number of species or compare proteins within the same species.

A database at the service of the management of genetic diseases
As a result of these studies, scientists have a new tool - a database - which enables them to:
- discover the unknown function of a protein by comparing its sequence with that of proteins with known functions in other species of living organisms. This process is called proteome annotation, i.e. description of the biological function of protein sequences. Scientists will therefore be able to advance more rapidly in the understanding of the mechanisms of genetic diseases.
- further their knowledge of protein structures, that is limited at the present time. Proteins are complex molecules that are intricately folded to form countless 3D conformation, like balls of wool whose thread is rolled in multiple ways. At the present time, only a limited number of protein structures have been established experimentally and it is very difficult to predict the 3D conformation of a protein based on its linear code of gene sequences. However, this knowledge is essential, as the structure of proteins determines their function.
- reconstitute the history of protein families in order to more clearly understand the evolution of living organisms.
- correlate genome data with proteome data. The human genome comprises a number of variations between individuals (single nucleotide polymorphisms - SNP). The analysis of SNPs can identify the genes involved in certain diseases. These polymorphisms can now be identified on the genome. By comparing these SNPs with the protein sequence, the variations of our genome associated with certain diseases could be identified more rapidly.

More information on Decrypthon : http://www.infobiogen.fr/services/decrypthon/index.html

 
© Genomining 2003-2007 - info@genomining.com - 4, rue René Barthélémy, 92120 Montrouge, France - tel: +33(0)142310808 - fax: +33(0)142312102