With advances in genomic sequencing technology, a large amount of data is publicly available for the research community to extract meaningful and reliable associations among risk genes and the mechanisms of disease. However, this exponential growth of data is spread in over thousand heterogeneous repositories, represented in multiple formats and with different levels of quality. As a result, its management has become a challenge that hinders the differentiation of clinically valid relationships from those that are less well-sustained.
The PROS Researh Center has developed a systematic framework to efficiently manage genomic data that is accessible, informative and reliable enough to extract valuable knowledge. This is what we call Smart Genomic Data.
SILE provides support to the four main stages that are the core for the management of Smart Genomic Information: the selection of the adequate data sources, the identification of the relevant information, the storage with the most efficient technology and the extraction of the underlying knowledge.
Where can the required information be found?
How can the Smart Data be identified?
Where can the information be stored?