Agrigenomics Information Database
» Development of improved analysis system
Calculation node (100 nodes) |
Due to rapid progress of analysis equipment, especially next generation sequencer (NGS), huge amount of genomic data has been accumulated. Progress of NGS has occupied important position also in crops and livestock genome research. In order to meet requirement for huge data analysis derived from NGS, appropriate bioinformatic skills are prerequisite.
We have developed a new system to process such massive and complex genome data efficiently. Since researchers frequently face difficulties in performing massive data analysis by themselves, such as lack of high-performance computers or experience of information processing, we are providing a novel analysis system (designated as “Galaxy/NIAS”), which can process massive and complex data easily and effectively.
» About SOGO
Petabyte Disk (×2) |
This project has set two major goals: One is “Development of improved analysis system”; and the other is “Refinement and maintenance of databases”. Huge amount of genomic data of crops and livestock have been accumulated by untiring research for a couple of decades. A lot of databases focusing on linkage map, physical map, gene functional annotation based on whole genome sequencing etc. have been developed in NIAS.
We inherit the data accumulated in these databases, and develop a novel system to store and analyze data generated with advent of next generation sequence (NGS) technology. SOGO is based on “cloud computing” technology, and users can keep all source data, programs, and analysis results on server-side.
The novel system is designed to deal with various forms of data flexibly. Users can save and manage their own files, and set each of those files shared in their collaborators or open to the public. Users can also arrange and share task schedules.
Users can create new projects to perform particular tasks, and set the projects to possess unique functions. A notable feature of SOGO is that any kinds of data, including not only genome data but also source codes to drive system, can be handled.