|
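The core of the life sciences and biotechnology is the mining, analysis, and application of basic biological information. Starting from basic sequencing, the Human Genome Project (HGP) opened the way to understanding the mysteries of life from the perspective of information. The fields that followed, including proteomics, structural genomics, and pharmacogenomics, likewise require researchers to use bioinformatics to acquire, analyze, and mine data. Although biological structure and function are studied with different methods and from different angles, information mining always comes first; systematic analysis, comparison, and functional studies all serve to apply advanced technology more effectively and to understand the mysteries of life. BGI (华大基因研究中心) is one of the earliest institutions in China to work on genomics and bioinformatics. In August 2001 it completed its assigned part of the Human Genome Project (the China chapter). In October of the same year it independently completed the "working draft" of the rice genome, built the accompanying database, and released both worldwide. This was a major breakthrough for the life sciences in China, and the international scientific community hailed it as a "milestone" contribution. President Jiang Zemin also named both results among the major achievements in the history of China's life sciences at the General Assembly of the Academicians of the Chinese Academy of Sciences and the Chinese Academy of Engineering.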
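1% Human Genome Database (click to enter) In August 2001, BGI (华大基因研究中心) completed its assigned part of the Human Genome Project (the China chapter). The 1% Human Genome Database contains all data for the HGP Beijing Region, 80 GB in total.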
Super Hybrid Rice Genome Database (click to enter) Rice is one of the most important crops in the world and provides the main source of energy for more than half of the world's population. The estimated physical size of the rice genome is about 430 Mb, the smallest among all the cereal crops. It corresponds to one seventh of the human genome, whose working draft has been established, and is 3.5 times the size of the Arabidopsis genome. The well-established protocols for relatively high-efficiency genetic transformation, the high-density genetic and physical maps, and the high degree of synteny among genes in cereal genomes all make rice an ideal model organism for studies on the physiology, developmental biology, molecular genetics, evolution, and genomics of plants, especially of the grass family. Essential biological information from the rice genome will undoubtedly improve our understanding of the basic genomics and genetics of other related and economically significant crops: not only wheat, corn, sorghum, and other members of the grass family, but also dicot crops such as soybean and cotton. The initiation of the Human Genome Project (HGP) at the beginning of the 1990s and the completion of the human genome working drafts at the beginning of this century have not only laid the groundwork for genomics and opened a new era for life science research, but have also set an unprecedented example for genomics studies on many other organisms. The HGP has developed strategies, technologies, definitions, and standards for different stages of sequence assembly and analysis, such as "working draft", "draft sequences", and "complete map", which are broadly applicable to other organisms. Inspired by the Human Genome Project, the International Rice Genome Project Consortium, headed by Japan, has released 174.4 Mb of BAC/PAC-based non-redundant sequences since 1997, including the complete sequence of a single chromosome (Chr. 1).
Monsanto and Syngenta, two private companies, independently announced the establishment of a "working draft" in April 2000 and February 2001, respectively, but neither has made its sequence data completely available to the public. All three of the projects mentioned above used the subspecies japonica (Nipponbare) as target material, despite the fact that another subspecies, indica, is predominantly planted in Asia and other regions of the world and has provided the template for the unique hybrid rice strain that has greatly contributed to solving the food supply problem in China. Here we report a draft assembly of the genome of 93-11, a cultivar of Oryza sativa ssp. indica, the major food crop in China. The contigs and draft sequences are being made freely available to the public in order to provide important information for understanding the rice genome and its genes at the molecular level. These data will lay the foundation for a complete map of the rice genome, which is our ultimate goal.
Thermophilic Bacterium Genome Database (click to enter) Thermoanaerobacter tengcongensis, isolated from a Chinese hot spring, is a novel rod-shaped, Gram-negative, anaerobic, extremely thermophilic bacterium. The complete genome sequence of T. tengcongensis clone MB4T consists of a 2,689,441 base pair (bp) circular chromosome with a GC content of 37.6%. It contains 2,808 predicted coding regions, 1,481 (52.7%) of which have functional assignments and 1,327 (47.3%) of which are of unknown function. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source. Genome analysis also reveals that 30 genes have homologs only in the genomes of other thermophilic Eubacteria and Archaea. Of the Eubacteria sequenced to date, T. tengcongensis has the highest percentage (59.8%) of genes that are most similar to those of Bacillus halodurans.
Human Gene Alternative Splicing Database (click to enter) A database of information on the alternative splicing of human genes.
|