02779nam a2200265 a 450000100080000000500110000800800410001910000220006024500340008226002020011652019550031865300120227370000200228570000140230570000160231970000180233570000210235370000150237470000160238970000210240570000220242670000210244870000220246970000220249118744172011-06-17 2010 bl uuuu u00u1 u #d1 aNASCIMENTO, L. C. aA brazilian soybean database. aIn: INTERNATIONAL CONFERENCE OF THE BRAZILIAN ASSOCIATION FOR BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 6., 2010, Ouro Preto. Abstracts book... Ouro Preto: AB3C, 2010. p. 120. X-Meeting 2010.c2010 aSoybean is a legume of great economic importance in the international market, with a world production of almost two hundred and ten million tons in the 2008/2009 harvest. Brazil is the second largest producer, with about twenty-five percent of world production. In 2007, the Brazilian Soybean Genome Consortium (GENOSOJA) was established with the goal of integrating several institutions currently working on soybean genomics in Brazil. The project searches for new traits to improve the soybean production process, with emphasis on stresses that affect national production, such as droughts, pest attacks and Asian rust disease. Among the objectives of GENOSOJA is the creation of a relational database integrating the results obtained by the different methodologies used in the project. 
In the GENOSOJA context, we created a Brazilian soybean database, integrating: (1) public data consisting of the genome and predicted genes from JGI, an assembly of 1,276,813 NCBI ESTs from several cultivars and 4,712 full-length cDNA sequences from one Japanese cultivar; and (2) private data consisting of (i) three cDNA libraries explored by the SuperSAGE methodology, resulting in 4,373,053 tags of 26 bp, (ii) 22 subtractive cDNA libraries from several Brazilian cultivars under different stresses and (iii) several libraries of soybean microRNAs under eight conditions, with sizes between 19 and 24 bp. All these data were sequenced using Solexa/Illumina sequencing technology. The database offers users several features, including keyword searches, statistical comparisons, automatic annotation, gene ontology classification and expression of genes under certain conditions. All data are stored on a Fedora Linux machine running the MySQL database server. The web interface is based on a combination of CGI scripts written in Perl (including the BioPerl module) and the Apache Web Server. aSoybean1 aCOSTA, G. G. L.1 aMEYER, L.1 aBINNECK, E.1 aRODRIGUES, F.1 aKULCHESKI, F. R.1 aMARGIS, R.1 aKIDO, E. A.1 aMARCELINO, F. C.1 aNEPOMUCENO, A. L.1 aABDELNOOR, R. V.1 aPEREIRA, G. A. G.1 aCARAZZOLLE, M. F.