|
|
| Acesso ao texto completo restrito à biblioteca da Embrapa Agricultura Digital. Para informações adicionais entre em contato com cnptia.biblioteca@embrapa.br. |
Registro Completo |
Biblioteca(s): |
Embrapa Agricultura Digital. |
Data corrente: |
17/11/2011 |
Data da última atualização: |
24/01/2020 |
Tipo da produção científica: |
Resumo em Anais de Congresso |
Autoria: |
VIEIRA, F. D.; PAIVA, S. R.; YAMAGISHI, M. E. B.; OLIVEIRA, S. R. de M.; HIGA, R. |
Afiliação: |
FÁBIO DANILO VIEIRA, CNPTIA; SAMUEL REZENDE PAIVA, CENARGEN; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; ROBERTO HIROSHI HIGA, CNPTIA. |
Título: |
Databases & data integration text mining & information extraction. |
Ano de publicação: |
2011 |
Fonte/Imprenta: |
In: INTERNATIONAL CONFERENCE OF THE BRAZILIAN ASSOCIATION FOR BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 7.; INTERNATIONAL CONFERENCE OF THE IBEROAMERICAN SOCIETY FOR BIOINFORMATICS, 3., 2011, Florianópolis. Proceedings... Florianópolis: Associação Brasileira de Bioinformática e Biologia Computacional, 2011. |
Páginas: |
Não paginado. |
Idioma: |
Inglês |
Notas: |
X-MEETING, 2011. |
Conteúdo: |
In recent years, with the joint development of biotechnology and bioinformatics, applications that use DNA analysis in the animal breeding area have spread out. In particular, applications that make use of molecular markers of type SNP are among the most important ones. With the recent advances achieved by the DNA sequencing technology, it is possible to identify hundred of thousands of SNPs variants per animal. In this work, we propose a methodology to identify in a dataset a set of SNPs whose allele frequencies certify each of the sheep breeds (Creole, Santa Ines and Morada Nova). We used a dataset of SNP markers provided by the Sheep Genome Consortium International, obtained through the Network of Animal Genomics Embrapa. We applied data mining techniques, especially attribute selection methods and algorithms for generating association rules. The first step was to make the selection of attributes, due to the high number of SNP markers, with almost 60,000 markers for each animal. Subsequently, we applied the Apriori algorithm in order to generate association rules with the purpose of obtaining SNPs whose allelic values could determine the breed that each animal belongs to. The results showed that, with minimal support of 15% and minimum confidence of 90%, some SNPs have values that appear only in a particular breed. The best rules appear at the top of the list due to high values of support and confidence achieved. It was observed that the top 35 associations rules are related to Creole, with support of 30% and confidence of 100% . The first rule indicates that one of the selected markers with a certain a homozigous allele characterizes the breed is Creole, appearing in 22 records (30% of the total, which has 72 records) and 100% confidence, i.e., in 22 times that the marker came up with this value, the breed was related to Creole. The results need to be better evaluated and validated by experts, but they seem to be of great importance for future work. The methodology proposed in this work could be used to support breeding programs for sheep in progeny tests. References MACQUEEN, J. B. (1967). Some Methods for classification and Analysis of Multivariate Observations. MenosIn recent years, with the joint development of biotechnology and bioinformatics, applications that use DNA analysis in the animal breeding area have spread out. In particular, applications that make use of molecular markers of type SNP are among the most important ones. With the recent advances achieved by the DNA sequencing technology, it is possible to identify hundred of thousands of SNPs variants per animal. In this work, we propose a methodology to identify in a dataset a set of SNPs whose allele frequencies certify each of the sheep breeds (Creole, Santa Ines and Morada Nova). We used a dataset of SNP markers provided by the Sheep Genome Consortium International, obtained through the Network of Animal Genomics Embrapa. We applied data mining techniques, especially attribute selection methods and algorithms for generating association rules. The first step was to make the selection of attributes, due to the high number of SNP markers, with almost 60,000 markers for each animal. Subsequently, we applied the Apriori algorithm in order to generate association rules with the purpose of obtaining SNPs whose allelic values could determine the breed that each animal belongs to. The results showed that, with minimal support of 15% and minimum confidence of 90%, some SNPs have values that appear only in a particular breed. The best rules appear at the top of the list due to high values of support and confidence achieved. It was observed that the top 35 associations rules are rela... Mostrar Tudo |
Palavras-Chave: |
Marcadores moleculares SNP; Mineração de texto; Recuperação da informação; Text mining. |
Thesagro: |
Base de Dados. |
Thesaurus Nal: |
Databases; Information retrieval. |
Categoria do assunto: |
X Pesquisa, Tecnologia e Engenharia |
Marc: |
LEADER 03250nam a2200265 a 4500 001 1906172 005 2020-01-24 008 2011 bl uuuu u00u1 u #d 100 1 $aVIEIRA, F. D. 245 $aDatabases & data integration text mining & information extraction.$h[electronic resource] 260 $aIn: INTERNATIONAL CONFERENCE OF THE BRAZILIAN ASSOCIATION FOR BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 7.; INTERNATIONAL CONFERENCE OF THE IBEROAMERICAN SOCIETY FOR BIOINFORMATICS, 3., 2011, Florianópolis. Proceedings... Florianópolis: Associação Brasileira de Bioinformática e Biologia Computacional$c2011 300 $aNão paginado. 500 $aX-MEETING, 2011. 520 $aIn recent years, with the joint development of biotechnology and bioinformatics, applications that use DNA analysis in the animal breeding area have spread out. In particular, applications that make use of molecular markers of type SNP are among the most important ones. With the recent advances achieved by the DNA sequencing technology, it is possible to identify hundred of thousands of SNPs variants per animal. In this work, we propose a methodology to identify in a dataset a set of SNPs whose allele frequencies certify each of the sheep breeds (Creole, Santa Ines and Morada Nova). We used a dataset of SNP markers provided by the Sheep Genome Consortium International, obtained through the Network of Animal Genomics Embrapa. We applied data mining techniques, especially attribute selection methods and algorithms for generating association rules. The first step was to make the selection of attributes, due to the high number of SNP markers, with almost 60,000 markers for each animal. Subsequently, we applied the Apriori algorithm in order to generate association rules with the purpose of obtaining SNPs whose allelic values could determine the breed that each animal belongs to. The results showed that, with minimal support of 15% and minimum confidence of 90%, some SNPs have values that appear only in a particular breed. The best rules appear at the top of the list due to high values of support and confidence achieved. It was observed that the top 35 associations rules are related to Creole, with support of 30% and confidence of 100% . The first rule indicates that one of the selected markers with a certain a homozigous allele characterizes the breed is Creole, appearing in 22 records (30% of the total, which has 72 records) and 100% confidence, i.e., in 22 times that the marker came up with this value, the breed was related to Creole. The results need to be better evaluated and validated by experts, but they seem to be of great importance for future work. The methodology proposed in this work could be used to support breeding programs for sheep in progeny tests. References MACQUEEN, J. B. (1967). Some Methods for classification and Analysis of Multivariate Observations. 650 $aDatabases 650 $aInformation retrieval 650 $aBase de Dados 653 $aMarcadores moleculares SNP 653 $aMineração de texto 653 $aRecuperação da informação 653 $aText mining 700 1 $aPAIVA, S. R. 700 1 $aYAMAGISHI, M. E. B. 700 1 $aOLIVEIRA, S. R. de M. 700 1 $aHIGA, R.
Download
Esconder MarcMostrar Marc Completo |
Registro original: |
Embrapa Agricultura Digital (CNPTIA) |
|
Biblioteca |
ID |
Origem |
Tipo/Formato |
Classificação |
Cutter |
Registro |
Volume |
Status |
URL |
Voltar
|
|
| Acesso ao texto completo restrito à biblioteca da Embrapa Amazônia Ocidental. Para informações adicionais entre em contato com cpaa.biblioteca@embrapa.br. |
Registro Completo
Biblioteca(s): |
Embrapa Amazônia Ocidental. |
Data corrente: |
19/02/2010 |
Data da última atualização: |
01/02/2019 |
Tipo da produção científica: |
Artigo em Periódico Indexado |
Circulação/Nível: |
B - 5 |
Autoria: |
TEULÉ, F.; COOPER, A. R.; FURIN, W. A.; BITTENCOURT, D. M. de C.; RECH, E. L.; BROOKS, A.; LEWIS, R. V. |
Afiliação: |
Florence Teulé, University of Wyoming; Alyssa R. Cooper, University of Wyoming; William A. Furin, University of Wyoming; DANIELA MATIAS DE C BITTENCOURT, CPAA; Elibio L. Rech, CPAFAC; Amanda Brooks, University of Wyoming; Randolph V. Lewis, CENARGEN. |
Título: |
A protocol for the production of recombinant spider silk-like proteins for artificial fiber spinning. |
Ano de publicação: |
2009 |
Fonte/Imprenta: |
Nature Protocols, v. 4, n. 3, p. 341-355, fev. 2009. |
ISSN: |
1754-2189 |
Idioma: |
Inglês |
Conteúdo: |
Experimental design. Spider silk-like synthetic genes. Spider silk gene constrution. Cloning of the repetitive insert in the expression vector. Artificial fiber spinning. |
Palavras-Chave: |
Proteína artificial; Protocolo. |
Categoria do assunto: |
-- |
Marc: |
LEADER 00817naa a2200229 a 4500 001 1673904 005 2019-02-01 008 2009 bl uuuu u00u1 u #d 022 $a1754-2189 100 1 $aTEULÉ, F. 245 $aA protocol for the production of recombinant spider silk-like proteins for artificial fiber spinning. 260 $c2009 520 $aExperimental design. Spider silk-like synthetic genes. Spider silk gene constrution. Cloning of the repetitive insert in the expression vector. Artificial fiber spinning. 653 $aProteína artificial 653 $aProtocolo 700 1 $aCOOPER, A. R. 700 1 $aFURIN, W. A. 700 1 $aBITTENCOURT, D. M. de C. 700 1 $aRECH, E. L. 700 1 $aBROOKS, A. 700 1 $aLEWIS, R. V. 773 $tNature Protocols$gv. 4, n. 3, p. 341-355, fev. 2009.
Download
Esconder MarcMostrar Marc Completo |
Registro original: |
Embrapa Amazônia Ocidental (CPAA) |
|
Biblioteca |
ID |
Origem |
Tipo/Formato |
Classificação |
Cutter |
Registro |
Volume |
Status |
Fechar
|
Expressão de busca inválida. Verifique!!! |
|
|