|
|
Registros recuperados : 1 | |
Registros recuperados : 1 | |
|
|
Registro Completo
Biblioteca(s): |
Embrapa Café. |
Data corrente: |
06/05/2019 |
Data da última atualização: |
06/05/2019 |
Tipo da produção científica: |
Artigo em Periódico Indexado |
Circulação/Nível: |
A - 1 |
Autoria: |
ITO, E. A.; KATAHIRA, I.; VICENTE, F. F. da R.; PEREIRA, L. F. P.; LOPES, F. M. |
Afiliação: |
Eric Augusto Ito, Department of Computer Science, Bioinformatics Graduate Program/Federal University of Technology Paraná; Isaque Katahira, Department of Computer Science, Bioinformatics Graduate Program/Federal University of Technology – Paraná; Fábio Fernandes da Rocha Vicente, Department of Computer Science, Bioinformatics Graduate Program/Federal University of Technology – Paraná; LUIZ FILIPE PROTASIO PEREIRA, CNPCa; Fabrício Martins Lopes, Department of Computer Science, Bioinformatics Graduate Program/Federal University of Technology – Paraná. |
Título: |
BASiNET - Biological Sequences NETwork: a case study on coding and non-coding RNAs identification. |
Ano de publicação: |
2018 |
Fonte/Imprenta: |
Nucleic Acids Research, v. 46, n. 16, p. , 2018 |
Idioma: |
Inglês |
Conteúdo: |
With the emergence of Next Generation Sequencing (NGS) technologies, a large volume of sequence data in particular de novo sequencing was rapidly produced at relatively low costs. In this context, computational tools are increasingly important to assist in the identification of relevant information to understand the functioning of organisms. This work introduces BASiNET, an alignment-free tool for classifying biological sequences based on the feature extraction from complex network measurements. The method initially transform the sequences and represents them as complex networks. Then it extracts topological measures and constructs a feature vector that is used to classify the sequences. The method was evaluated in the classification of coding and non-coding RNAs of 13 species and compared to the CNCI, PLEK and CPC2 methods. BASiNET outperformed all compared methods in all adopted organisms and datasets. BASiNET have classified sequences in all organisms with high accuracy and low standard deviation, showing that the method is robust and non-biased by the organism. The proposed methodology is implemented in open source in R language and freely available for download at https://cran.r-project.org/package=BASiNET. |
Palavras-Chave: |
RNA-seq. |
Thesaurus NAL: |
Cardiovascular diseases; Epigenetics; Neurodegenerative diseases; Nucleotides. |
Categoria do assunto: |
-- |
URL: |
https://ainfo.cnptia.embrapa.br/digital/bitstream/item/196978/1/BASinet-Biological-sequences-NETwork.pdf
|
Marc: |
LEADER 01912naa a2200229 a 4500 001 2108754 005 2019-05-06 008 2018 bl uuuu u00u1 u #d 100 1 $aITO, E. A. 245 $aBASiNET - Biological Sequences NETwork$ba case study on coding and non-coding RNAs identification.$h[electronic resource] 260 $c2018 520 $aWith the emergence of Next Generation Sequencing (NGS) technologies, a large volume of sequence data in particular de novo sequencing was rapidly produced at relatively low costs. In this context, computational tools are increasingly important to assist in the identification of relevant information to understand the functioning of organisms. This work introduces BASiNET, an alignment-free tool for classifying biological sequences based on the feature extraction from complex network measurements. The method initially transform the sequences and represents them as complex networks. Then it extracts topological measures and constructs a feature vector that is used to classify the sequences. The method was evaluated in the classification of coding and non-coding RNAs of 13 species and compared to the CNCI, PLEK and CPC2 methods. BASiNET outperformed all compared methods in all adopted organisms and datasets. BASiNET have classified sequences in all organisms with high accuracy and low standard deviation, showing that the method is robust and non-biased by the organism. The proposed methodology is implemented in open source in R language and freely available for download at https://cran.r-project.org/package=BASiNET. 650 $aCardiovascular diseases 650 $aEpigenetics 650 $aNeurodegenerative diseases 650 $aNucleotides 653 $aRNA-seq 700 1 $aKATAHIRA, I. 700 1 $aVICENTE, F. F. da R. 700 1 $aPEREIRA, L. F. P. 700 1 $aLOPES, F. M. 773 $tNucleic Acids Research$gv. 46, n. 16, p. , 2018
Download
Esconder MarcMostrar Marc Completo |
Registro original: |
Embrapa Café (CNPCa) |
|
Biblioteca |
ID |
Origem |
Tipo/Formato |
Classificação |
Cutter |
Registro |
Volume |
Status |
Fechar
|
Nenhum registro encontrado para a expressão de busca informada. |
|
|