High statistics block entropy measures of DNA sequences

P Lio, A Politi, M Buiatti, S Ruffo

Research output: Contribution to journal › Article

24 Citations (Scopus)

Abstract

We have used an improved block-entropy measure to gain further insight into the short-range correlations present in whole chromosomes of S. cerevisiae, viruses and organelles, and in very large genomic regions of E. coli. Although DNA sequences are largely inhomogeneous and word frequencies are unevenly distributed, the comparison of entire chromosomes and large genomic regions shows a "bulk" composition homogeneity. This property suggests that biases in selection, directional mutational pressure and recombination processes act to homogenize the base composition of the DNA molecules within a genome, but their mode of action, relative impact and direction may vary in different organisms. The most interesting results appear to be the differences between the SW (C,G/A,T) and RY (A,G/C,T) two-letter alphabet entropies. Deviations from randomness in E. coli and S. cerevisiae sequences particularly concern SW dinucleotide frequencies and RY tetranucleotide frequencies. (C) 1996 Academic Press Limited
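
As an illustration of the quantities the abstract refers to, the following is a minimal sketch of a block (word) entropy computed over the SW and RY two-letter recodings. It uses the naive plug-in Shannon estimator on overlapping words, which is an assumption made here for illustration; it is not the improved estimator used in the paper, whose details the abstract does not give. The function names `recode` and `block_entropy` are hypothetical.

```python
from collections import Counter
from math import log2

# Two-letter alphabets named in the abstract:
# SW groups strong (C, G) versus weak (A, T) bases,
# RY groups purines (A, G) versus pyrimidines (C, T).
SW = {"C": "S", "G": "S", "A": "W", "T": "W"}
RY = {"A": "R", "G": "R", "C": "Y", "T": "Y"}


def recode(seq, mapping):
    """Map a DNA string onto a two-letter alphabet, skipping unknown symbols."""
    return "".join(mapping[base] for base in seq.upper() if base in mapping)


def block_entropy(seq, n):
    """Plug-in Shannon entropy, in bits, of overlapping words of length n.

    Illustrative assumption: raw word frequencies are used as probabilities,
    with no finite-sample correction.
    """
    total = len(seq) - n + 1
    if n < 1 or total < 1:
        raise ValueError("sequence is shorter than the word length")
    counts = Counter(seq[i:i + n] for i in range(total))
    return -sum((c / total) * log2(c / total) for c in counts.values())


if __name__ == "__main__":
    toy = "ACGTACGGTTAACCGT"  # toy sequence, not data from the paper
    for name, mapping in (("SW", SW), ("RY", RY)):
        coded = recode(toy, mapping)
        print(name, coded, block_entropy(coded, 2), block_entropy(coded, 4))
```

For a two-letter alphabet the entropy of words of length n is at most n bits, so a value well below n for long sequences would indicate a departure from randomness at that word length. The plug-in estimate is biased downwards when the sequence is short relative to the number of possible words, which is presumably one reason a high-statistics approach is needed.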

Original language: English
Pages (from-to): 151-160
Number of pages: 10
Journal: Journal of Theoretical Biology
Volume: 180
Issue number: 2
Publication status: Published - 21 May 1996

Keywords

  • NUCLEOTIDE-SEQUENCES
  • CODON USAGE
  • GENOME
  • MUTATION
  • RECOMBINATION
  • CONSTRAINTS
  • STABILITY
  • EVOLUTION
  • RNA

Cite this

High statistics block entropy measures of DNA sequences. / Lio, P; Politi, A; Buiatti, M; Ruffo, S.

In: Journal of Theoretical Biology, Vol. 180, No. 2, 21.05.1996, p. 151-160.


@article{6fc8c3e8c9d04e10b4b2f9dcd4a29214,
title = "High statistics block entropy measures of DNA sequences",
abstract = "We have used an improved block-entropy measure to gain further insight into the short-range correlations present in whole chromosomes of S. cerevisiae, viruses and organelles, and in very large genomic regions of E. coli. Although DNA sequences are largely inhomogeneous and word frequencies are unevenly distributed, the comparison of entire chromosomes and large genomic regions shows a ``bulk'' composition homogeneity. This property suggests that biases in selection, directional mutational pressure and recombination processes act to homogenize the base composition of the DNA molecules within a genome, but their mode of action, relative impact and direction may vary in different organisms. The most interesting results appear to be the differences between the SW (C,G/A,T) and RY (A,G/C,T) two-letter alphabet entropies. Deviations from randomness in E. coli and S. cerevisiae sequences particularly concern SW dinucleotide frequencies and RY tetranucleotide frequencies. (C) 1996 Academic Press Limited",
keywords = "NUCLEOTIDE-SEQUENCES, CODON USAGE, GENOME, MUTATION, RECOMBINATION, CONSTRAINTS, STABILITY, EVOLUTION, RNA",
author = "P Lio and A Politi and M Buiatti and S Ruffo",
year = "1996",
month = "5",
day = "21",
language = "English",
volume = "180",
pages = "151--160",
journal = "Journal of Theoretical Biology",
issn = "0022-5193",
publisher = "Academic Press Inc.",
number = "2",
}

TY - JOUR
T1 - High statistics block entropy measures of DNA sequences
AU - Lio, P
AU - Politi, A
AU - Buiatti, M
AU - Ruffo, S
PY - 1996/5/21
Y1 - 1996/5/21
AB - We have used an improved block-entropy measure to gain further insight into the short-range correlations present in whole chromosomes of S. cerevisiae, viruses and organelles, and in very large genomic regions of E. coli. Although DNA sequences are largely inhomogeneous and word frequencies are unevenly distributed, the comparison of entire chromosomes and large genomic regions shows a "bulk" composition homogeneity. This property suggests that biases in selection, directional mutational pressure and recombination processes act to homogenize the base composition of the DNA molecules within a genome, but their mode of action, relative impact and direction may vary in different organisms. The most interesting results appear to be the differences between the SW (C,G/A,T) and RY (A,G/C,T) two-letter alphabet entropies. Deviations from randomness in E. coli and S. cerevisiae sequences particularly concern SW dinucleotide frequencies and RY tetranucleotide frequencies. (C) 1996 Academic Press Limited
KW - NUCLEOTIDE-SEQUENCES
KW - CODON USAGE
KW - GENOME
KW - MUTATION
KW - RECOMBINATION
KW - CONSTRAINTS
KW - STABILITY
KW - EVOLUTION
KW - RNA
M3 - Article
VL - 180
SP - 151
EP - 160
JO - Journal of Theoretical Biology
JF - Journal of Theoretical Biology
SN - 0022-5193
IS - 2
ER -