Table of Content

Open Access iconOpen Access

ARTICLE

crossmark

Discrete Circular Distributions with Applications to Shared Orthologs of Paired Circular Genomes

Tomoaki Imoto1, *, Grace S. Shieh2, *, Kunio Shimizu3

1 School of Management and Information, University of Shizuoka, Shizuoka, Japan.
2 Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.
3 School of Statistical Thinking, The Institute of Statistical Mathematics, Tokyo, Japan.

* Corresponding Authors: Tomoaki Imoto. Email: email;
  Grace S. Shieh. Email: email.

(This article belongs to the Special Issue: Data Science and Modeling in Biology, Health, and Medicine)

Computer Modeling in Engineering & Sciences 2020, 123(3), 1131-1149. https://doi.org/10.32604/cmes.2020.08466

Abstract

For structural comparisons of paired prokaryotic genomes, an important topic in synthetic and evolutionary biology, the locations of shared orthologous genes (henceforth orthologs) are observed as binned data. This and other data, e.g., wind directions recorded at monitoring sites and intensive care unit arrival times on the 24-hour clock, are counted in binned circular arcs, thus modeling them by discrete circular distributions (DCDs) is required. We propose a novel method to construct a DCD from a base continuous circular distribution (CCD). The probability mass function is defined to take the normalized values of the probability density function at some pre-fixed equidistant points on the circle. Five families of constructed DCDs which have normalizing constants in closed form are presented. Simulation studies show that DCDs outperform the corresponding CCDs in modeling grouped (discrete) circular data, and minimum chi-square estimation outperforms maximum likelihood estimation for parameters. We apply the constructed DCDs, invariant wrapped Poisson and wrapped discrete skew Laplace to compare the structures of paired bacterial genomes. Specifically, discrete four-parameter wrapped Cauchy (nonnegative trigonometric sums) distribution models multi-modal shared orthologs in Clostridium (Sulfolobus) better than the others considered, in terms of AIC and Freedman’s goodness-of-fit test. The result that different DCDs fit the shared orthologs is consistent with the fact they belong to two kingdoms. Nevertheless, these prokaryotes have a common favored site around 70° on the unit circle; this finding is important for building synthetic prokaryotic genomes in synthetic biology. These DCDs can also be applied to other binned circular data.

Keywords


Cite This Article

APA Style
Imoto, T., Shieh, G.S., Shimizu, K. (2020). Discrete circular distributions with applications to shared orthologs of paired circular genomes. Computer Modeling in Engineering & Sciences, 123(3), 1131-1149. https://doi.org/10.32604/cmes.2020.08466
Vancouver Style
Imoto T, Shieh GS, Shimizu K. Discrete circular distributions with applications to shared orthologs of paired circular genomes. Comput Model Eng Sci. 2020;123(3):1131-1149 https://doi.org/10.32604/cmes.2020.08466
IEEE Style
T. Imoto, G.S. Shieh, and K. Shimizu, “Discrete Circular Distributions with Applications to Shared Orthologs of Paired Circular Genomes,” Comput. Model. Eng. Sci., vol. 123, no. 3, pp. 1131-1149, 2020. https://doi.org/10.32604/cmes.2020.08466



cc Copyright © 2020 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 3490

    View

  • 2158

    Download

  • 0

    Like

Share Link