Reference: Liu G, et al. (2025) CG-Based Stratification of 8-mers Highlights Functional Roles and Phylogenetic Divergence Markers. Int J Mol Sci 26(19)

Abstract


K-mer analysis is a powerful tool for understanding genome structure and evolution. A "k-mer" refers to a short DNA sequence made up of k nucleotides (where k is a specific integer), while an "m-mer" is a similar concept but with a shorter sequence length. The functional mechanisms of CG-containing k-mers, as well as their potential role in evolutionary processes, remain unclear. To explore this issue, we analyzed 8-mers in several species with varying genomic complexities and evolutionary divergences: Homo sapiens, Saccharomyces cerevisiae, Bombyx mori, Ciona intestinalis, Danio rerio, and Caenorhabditis elegans. The 8-mers were grouped by CG dinucleotide content (0CG, 1CG, and 2CG). We examined the relative frequencies of shorter m-mers (with m = 3 and 4) within each CG-defined group, using information-theoretic, distance-based, and angular metrics. Our results show that 0CG motifs follow random patterns, while 1CG and 2CG motifs display significant deviations, likely due to functional constraints such as nucleosome binding and CpG island association. The observed unimodal distribution of 8-mers arises from the convergence of the three CG-defined groups. Among them, the 2CG group shows the highest divergence in m-mer composition, followed by 1CG, reflecting varying degrees of selective pressure. Furthermore, species-specific differences in CG-classified 8-mer patterns could provide valuable insights into phylogenetic relationships. Through extensive comparison, we explore how CG content and sequence composition influence genomic organization and contribute to evolutionary divergence across different taxa. These findings deepen our understanding of short motif functions, genome organization, and sequence evolution.
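
The stratification described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. All function names are hypothetical, and the sketch assumes that 8-mers with two or more CG dinucleotides fall into the 2CG group and that 8-mers containing non-ACGT characters are skipped. It uses Kullback-Leibler divergence, Euclidean distance, and the angle between frequency vectors as one plausible example each of an information-theoretic, a distance-based, and an angular metric; the abstract does not name the specific metrics used.

```python
from collections import Counter
from itertools import product
import math

GROUPS = ("0CG", "1CG", "2CG")

def cg_group(kmer):
    """Label a k-mer by its CG dinucleotide count (assumption: 2 or more -> '2CG')."""
    n = sum(1 for i in range(len(kmer) - 1) if kmer[i:i + 2] == "CG")
    return f"{min(n, 2)}CG"

def mmer_frequencies(seq, k=8, m=3):
    """Relative m-mer frequencies inside the k-mers of each CG-defined group.

    Returns a dict mapping group label to a frequency vector ordered
    lexicographically over all 4**m possible m-mers.
    """
    counts = {g: Counter() for g in GROUPS}
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) - set("ACGT"):
            continue  # skip windows with ambiguous bases (assumption)
        group_counts = counts[cg_group(kmer)]
        for j in range(k - m + 1):
            group_counts[kmer[j:j + m]] += 1
    all_mmers = ["".join(p) for p in product("ACGT", repeat=m)]
    freqs = {}
    for g, c in counts.items():
        total = sum(c.values())
        freqs[g] = [c[x] / total if total else 0.0 for x in all_mmers]
    return freqs

def kl_divergence(p, q, eps=1e-12):
    """Kullback-Leibler divergence D(p || q) in bits; zero entries of q are floored at eps."""
    return sum(pi * math.log2(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

def euclidean(p, q):
    """Euclidean distance between two frequency vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def angle_degrees(p, q):
    """Angle between two frequency vectors, in degrees (NaN if either is all zeros)."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    if norm == 0:
        return float("nan")
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))
```

Given a genome sequence string `seq`, calling `freqs = mmer_frequencies(seq, m=4)` and then, for example, `angle_degrees(freqs["0CG"], freqs["2CG"])` would compare the 4-mer composition of two groups. Comparing each group against a uniform vector is one way to probe the "random pattern" claim, though the reference distribution used in the paper is not stated in the abstract.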

Reference Type
Journal Article
Authors
Liu G, Meng H, Yang Z, Liu G, Xing Y, Xiao N
