2025 Oct 25.
doi: 10.1021/acs.jctc.5c01329. Online ahead of print.

Sequence-Dependent Conformational Landscapes of Intrinsically Disordered Proteins Reveal Asymmetric Chain Compaction

Cong Wang et al. J Chem Theory Comput.

Abstract

Intrinsically disordered proteins (IDPs) exhibit highly dynamic and heterogeneous conformational ensembles that are strongly influenced by sequence features. While global properties such as chain compaction and scaling behavior have been widely studied, they often fail to resolve the fine-grained, sequence-specific structural variation that underlies IDP function. Here, we perform long-timescale atomistic simulations of 47 representative IDP sequences from the yeast proteome to systematically investigate the relationship between sequence composition and conformational ensemble. To analyze the high-dimensional structural data, we apply uniform manifold approximation and projection (UMAP), a nonlinear dimensionality reduction technique that preserves local structural relationships. The resulting low-dimensional embeddings effectively differentiate IDP ensembles and reveal a novel descriptor, local compactness asymmetry, that quantifies directional differences in chain organization. This metric, denoted γRg, captures conformational features orthogonal to traditional global measures such as radius of gyration and end-to-end distance. We show that γRg correlates with sequence-level asymmetries in charge and hydropathy, and that conformational dynamics preferentially occur in the more extended region of the chain. The simulation data set generated in this work also provides a valuable resource for training machine learning models and developing improved coarse-grained force fields for disordered proteins.
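
The abstract does not give the formula for γRg, so the following is only an illustrative sketch of what a compactness-asymmetry measure could look like, not the authors' definition. It assumes a hypothetical normalized difference between the radii of gyration of the N-terminal and C-terminal halves of a chain; the function names, the half-chain split, and the toy coordinates are all assumptions made for illustration.

```python
import math


def radius_of_gyration(coords):
    """Unweighted radius of gyration of a list of (x, y, z) points."""
    n = len(coords)
    centroid = tuple(sum(p[i] for p in coords) / n for i in range(3))
    mean_sq = sum(math.dist(p, centroid) ** 2 for p in coords) / n
    return math.sqrt(mean_sq)


def end_to_end_distance(coords):
    """Distance between the first and last point of the chain."""
    return math.dist(coords[0], coords[-1])


def half_chain_asymmetry(coords):
    """Hypothetical asymmetry score (NOT the paper's gamma_Rg definition).

    Returns (Rg_N - Rg_C) / (Rg_N + Rg_C), where Rg_N and Rg_C are the
    radii of gyration of the first and second half of the chain.
    Positive values mean the N-terminal half is the more extended one.
    """
    if len(coords) < 4:
        raise ValueError("need at least two points per half")
    mid = len(coords) // 2
    rg_n = radius_of_gyration(coords[:mid])
    rg_c = radius_of_gyration(coords[mid:])
    return (rg_n - rg_c) / (rg_n + rg_c)


# Toy chain (made-up coordinates): extended first half, compact second half.
toy_chain = [
    (0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0),
    (12.0, 0.0, 0.0), (12.0, 1.0, 0.0), (13.0, 1.0, 0.0), (13.0, 0.0, 0.0),
]

asym = half_chain_asymmetry(toy_chain)
asym_reversed = half_chain_asymmetry(toy_chain[::-1])
rg_total = radius_of_gyration(toy_chain)
rg_total_reversed = radius_of_gyration(toy_chain[::-1])
```

In this toy case, reversing the chain flips the sign of the asymmetry score while the whole-chain radius of gyration and end-to-end distance stay the same, which illustrates how a directional descriptor can carry information that global measures do not.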
